JP7800517B2

JP7800517B2 - Optimization device, optimization method, and computer program

Info

Publication number: JP7800517B2
Application number: JP2023130687A
Authority: JP
Inventors: 明久岡田; 啓介大滝; 広顕吉田
Original assignee: Toyota Central R&D Labs Inc
Current assignee: Toyota Central R&D Labs Inc
Priority date: 2023-08-10
Filing date: 2023-08-10
Publication date: 2026-01-16
Anticipated expiration: 2043-08-10
Also published as: JP2025025669A

Description

本発明は、最適化装置、最適化方法、および、コンピュータプログラムに関する。 The present invention relates to an optimization device, an optimization method, and a computer program.

従来から、組合せ最適化問題を解くための最適化装置が知られている（例えば、非特許文献１）。 Optimization devices for solving combinatorial optimization problems have been known for some time (for example, Non-Patent Document 1).

S. Feld, C. Eoch, T. Gabor, C. Seidal, F. Neukart, I. Galter, W. Mauerer and C. Linnhoff-Popien "A Hybrid Solution Method for the Capacitated Vehicle Routing Problem Using a Quantum Annealer" Frontiers in ICT, VOL 6,2019, [令和５年７月１０日検索]、インターネット＜https://www.frontiersin.org/articles/10.3389/fict.2019.00013/full＞S. Feld, C. Eoch, T. Gabor, C. Seidal, F. Neukart, I. Galter, W. Mauerer and C. Linnhoff-Popien, "A Hybrid Solution Method for the Capacitated Vehicle Routing Problem Using a Quantum Annealer," Frontiers in ICT, VOL 6, 2019, [Retrieved July 10, 2023], Internet: <https://www.frontiersin.org/articles/10.3389/fict.2019.00013/full>

しかしながら、特許文献１のような先行技術によっても、最適化装置において、定式化が困難な制約条件が課せられている組合せ最適化問題について、最適化された組合せを求める技術については、なお、改善の余地があった。 However, even with prior art such as that disclosed in Patent Document 1, there was still room for improvement in the technology used by optimization devices to find optimized combinations for combinatorial optimization problems that involve constraints that are difficult to formulate.

本発明は、上述した課題を解決するためになされたものであり、最適化装置において、定式化が困難な制約条件が課せられている組合せ最適化問題でも最適化された組合せを求めることができる技術を提供することを目的とする。 The present invention was made to solve the above-mentioned problems, and aims to provide technology in an optimization device that can find optimized combinations even for combinatorial optimization problems that have constraints that are difficult to formulate.

本発明は、上述の課題の少なくとも一部を解決するためになされたものであり、以下の形態として実現することが可能である。 The present invention has been made to solve at least some of the above-mentioned problems, and can be realized in the following forms.

（１）本発明の一形態によれば、制約条件が課されている組合せ最適化問題を解くための最適化装置が提供される。この最適化装置は、組合せの基本構成要素に関する基本情報と、前記制約条件に関する制約情報とが入力される入力部と、前記基本情報を用いて、最適化された組合せのデータを取得するデータ取得部と、前記データ取得部が取得する最適化された組合せのデータと前記制約情報とを用いて、前記基本情報を補正する補正部と、前記補正部が補正した前記基本情報を用いて、前記制約条件を学習した新たな基本情報を取得する学習部と、を備え、前記データ取得部は、前記学習部が取得する新たな基本情報を用いて、最適化された組合せのデータを新たに取得し、前記補正部は、前記データ取得部が取得する最適化された組合せの新たなデータと、前記制約情報とを用いて、前記学習部が取得する新たな基本情報を補正し、前記学習部は、前記補正部が補正した新たな基本情報を用いて、前記制約条件をさらに学習した、さらに新たな基本情報を取得する。 (1) One aspect of the present invention provides an optimization device for solving combinatorial optimization problems that are subject to constraints. The optimization device includes an input unit that receives basic information about basic components of combinations and constraint information about the constraints; a data acquisition unit that acquires data about optimized combinations using the basic information; a correction unit that corrects the basic information using the data about the optimized combinations acquired by the data acquisition unit and the constraint information; and a learning unit that acquires new basic information that has learned the constraints using the basic information corrected by the correction unit. The data acquisition unit acquires new data about the optimized combinations using the new basic information acquired by the learning unit, the correction unit uses the new data about the optimized combinations acquired by the data acquisition unit and the constraint information to correct the new basic information acquired by the learning unit, and the learning unit acquires new basic information that has further learned the constraints using the new basic information corrected by the correction unit.

この構成によれば、補正部は、データ取得部が取得する最適化された組合せのデータと制約情報とを用いて、基本情報を補正する。学習部は、補正部が補正した基本情報を用いて、制約条件を学習した新たな基本情報を取得する。データ取得部は、学習部が取得する新たな基本情報を用いて、最適化された組合せのデータを新たに取得する。これにより、定式化が困難であり、基本情報として情報を含めにくい制約条件が課されている組合せ最適化問題であっても、最適化された組合せを求めることができる。また、補正部は、データ取得部が取得する、最適化された組合せの新たなデータと制約情報とを用いて、新たな基本情報を補正し、学習部は、補正部が補正した新たな基本情報を用いて、制約条件をさらに学習した、さらに新たな基本情報を取得する。このように、データ取得部による最適化された組合せのデータの取得と、学習部による制約条件を学習した基本情報の取得とを逐次的に繰り返すことで、制約条件を満たす組合せの中でも、さらに最適化された組合せを求めることができる。したがって、定式化が困難な制約条件が課せられている組合せ最適化問題において最適化された組合せを求めることができる。 With this configuration, the correction unit corrects the basic information using the data of the optimized combination and the constraint information acquired by the data acquisition unit. The learning unit uses the basic information corrected by the correction unit to acquire new basic information that has learned the constraints. The data acquisition unit uses the new basic information acquired by the learning unit to acquire new data of the optimized combination. This makes it possible to find optimized combinations even for combinatorial optimization problems that are difficult to formulate and have constraints that are difficult to include as basic information. The correction unit also corrects the new basic information using the new data of the optimized combination and the constraint information acquired by the data acquisition unit, and the learning unit uses the new basic information corrected by the correction unit to acquire new basic information that has further learned the constraints. In this way, by sequentially repeating the acquisition of data of the optimized combination by the data acquisition unit and the acquisition of basic information that has learned the constraints by the learning unit, it is possible to find even more optimized combinations from among combinations that satisfy the constraints. Therefore, it is possible to find optimized combinations for combinatorial optimization problems that have constraints that are difficult to formulate.

（２）上記形態の最適化装置において、前記補正部は、前記データ取得部が取得する最適化された組合せが、前記制約条件を満たすか否かを判定する条件判定部と、最適化された組合せが前記制約条件を満たしていないと前記条件判定部が判定する場合、前記基本情報を更新する情報更新部と、を含んでもよい。この構成によれば、補正部は、データ取得部が取得する最適化された組合せが制約条件を満たすか否かによって、基本情報を更新する。学習部は、データ取得部が最適化された組合せの新たなデータを取得するため、条件判定部による判定を受けて補正された基本情報を用いて、制約条件を学習した新たな基本情報を取得する。これにより、制約条件を満たす組合せの中でも、さらに最適化された組合せを求めることができる。 (2) In the optimization device of the above aspect, the correction unit may include a condition determination unit that determines whether the optimized combination acquired by the data acquisition unit satisfies the constraint conditions, and an information update unit that updates the basic information when the condition determination unit determines that the optimized combination does not satisfy the constraint conditions. According to this configuration, the correction unit updates the basic information depending on whether the optimized combination acquired by the data acquisition unit satisfies the constraint conditions. The learning unit acquires new basic information that has learned the constraint conditions using the basic information corrected in response to the determination by the condition determination unit, in order for the data acquisition unit to acquire new data for the optimized combination. This makes it possible to determine a more optimized combination from among the combinations that satisfy the constraint conditions.

（３）上記形態の最適化装置において、前記データ取得部は、最適化された組合せの評価値を算出し、前記情報更新部は、最適化された組合せが前記制約条件を満たしていないと前記条件判定部が判定する場合、前記制約条件に対応する罰則値を用いて前記評価値を変更することで、前記基本情報を更新してもよい。この構成によれば、情報更新部は、最適化された組合せが制約条件を満たしていないと条件判定部が判定する場合、制約条件に対応する罰則値を用いて最適化された組合せの評価値を変更するため、新たな基本情報に、制約情報を確実に含めることができる。これにより、制約条件を満たす組合せの中でも、さらに最適化された組合せを求めることができる。 (3) In the optimization device of the above aspect, the data acquisition unit may calculate an evaluation value of the optimized combination, and the information update unit may update the basic information by changing the evaluation value using a penalty value corresponding to the constraint condition when the condition determination unit determines that the optimized combination does not satisfy the constraint condition. With this configuration, when the condition determination unit determines that the optimized combination does not satisfy the constraint condition, the information update unit changes the evaluation value of the optimized combination using a penalty value corresponding to the constraint condition, thereby ensuring that the constraint information is included in the new basic information. This makes it possible to find a more optimized combination from among the combinations that satisfy the constraint condition.

（４）上記形態の最適化装置は、前記評価値に前記罰則値を加算した修正評価値が、あらかじめ設定されている基準点以下であるか否かを判定する評価判定部を備え、前記学習部は、前記修正評価値が前記基準点より大きいと前記評価判定部が判定する場合、前記情報更新部によって更新された新たな基本情報を用いて、前記制約条件を学習した新たな基本情報を取得してもよい。この構成によれば、学習部は、評価値に罰則値を加算した修正評価値が、あらかじめ設定されている基準点以下であるか否かの判定に従って、制約条件を学習した新たな基本情報を取得し、データ取得部は、新たな基本情報を用いて、最適化された組合せの新たなデータを取得する。これにより、最適化が一定の水準となった組合せのデータが取得されると、最適化装置による組合せ最適化問題の解決を終了することができるため、最適化された組合せのデータの取得に要する時間を短くすることができる。 (4) The optimization device of the above aspect may include an evaluation/determination unit that determines whether a corrected evaluation value, obtained by adding the penalty value to the evaluation value, is equal to or less than a predetermined reference point, and the learning unit may acquire new basic information that has learned the constraint conditions using new basic information updated by the information update unit when the evaluation/determination unit determines that the corrected evaluation value is greater than the reference point. According to this configuration, the learning unit acquires new basic information that has learned the constraint conditions in accordance with a determination of whether the corrected evaluation value, obtained by adding the penalty value to the evaluation value, is equal to or less than a predetermined reference point, and the data acquisition unit acquires new data of optimized combinations using the new basic information. As a result, once data of combinations for which optimization has reached a certain level has been acquired, the optimization device can terminate solving the combinatorial optimization problem, thereby shortening the time required to acquire data of optimized combinations.

（５）上記形態の最適化装置において、前記学習部は、二次制約なし二値変数最適化形式となっている、新たな基本情報を取得してもよい。この構成によれば、学習部は、二次制約なし二値変数最適化形式となっている、新たな基本情報を取得する。これにより、最適化された組合せのデータの取得において、シミュレーテッドアニーリングや量子アニーリングなどのイジングマシンを用いることができる。 (5) In the optimization device of the above embodiment, the learning unit may acquire new basic information in a format of binary variable optimization without quadratic constraints. According to this configuration, the learning unit acquires new basic information in a format of binary variable optimization without quadratic constraints. This makes it possible to use Ising machines such as simulated annealing and quantum annealing to acquire data on optimized combinations.

（６）上記形態の最適化装置において、前記データ取得部は、二次制約なし二値変数最適化ソルバーを用いて、最適化された組合せのデータを取得してもよい。この構成によれば、データ取得部は、イジングマシンなどの二次制約なし二値変数最適化ソルバーを用いて、最適化された組合せのデータを取得する。これにより、最適化された組合せのデータの取得に要する時間を短くすることができる。 (6) In the optimization device of the above aspect, the data acquisition unit may acquire data on optimized combinations using a binary variable optimization solver without quadratic constraints. According to this configuration, the data acquisition unit acquires data on optimized combinations using a binary variable optimization solver without quadratic constraints, such as an Ising machine. This reduces the time required to acquire data on optimized combinations.

（７）本発明の別の形態によれば、最適化装置を用いて制約条件が課されている組合せ最適化問題を解く最適化方法が提供される。この最適化方法は、組合せの基本構成要素に関する基本情報と、前記制約条件に関する制約情報とが入力される入力工程と、前記基本情報を用いて、最適化された組合せのデータを取得する第１データ取得工程と、前記第１データ取得工程において取得された、最適化された組合せのデータと前記制約情報とを用いて、前記基本情報を補正する補正工程と、前記補正工程において補正された基本情報を用いて、前記制約条件を学習した新たな基本情報を取得する学習工程と、前記学習工程において取得された新たな基本情報を用いて、最適化された組合せのデータを新たに取得する第２データ取得工程と、を備える。この構成によれば、補正工程では、第１データ取得工程において取得される最適化された組合せのデータと制約情報とを用いて、基本情報を補正する。学習工程では、補正工程において補正された基本情報を用いて、制約条件を学習した新たな基本情報を取得し、第２データ取得工程では、制約条件を学習した新たな基本情報を用いて、最適化された組合せのデータを新たに取得する。これにより、例えば、定式化が困難な制約条件を満たす組合せの中から最適化された組合せを求めることができる。また、第２データ取得工程において取得される、最適化された組合せの新たなデータと制約情報とを用いて、学習工程では、制約条件をさらに学習した、さらに新たな基本情報を取得することができる。このように、最適化された組合せのデータの取得と、制約条件を学習した基本情報の取得とを逐次的に繰り返すことで、制約条件を満たす組合せの中でも、さらに最適化された組合せを求めることができる。 (7) Another aspect of the present invention provides an optimization method for solving a combinatorial optimization problem that is subject to constraints using an optimization device. This optimization method includes an input process for inputting basic information regarding basic components of a combination and constraint information regarding the constraints; a first data acquisition process for acquiring data of an optimized combination using the basic information; a correction process for correcting the basic information using the data of the optimized combination acquired in the first data acquisition process and the constraint information; a learning process for acquiring new basic information that has learned the constraints using the basic information corrected in the correction process; and a second data acquisition process for acquiring new data of the optimized combination using the new basic information acquired in the learning process. According to this configuration, in the correction process, the basic information is corrected using the data of the optimized combination acquired in the first data acquisition process and the constraint information. In the learning process, new basic information that has learned the constraints is acquired using the basic information corrected in the correction process, and in the second data acquisition process, new data of the optimized combination is acquired using the new basic information that has learned the constraints. This makes it possible to find, for example, an optimized combination from among combinations that satisfy constraint conditions that are difficult to formulate. Furthermore, using the new data and constraint information for the optimized combination acquired in the second data acquisition process, new basic information that further learns the constraint conditions can be acquired in the learning process. In this way, by sequentially repeating the acquisition of data for the optimized combination and the acquisition of basic information that learns the constraint conditions, it is possible to find a further optimized combination from among combinations that satisfy the constraint conditions.

（８）本発明のさらに別の形態によれば、制約条件が課されている組合せ最適化問題の解決をコンピュータに実行させるコンピュータプログラムが提供される。このコンピュータプログラムは、組合せの基本構成要素に関する基本情報と、前記制約条件に関する制約情報とが入力される入力機能と、前記基本情報を用いて、最適化された組合せのデータを取得する第１データ取得機能と、前記第１データ取得機能が取得する最適化された組合せのデータと前記制約情報とを用いて、前記基本情報を補正する補正機能と、前記補正機能によって補正された基本情報を用いて、前記制約条件を学習した新たな基本情報を取得する学習機能と、前記学習機能が取得する新たな基本情報を用いて、最適化された組合せのデータを新たに取得する第２データ取得機能と、をコンピュータに実行させる。この構成によれば、コンピュータは、補正機能が、第１データ取得機能によって取得される最適化された組合せのデータと制約情報とを用いて、基本情報を補正する。学習機能は、補正機能によって補正された基本情報を用いて、制約条件を学習した新たな基本情報を取得し、第２データ取得機能は、制約条件を学習した新たな基本情報を用いて、最適化された組合せのデータを新たに取得する。これにより、例えば、定式化が困難な制約条件を満たす組合せの中から最適化された組合せを求めることができる。また、第２データ取得機能によって取得する、最適化された組合せの新たなデータと制約情報とを用いて、学習機能では、制約条件をさらに学習した、さらに新たな基本情報を取得することができる。このように、最適化された組合せのデータの取得と、制約条件を学習した基本情報の取得とを逐次的に繰り返すことで、制約条件を満たす組合せの中でも、さらに最適化された組合せを求めることができる。 (8) According to yet another aspect of the present invention, a computer program is provided that causes a computer to solve a combinatorial optimization problem that is subject to constraints. The computer program causes the computer to execute the following functions: an input function that inputs basic information regarding basic components of a combination and constraint information regarding the constraints; a first data acquisition function that acquires data of an optimized combination using the basic information; a correction function that corrects the basic information using the data of the optimized combination acquired by the first data acquisition function and the constraint information; a learning function that acquires new basic information that has learned the constraints using the basic information corrected by the correction function; and a second data acquisition function that acquires new data of the optimized combination using the new basic information acquired by the learning function. According to this configuration, the correction function of the computer corrects the basic information using the data of the optimized combination acquired by the first data acquisition function and the constraint information. The learning function uses the basic information corrected by the correction function to acquire new basic information that has learned the constraints, and the second data acquisition function acquires new data of the optimized combination using the new basic information that has learned the constraints. This makes it possible, for example, to find an optimized combination from among combinations that satisfy constraint conditions that are difficult to formulate. Furthermore, using new data and constraint information for the optimized combination obtained by the second data acquisition function, the learning function can acquire new basic information that further learns the constraint conditions. In this way, by sequentially repeating the acquisition of data for the optimized combination and the acquisition of basic information that learns the constraint conditions, it is possible to find an even more optimized combination from among combinations that satisfy the constraint conditions.

なお、本発明は、種々の態様で実現することが可能であり、例えば、最適化装置を含むシステム、これら装置およびシステムの制御方法、これら装置およびシステムにおいて最適化方法を実行させるコンピュータプログラム、そのコンピュータプログラムを配布するためのサーバ装置、そのコンピュータプログラムを記憶した一時的でない記憶媒体等の形態で実現することができる。 The present invention can be realized in various forms, such as a system including an optimization device, a control method for these devices and systems, a computer program for executing the optimization method in these devices and systems, a server device for distributing the computer program, and a non-transitory storage medium on which the computer program is stored.

第１実施形態の最適化装置の概略ブロック図である。FIG. 1 is a schematic block diagram of an optimization device according to a first embodiment. 第１実施形態の最適化方法の概略を説明するフローチャートである。1 is a flowchart outlining an optimization method according to a first embodiment. 組合せ最適化問題の具体例を説明する第１の図である。FIG. 1 is a first diagram illustrating a specific example of a combinatorial optimization problem. 組合せ最適化問題の具体例を説明する第２の図である。FIG. 10 is a second diagram illustrating a specific example of a combinatorial optimization problem. 第１実施形態の最適化方法のフローチャートの具体例である。1 is a specific example of a flowchart of an optimization method according to the first embodiment. 第１実施形態の最適化方法の効果を説明する図である。10A and 10B are diagrams illustrating the effects of the optimization method according to the first embodiment. 第１実施形態の最適化方法を説明する概略模式図である。FIG. 2 is a schematic diagram illustrating an optimization method according to the first embodiment.

＜第１実施形態＞
図１は、第１実施形態の最適化装置１の概略ブロック図である。本実施形態の最適化装置１は、種々の組合せ最適化問題を解くための装置であって、特に、定式化が困難な制約条件が課されている最適化問題を解くことに適した装置である。本実施形態の最適化装置１は、パーソナルコンピュータ（ＰｅｒｓｏｎａｌＣｏｍｐｕｔｅｒ：ＰＣ）で構成されている。最適化装置１は、ユーザからの入力を受け付ける入力部１０と、記憶部２０と、各種プログラムを実行するＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）３０と、プログラムの実行結果に関する情報を出力する出力部４０と、他の装置と有線または無線により各種情報を送受信可能な送受信部５０と、を備える。なお、本実施形態では、最適化装置１は、１台のパーソナルコンピュータであるとしたが、複数の装置から構成されていてもよい。例えば、入力部１０および出力部４０を備える入出力端末と、記憶部２０およびＣＰＵ３０を備える演算装置とが別々の場所に存在しており、入出力端末と演算装置とがネットワークを介して接続されていてもよい。 First Embodiment
FIG. 1 is a schematic block diagram of an optimization device 1 according to a first embodiment. The optimization device 1 according to this embodiment is a device for solving various combinatorial optimization problems, particularly suitable for solving optimization problems with constraints that are difficult to formulate. The optimization device 1 according to this embodiment is configured as a personal computer (PC). The optimization device 1 includes an input unit 10 that accepts input from a user, a storage unit 20, a CPU (Central Processing Unit) 30 that executes various programs, an output unit 40 that outputs information related to the execution results of the programs, and a transmission/reception unit 50 that can transmit and receive various information to and from other devices via wired or wireless connections. While the optimization device 1 according to this embodiment is described as a single personal computer, it may also be configured as a plurality of devices. For example, an input/output terminal including the input unit 10 and the output unit 40 and a calculation device including the storage unit 20 and the CPU 30 may be located in separate locations, and the input/output terminal and the calculation device may be connected via a network.

入力部１０は、最適化装置１において探索される組合せの基本構成要素に関する基本情報と、組合せに課される制約条件に関する制約情報とが入力される。本実施形態では、基本構成要素とは、組合せの探索において組合せられる要素そのものを指し、基本情報とは、最適化装置１が組合せのデータを取得するために必要な情報を指す。例えば、後述する電気自動車の移動経路問題では、基本構成要素は、電気自動車が訪問すべき２地点間の移動コストであり、基本情報は、電気自動車が訪問すべき地点の数、充電ステーションの数などである。制約情報とは、探索される組合せが守るべき条件（制約条件）に関する情報を指す。例えば、電気自動車の移動経路問題では、制約情報は、電気自動車が備えるバッテリの最大電気容量などである。本実施形態では、入力部１０は、キーボードとマウスとにより構成されている。なお、入力部１０は、これに限定されず、基本情報と制約情報とが記憶されている、ＵＳＢメモリやフラッシュメモリなど外部の記録デバイスとのインターフェースであってもよい。また、本実施形態における基本情報や制約情報の入力は、入力部１０によるものに限定されず、送受信部５０を用いて、図示しない他の装置からこれらの情報を受信することで入力されてもよい。 The input unit 10 receives input of basic information about the basic components of the combinations searched for by the optimization device 1 and constraint information about the constraints imposed on the combinations. In this embodiment, the basic components refer to the elements themselves that are combined in the search for combinations, and the basic information refers to information necessary for the optimization device 1 to obtain combination data. For example, in the electric vehicle travel route problem described below, the basic components are the travel cost between two locations that the electric vehicle must visit, and the basic information includes the number of locations that the electric vehicle must visit and the number of charging stations. The constraint information refers to information about the conditions (constraints) that the combinations searched for must observe. For example, in the electric vehicle travel route problem, the constraint information includes the maximum electric capacity of the battery equipped in the electric vehicle. In this embodiment, the input unit 10 is composed of a keyboard and a mouse. However, the input unit 10 is not limited to this and may also be an interface with an external storage device, such as a USB memory or flash memory, on which the basic information and constraint information are stored. Furthermore, in this embodiment, the input of basic information and constraint information is not limited to being via the input unit 10, but may also be input by receiving this information from another device (not shown) using the transceiver unit 50.

記憶部２０は、入力部１０によって入力される基本情報や制約情報を収納するとともに、組合せ最適化問題の解決を実行するためのコンピュータプログラムが収容されている。本実施形態では、記憶部２０は、メモリ装置の総称であり、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）、ソリッドステートドライブ（ＳＳＤ）、ハードディスク（ＨＤＤ）、フラッシュメモリカードなど様々な形式の記憶装置を使用することができる。 The storage unit 20 stores the basic information and constraint information input by the input unit 10, as well as a computer program for solving combinatorial optimization problems. In this embodiment, the storage unit 20 is a general term for memory devices, and various types of storage devices can be used, such as ROM (Read Only Memory), RAM (Random Access Memory), solid state drives (SSDs), hard disks (HDDs), and flash memory cards.

ＣＰＵ３０は、記憶部２０のＲＯＭに格納されているプログラムをＲＡＭに展開することにより、各種プログラムの機能を実行する。ＣＰＵ３０は、データ取得部３１と、補正部３２と、評価判定部３３と、学習部３４と、して機能する。 The CPU 30 executes the functions of various programs by loading programs stored in the ROM of the storage unit 20 into the RAM. The CPU 30 functions as a data acquisition unit 31, a correction unit 32, an evaluation/determination unit 33, and a learning unit 34.

データ取得部３１は、基本情報を用いて、最適化された組合せのデータを取得する。データ取得部３１における最適化された組合せのデータを取得する方法は、入力部１０によって入力される基本情報を用いて取得する場合と、学習部３４における学習によって取得される新たな基本情報を用いて取得する場合との２通りがある。本実施形態では、データ取得部３１は、取得した最適化された組合せのデータについて、組合せの最適度を評価するための評価値を算出する。データ取得部３１の機能の詳細は、後述する。 The data acquisition unit 31 acquires data on optimized combinations using basic information. There are two methods for acquiring data on optimized combinations in the data acquisition unit 31: acquiring data using basic information input by the input unit 10, or acquiring data using new basic information acquired through learning in the learning unit 34. In this embodiment, the data acquisition unit 31 calculates an evaluation value for the acquired optimized combination data to evaluate the optimality of the combination. The functions of the data acquisition unit 31 will be described in detail below.

補正部３２は、データ取得部３１が取得する最適化された組合せのデータと、制約情報とを用いて、基本情報を補正する。補正部３２は、条件判定部３２ａと、情報更新部３２ｂと、を有する。条件判定部３２ａは、データ取得部３１によって取得される最適化された組合せが制約情報に含まれる制約条件を満たすか否かを判定する。情報更新部３２ｂは、条件判定部３２ａにおける判定結果に応じて基本情報を更新する。本実施形態では、情報更新部３２ｂは、データ取得部３１が算出する評価値と、制約条件に対応する罰則値とを用いて、基本情報を更新する。補正部３２の機能の詳細は、後述する。 The correction unit 32 corrects the basic information using the data of the optimized combination acquired by the data acquisition unit 31 and the constraint information. The correction unit 32 has a condition determination unit 32a and an information update unit 32b. The condition determination unit 32a determines whether the optimized combination acquired by the data acquisition unit 31 satisfies the constraint conditions included in the constraint information. The information update unit 32b updates the basic information according to the determination result of the condition determination unit 32a. In this embodiment, the information update unit 32b updates the basic information using the evaluation value calculated by the data acquisition unit 31 and the penalty value corresponding to the constraint condition. The function of the correction unit 32 will be described in detail below.

評価判定部３３は、データ取得部３１が取得する、最適化された組合せの最適度を判定する。本実施形態では、評価判定部３３は、補正部３２が補正した基本情報を用いて、最適度を判定する。評価判定部３３の機能の詳細は、後述する。 The evaluation and determination unit 33 determines the optimality of the optimized combination acquired by the data acquisition unit 31. In this embodiment, the evaluation and determination unit 33 determines the optimality using the basic information corrected by the correction unit 32. Details of the function of the evaluation and determination unit 33 will be described later.

学習部３４は、補正部３２が補正した基本情報を用いて、制約条件を学習した新たな基本情報を取得する。本実施形態の学習部３４は、補正部３２が補正した基本情報に対して、ベイズ推定を用いて新たな基本情報を取得する。学習部３４の機能の詳細は、後述する。 The learning unit 34 uses the basic information corrected by the correction unit 32 to obtain new basic information that has learned the constraints. In this embodiment, the learning unit 34 obtains new basic information using Bayesian estimation for the basic information corrected by the correction unit 32. Details of the function of the learning unit 34 will be described later.

図２は、本実施形態の最適化方法の概略を説明するフローチャートである。図２は、本実施形態の最適化方法のフローチャートを一般的な表現で表したものである。図２に示すフローチャートによって説明される最適化方法は、制約条件が課される組合せ最適化問題（以下、「制約付き最適化問題」という）に適用される。本実施形態の最適化方法が適用される制約付き最適化問題は、入力ｘに対する解ｚについて満たすべき制約条件が課されている状況において、評価値ｙが最も良い解ｚを与える入力ｘを求めることを目的としている。さらに、本実施形態の制約付き最適化問題は、解ｚについて満たすべき制約条件を定式化することが困難である点に特徴がある。 Figure 2 is a flowchart outlining the optimization method of this embodiment. Figure 2 is a generalized representation of the flowchart of the optimization method of this embodiment. The optimization method illustrated by the flowchart in Figure 2 is applied to combinatorial optimization problems to which constraints are imposed (hereinafter referred to as "constrained optimization problems"). Constrained optimization problems to which the optimization method of this embodiment is applied aim to find an input x that gives the solution z with the best evaluation value y in a situation where constraints that must be satisfied for a solution z for input x are imposed. Furthermore, the constrained optimization problem of this embodiment is characterized by the difficulty of formulating the constraints that must be satisfied for solution z.

本実施形態の制約付き最適化問題を解くための最適化方法では、図２に示すように、最初に、入力部１０によって基本情報と制約情報とを含む基礎データが入力される（ステップＳ１）。次に、データ取得部３１は、基本情報を用いて制約なしの最適化問題を解く（ステップＳ２）。これにより、データ取得部３１は、入力ｘにおける解ｚを導出し、最適化された入力ｘと解ｚとの組合せのデータを取得する。ステップＳ２で導出される解ｚには、制約条件が課されていない。データ取得部３１は、入力ｘにおける解ｚを導出すると、入力ｘにおける解ｚに対する仮の評価値ａを求める（ステップＳ３）。ステップＳ１は、特許請求の範囲の「入力工程」に相当する。ステップＳ２は、特許請求の範囲の「第１データ取得工程」に相当する。 In the optimization method for solving a constrained optimization problem of this embodiment, as shown in FIG. 2, first, basic data including basic information and constraint information is input by the input unit 10 (step S1). Next, the data acquisition unit 31 solves the unconstrained optimization problem using the basic information (step S2). As a result, the data acquisition unit 31 derives a solution z for the input x and acquires data for the optimized combination of the input x and solution z. No constraints are imposed on the solution z derived in step S2. After deriving the solution z for the input x, the data acquisition unit 31 calculates a provisional evaluation value a for the solution z for the input x (step S3). Step S1 corresponds to the "input step" in the claims. Step S2 corresponds to the "first data acquisition step" in the claims.

次に、条件判定部３２ａは、データ取得部３１が取得した入力ｘと解ｚとの組合せのデータと制約情報とを用いて、解ｚが制約条件を充足しているか否かを判定する（ステップＳ４）。解ｚは制約条件を充足していると条件判定部３２ａが判定する場合（ステップＳ４：ＹＥＳ）、ステップＳ５に移行する。解ｚは制約条件を充足していないと条件判定部３２ａが判定する場合（ステップＳ４：ＮＯ）、ステップＳ６に移行する。 Next, the condition determination unit 32a uses the data for the combination of input x and solution z and the constraint information acquired by the data acquisition unit 31 to determine whether solution z satisfies the constraint conditions (step S4). If the condition determination unit 32a determines that solution z satisfies the constraint conditions (step S4: YES), the process proceeds to step S5. If the condition determination unit 32a determines that solution z does not satisfy the constraint conditions (step S4: NO), the process proceeds to step S6.

ステップＳ５では、条件判定部３２ａは、仮の評価値ａを入力ｘにおける評価値ｙとする（ステップＳ５）。一方、ステップＳ６では、条件判定部３２ａは、仮の評価値ａに、制約条件に対応する罰則値ｂを用いた演算によって得られる値（修正評価値）を、入力ｘにおける評価値ｙとする（ステップＳ６）。なお、ステップＳ６での罰則値ｂを用いた演算は、例えば、加算や減算などの四則演算などであるが、これらに限定されない。ステップＳ６での演算による評価値ｙの大きさの変化は、後述するステップＳ８での最適度の判定において重要となる。したがって、本実施形態の最適化方法が適用される制約付き最適化問題の内容によって、ステップＳ６での演算の内容は異なる。 In step S5, the condition determination unit 32a sets the provisional evaluation value a as the evaluation value y for input x (step S5). Meanwhile, in step S6, the condition determination unit 32a sets the value (modified evaluation value) obtained by performing an operation using the provisional evaluation value a and the penalty value b corresponding to the constraint condition as the evaluation value y for input x (step S6). Note that the operation using the penalty value b in step S6 may be, for example, an arithmetic operation such as addition or subtraction, but is not limited to these. The change in the magnitude of the evaluation value y due to the operation in step S6 is important in determining the optimality in step S8, which will be described later. Therefore, the operation in step S6 will differ depending on the content of the constrained optimization problem to which the optimization method of this embodiment is applied.

ステップＳ５またはステップＳ６の次に、情報更新部３２ｂは、入力ｘにおける評価値ｙをステップＳ５またはステップＳ６の処理に従って更新する（ステップＳ７）。すなわち、補正部３２は、制約条件に対応する罰則値ｂを用いて評価値を変更することで、直前のステップＳ２において、最適化された入力ｘと解ｚとの組合せのデータを取得する際に用いた基本情報を、制約条件を考慮した基本情報に補正する。ステップＳ４からステップＳ７までは、特許請求の範囲の「補正工程」に相当する。 After step S5 or step S6, the information update unit 32b updates the evaluation value y for the input x according to the processing of step S5 or step S6 (step S7). That is, the correction unit 32 changes the evaluation value using the penalty value b corresponding to the constraint, thereby correcting the basic information used in obtaining data for the optimized combination of input x and solution z in the previous step S2 to basic information that takes the constraint into account. Steps S4 to S7 correspond to the "correction process" in the claims.

次に、評価判定部３３は、入力ｘにおける評価値ｙがあらかじめ設定されている基準を満たすか否かを判定する（ステップＳ８）。ステップＳ８では、例えば、ステップＳ７で更新された評価値ｙが基準値以下であるか否かを判定する。評価値ｙは基準値以下であると評価判定部３３が判定する場合（ステップＳ８：ＹＥＳ）、今回の最適化方法は、終了する。評価値ｙは基準値以下でないと評価判定部３３が判定する場合（ステップＳ８：ＮＯ）、ステップＳ９に移行する。なお、評価判定部３３における判定の方法は、基準値以下であるか否かに限定されない。例えば、基準値以上としてもよいし、所定の範囲内に含まれるなどとしてもよい。 Next, the evaluation determination unit 33 determines whether the evaluation value y for the input x satisfies a preset criterion (step S8). In step S8, for example, it determines whether the evaluation value y updated in step S7 is equal to or less than a reference value. If the evaluation determination unit 33 determines that the evaluation value y is equal to or less than the reference value (step S8: YES), the current optimization method ends. If the evaluation determination unit 33 determines that the evaluation value y is not equal to or less than the reference value (step S8: NO), the process proceeds to step S9. Note that the method of determination in the evaluation determination unit 33 is not limited to whether it is equal to or less than the reference value. For example, it may be equal to or greater than the reference value, or it may be within a predetermined range.

ステップＳ８において、評価値ｙは基準値以下でないと判定されると、ベイズ推定によって入力ｘと評価値ｙとの関係を学習する（ステップＳ９）。具体的には、学習部３４は、ｙ＝ｆ（ｘ）で表されるベイズ推定において、ステップＳ７で更新された評価値ｙを用いて、入力ｘと評価値ｙとの関係を学習する。これにより、学習部３４は、ステップＳ３からＳ７における判定の対象とならなかった組合せに関する情報も含む、制約条件を学習した新たな基本情報を取得することができる。ステップＳ９は、特許請求の範囲の「学習工程」に相当する。 If it is determined in step S8 that the evaluation value y is not less than the reference value, the relationship between the input x and the evaluation value y is learned using Bayesian estimation (step S9). Specifically, the learning unit 34 uses the evaluation value y updated in step S7 to learn the relationship between the input x and the evaluation value y in Bayesian estimation expressed as y = f(x). This allows the learning unit 34 to acquire new basic information that has learned the constraints, including information about combinations that were not subject to the judgment in steps S3 to S7. Step S9 corresponds to the "learning process" in the claims.

次に、新たな入力ｘを決定する（ステップＳ１０）。具体的には、データ取得部３１は、ステップＳ９において取得した新たな基本情報を用いて、最適な入力と推定される新たな入力ｘを決定する。なお、ステップＳ９およびステップＳ１０における入力ｘを決定する方法は、ベイズ推定に限定されない。 Next, a new input x is determined (step S10). Specifically, the data acquisition unit 31 uses the new basic information acquired in step S9 to determine a new input x that is estimated to be the optimal input. Note that the method for determining the input x in steps S9 and S10 is not limited to Bayesian estimation.

ステップＳ１０の次に、ステップＳ２に戻り、ステップＳ９において取得した新たな基本情報を用いて、制約なしの最適化問題を解く（ステップＳ２）。本実施形態の最適化方法では、ステップＳ２から２つの判定（ステップＳ４およびステップＳ８）を経てステップＳ１０に至るまでの処理を逐次的に行うことで、評価値ｙが最も良い解ｚを与える入力ｘを求めることができる。ステップＳ１０およびステップＳ１０の次のステップＳ２は、特許請求の範囲の「第２データ取得工程」に相当する。 After step S10, the process returns to step S2, where the new basic information acquired in step S9 is used to solve the unconstrained optimization problem (step S2). In the optimization method of this embodiment, by sequentially performing the processes from step S2 through two judgments (steps S4 and S8) to step S10, it is possible to determine the input x that gives the solution z with the best evaluation value y. Step S10 and the step S2 that follows step S10 correspond to the "second data acquisition step" in the claims.

次に、具体例を用いて、本実施形態の最適化方法を説明する。本実施形態では、制約付き最適化問題として、電気自動車の最大電気容量を考慮した電気自動車の移動経路を最適化する。以下説明する、本実施形態の最適化方法が適用される組合せ最適化問題を、単に、「電気自動車の移動経路問題」という。本実施形態の最適化方法は、ユーザによって基本情報と制約情報が入力されると、実行される。 Next, the optimization method of this embodiment will be explained using a specific example. In this embodiment, the constrained optimization problem is to optimize the travel route of an electric vehicle, taking into account the maximum electric capacity of the electric vehicle. The combinatorial optimization problem to which the optimization method of this embodiment is applied, as described below, will be simply referred to as the "electric vehicle travel route problem." The optimization method of this embodiment is executed when the user inputs basic information and constraint information.

図３は、本実施形態の組合せ最適化問題の具体例を説明する第１の図である。図３は、電気自動車の移動経路問題において、電気自動車ＥＶが訪問すべき地点Ｐ１～Ｐ１０と、地点Ｐ１～Ｐ１０のうちのいくつかに設置される充電ステーションとの関係の一例を示した図である。図３では、電気自動車ＥＶが訪問すべき地点Ｐ１～Ｐ１０のうち、３か所の地点Ｐ３，Ｐ６，Ｐ１０のそれぞれに、充電ステーションＳｐ１～Ｓｐ３のそれぞれが設置されている。電気自動車の移動経路問題では、電気自動車ＥＶは、充電ステーションＳｐ１～Ｓｐ３のそれぞれでの充電を利用しつつ、１０か所の訪問すべき地点Ｐ１～Ｐ１０を全て訪問する。充電ステーションＳｐ１～Ｓｐ３のそれぞれが設置される位置は、本実施形態の電気自動車の移動経路問題において、最適化の対象となっている。充電ステーションＳｐ１～Ｓｐ３のそれぞれが設置される位置に関する情報は、初期設定として、基本情報に含まれる。なお、図３では、１０か所の地点Ｐ１～Ｐ１０のそれぞれの関係を模式的に示しているだけであり、地点Ｐ１～Ｐ１０のそれぞれの距離や方向を示すものではない。 Figure 3 is the first diagram illustrating a specific example of a combinatorial optimization problem of this embodiment. Figure 3 is a diagram showing an example of the relationship between points P1 to P10 that an electric vehicle EV must visit and charging stations installed at some of the points P1 to P10 in an electric vehicle travel path problem. In Figure 3, charging stations Sp1 to Sp3 are installed at three of the points P1 to P10 that the electric vehicle EV must visit: P3, P6, and P10. In the electric vehicle travel path problem, the electric vehicle EV visits all 10 points P1 to P10 that it must visit while charging at each of the charging stations Sp1 to Sp3. The locations where each of the charging stations Sp1 to Sp3 is installed are the targets of optimization in the electric vehicle travel path problem of this embodiment. Information regarding the locations where each of the charging stations Sp1 to Sp3 is installed is included in the basic information as an initial setting. Note that Figure 3 only shows the schematic relationship between the 10 points P1 to P10, and does not indicate the distance or direction between each of the points P1 to P10.

図４は、本実施形態の組合せ最適化問題の具体例を説明する第２の図である。図４には、電気自動車が備えるバッテリの消費量に関する情報が示されている。具体的には、図３で示した地点Ｐ１～Ｐ１０のうちの２地点間の移動において消費する電気の量を示している。図４では、横軸に電気自動車が出発する地点（出発地点）を示し、縦軸に電気自動車が到着する地点（到着地点）を示している。図４には、出発地点と到着地点との関係において、消費するバッテリの消費量をドットの密度で示している。具体的には、ドットの密度が高いほど、バッテリの消費量が多いことを示している。図４に示す情報は、基本情報に含まれる。 Figure 4 is a second diagram illustrating a specific example of a combinatorial optimization problem according to this embodiment. Figure 4 shows information related to the consumption of the battery provided in an electric vehicle. Specifically, it shows the amount of electricity consumed when traveling between two of the points P1 to P10 shown in Figure 3. In Figure 4, the horizontal axis shows the point from which the electric vehicle departs (departure point), and the vertical axis shows the point at which the electric vehicle arrives (arrival point). Figure 4 shows the amount of battery consumption in relation to the departure point and the arrival point as dot density. Specifically, the higher the dot density, the greater the battery consumption. The information shown in Figure 4 is included in the basic information.

電気自動車が備えるバッテリの消費量は、例えば、出発地点をＡとし到着地点をＢとした場合と、出発地点をＢとし到着地点をＡとした場合とでは、異なる。図４を参照して説明すると、出発地点を地点Ｐ６とし到着地点を地点Ｐ８とした場合、バッテリの消費量Ｃｓ６８は、５近くになるが、出発地点を地点Ｐ６とし到着地点を地点Ｐ８とした場合、バッテリの消費量Ｃｓ８６は、４程度となる。これは、２地点の高低差などによって、例えば、下り坂がある場合、バッテリは、放電することなく充電することができるからである。したがって、２地点間の関係では、バッテリの消費量が０より小さい値となる場合も存在する。 The battery consumption of an electric vehicle differs, for example, when the departure point is A and the destination point is B, compared to when the departure point is B and the destination point is A. Referring to Figure 4, if the departure point is P6 and the destination point is P8, battery consumption Cs68 will be close to 5, but if the departure point is P6 and the destination point is P8, battery consumption Cs86 will be around 4. This is because, for example, if there is a downhill slope due to the difference in elevation between the two points, the battery can be charged without discharging. Therefore, between two points, battery consumption may be less than 0.

本実施形態では、電気自動車が備えるバッテリの最大電気容量を３とし、１か所の充電ステーションにおいて、バッテリの電気容量２に相当する電気量をバッテリに充電することができるとする。本実施形態での電気自動車の移動経路問題では、バッテリの電気量が最大電気容量である３を越えてはならず、また、０以下になることも許されない容量制約（制約条件）において、充電ステーションを最適な地点に配置し、バッテリの消費量（移動コスト）が最も少ない経路を求めることとなる。すなわち、本実施形態の最適化方法が適用される電気自動車の移動経路問題では、電気自動車が備えるバッテリの容量制約を考慮しつつ、充電ステーションが配置される位置と電気自動車の移動経路との両方を最適化することとなる。 In this embodiment, the maximum electrical capacity of the battery equipped in an electric vehicle is assumed to be 3, and an amount of electricity equivalent to the battery's electrical capacity of 2 can be charged to the battery at one charging station. In the electric vehicle travel route problem in this embodiment, the charging station is located at the optimal location and a route with the lowest battery consumption (travel cost) is found under the capacity constraint (constraint condition) that the amount of electricity in the battery must not exceed the maximum electrical capacity of 3 and must not fall below 0. In other words, in the electric vehicle travel route problem to which the optimization method of this embodiment is applied, both the location of the charging station and the electric vehicle travel route are optimized while taking into account the capacity constraint of the battery equipped in the electric vehicle.

図５は、本実施形態の最適化方法のフローチャートの具体例である。電気自動車の移動経路問題における最適化方法では、最初に、基礎データを入力する（ステップＳ１１）。ステップＳ１１では、入力部１０によって電気自動車の移動経路問題における基本情報と制約情報とが最適化装置１に入力される。ステップＳ１１において入力される基本情報には、上述したように、電気自動車が訪問すべき地点の数、充電ステーションの数、図４で示した電気自動車が訪問すべき地点の間における移動コストなどに関する情報が含まれる。充電ステーションが配置される位置に関する情報は、上述したように、初期設定として基本情報に含まれている。以降、便宜的に、充電ステーションが配置される位置を「充電ステーションの位置ｘ」とする。充電ステーションの位置ｘは、図２で説明した最適化方法の一般表現における入力ｘに対応する。 Figure 5 is a specific example of a flowchart of the optimization method of this embodiment. In the optimization method for the electric vehicle travel path problem, basic data is first input (step S11). In step S11, basic information and constraint information for the electric vehicle travel path problem are input to the optimization device 1 by the input unit 10. As described above, the basic information input in step S11 includes information on the number of locations to be visited by the electric vehicle, the number of charging stations, and the travel costs between the locations to be visited by the electric vehicle shown in Figure 4. As described above, information on the locations where charging stations will be installed is included in the basic information as an initial setting. Hereinafter, for convenience, the locations where charging stations will be installed will be referred to as "charging station location x." Charging station location x corresponds to input x in the general expression of the optimization method described in Figure 2.

次に、容量制約なしの最適経路問題を解く（ステップＳ１２）。ステップＳ１２では、データ取得部３１は、基本情報を用いて電気自動車の移動経路問題を解く。これにより、データ取得部３１は、充電ステーションの位置ｘにおける最適移動経路ｚを「最適化された組合せのデータ」として取得する。ステップＳ１２では、充電ステーションの位置ｘに関する情報を含む基本情報を用いて電気自動車の移動経路問題を解くため、電気自動車のバッテリの電気量が最大電気容量である３以下であり、かつ、０より大きいことは、最適移動経路ｚを取得するにあたって課されない。ステップＳ１２において取得される最適移動経路ｚは、図２で説明した最適化方法の一般表現における解ｚに対応する。 Next, the optimal route problem without capacity constraints is solved (step S12). In step S12, the data acquisition unit 31 solves the electric vehicle travel route problem using basic information. As a result, the data acquisition unit 31 acquires the optimal travel route z for the charging station location x as "optimized combination data." In step S12, the electric vehicle travel route problem is solved using basic information including information about the charging station location x, so the amount of electricity in the electric vehicle's battery must not exceed 3, which is the maximum electrical capacity, and must not be greater than 0 when acquiring the optimal travel route z. The optimal travel route z acquired in step S12 corresponds to the solution z in the general expression of the optimization method described in Figure 2.

次に、最適移動経路の評価値を求める（ステップＳ１３）。ステップＳ１３では、データ取得部３１は、ステップＳ１２において取得した、充電ステーションの位置ｘにおける最適移動経路ｚの評価値ａを算出する。評価値ａは、ステップＳ１２において電気自動車の移動経路問題を解いたときに算出される移動コストで代用することができる。しかしながら、評価値は、これに限定されない。評価値ａは、図２で説明した最適化方法の一般表現における仮の評価値ａに対応する。 Next, an evaluation value of the optimal travel route is calculated (step S13). In step S13, the data acquisition unit 31 calculates an evaluation value a of the optimal travel route z at the charging station location x acquired in step S12. The evaluation value a can be substituted with the travel cost calculated when solving the electric vehicle travel route problem in step S12. However, the evaluation value is not limited to this. The evaluation value a corresponds to the provisional evaluation value a in the general expression of the optimization method described in Figure 2.

次に、最適移動経路はバッテリの容量制約を充足しているか否かを判定する（ステップＳ１４）。ステップＳ１４では、条件判定部３２ａは、ステップＳ１２において求められた、充電ステーションの位置ｘにおける電気自動車の最適移動経路ｚについて、電気自動車のバッテリの電気量が３以下であり、かつ、０より大きいか否かを判定する。最適移動経路ｚは容量制約を充足していると条件判定部３２ａが判定する場合（ステップＳ１４：ＹＥＳ）、ステップＳ１５に移行する。最適移動経路ｚは容量制約を充足していないと条件判定部３２ａが判定する場合（ステップＳ１４：ＮＯ）、ステップＳ１６に移行する。 Next, it is determined whether the optimal travel route satisfies the battery capacity constraint (step S14). In step S14, the condition determination unit 32a determines whether the amount of electricity in the battery of the electric vehicle for the optimal travel route z of the electric vehicle at the charging station position x calculated in step S12 is 3 or less and greater than 0. If the condition determination unit 32a determines that the optimal travel route z satisfies the capacity constraint (step S14: YES), the process proceeds to step S15. If the condition determination unit 32a determines that the optimal travel route z does not satisfy the capacity constraint (step S14: NO), the process proceeds to step S16.

ステップＳ１４において、最適移動経路ｚは容量制約を充足していると判定される場合、ステップＳ１５では、条件判定部３２ａは、ステップＳ１３において算出した評価値ａを最適移動経路ｚの移動コストｙとする（ステップＳ１５）。移動コストｙは、図２で説明した最適化方法の一般表現における評価値ｙに対応する。 If it is determined in step S14 that the optimal travel route z satisfies the capacity constraint, in step S15, the condition determination unit 32a sets the evaluation value a calculated in step S13 as the travel cost y of the optimal travel route z (step S15). The travel cost y corresponds to the evaluation value y in the general expression of the optimization method described in Figure 2.

一方、ステップＳ１４において、最適移動経路ｚは容量制約を充足していないと判定される場合、評価値ａと罰則値ｂとの合計を最適移動経路ｚの移動コストｙとする（ステップＳ１６）。ステップＳ１６では、条件判定部３２ａは、ステップＳ１３において算出した評価値ａに、容量制約に対応する罰則値ｂを加算することで得られる値（修正評価値）を、最適移動経路ｚの移動コストｙとする。すなわち、最適移動経路が容量制約を充足していない場合、その最適移動経路の移動コストは、容量制約を充足している最適移動経路に比べて、大きくなりやすい。なお、電気自動車の移動経路問題では、移動コストは小さい方が望ましいため、容量制約を充足していない場合には、評価値に罰則値を加算した値を移動コストとしている。 On the other hand, if it is determined in step S14 that the optimal travel route z does not satisfy the capacity constraint, the sum of the evaluation value a and the penalty value b is set as the travel cost y of the optimal travel route z (step S16). In step S16, the condition determination unit 32a sets the value obtained by adding the penalty value b corresponding to the capacity constraint to the evaluation value a calculated in step S13 (modified evaluation value) as the travel cost y of the optimal travel route z. In other words, if the optimal travel route does not satisfy the capacity constraint, the travel cost of that optimal travel route is likely to be higher than that of an optimal travel route that satisfies the capacity constraint. Note that in electric vehicle travel route problems, a small travel cost is desirable, so if the capacity constraint is not satisfied, the travel cost is set as the value obtained by adding the penalty value to the evaluation value.

ステップＳ１５またはステップＳ１６の次に、移動コストを更新する（ステップＳ１７）。ステップＳ１７では、情報更新部３２ｂは、充電ステーションの位置ｘにおける移動コストｙをステップＳ１５またはステップＳ１６での処理を受けて更新し、容量制約を考慮した基本情報に補正する。 Following step S15 or step S16, the travel cost is updated (step S17). In step S17, the information update unit 32b updates the travel cost y at the charging station position x following the processing in step S15 or step S16, and corrects it to basic information that takes capacity constraints into account.

次に、基準値以下の移動コストを取得したか否かを判定する（ステップＳ１８）。ステップＳ１８では、評価判定部３３は、ステップＳ１７において更新された移動コストがあらかじめ設定されている基準値以下であるか否かを判定する。基準値以下の移動コストｙを取得したと評価判定部３３が判定する場合（ステップＳ１８：ＹＥＳ）、今回の最適化方法を終了する。基準値以下の移動コストｙを取得していないと評価判定部３３が判定する場合（ステップＳ１８：ＮＯ）、ステップＳ１９に移行する。 Next, it is determined whether a movement cost equal to or less than a reference value has been acquired (step S18). In step S18, the evaluation and determination unit 33 determines whether the movement cost updated in step S17 is equal to or less than a preset reference value. If the evaluation and determination unit 33 determines that a movement cost y equal to or less than the reference value has been acquired (step S18: YES), the current optimization method is terminated. If the evaluation and determination unit 33 determines that a movement cost y equal to or less than the reference value has not been acquired (step S18: NO), the process proceeds to step S19.

ステップＳ１８において、基準値以下の移動コストｙを取得していないと判定されると、ベイズ推定における係数行列を推定する（ステップＳ１９）。ステップＳ１９では、学習部３４は、ベイズ推定による以下の式（１）の係数行列Ａを推定する。これにより、学習部３４は、容量制約を学習した新たな基本情報を取得する。学習部３４が取得する新たな基本情報には、ステップＳ１３からＳ１７における判定の対象とならなかった組合せに関する情報も含まれている。本実施形態では、学習部３４は、二次制約なし二値変数最適化（ＱＵＢＯ：ＱｕａｄｒａｔｉｃＵｎｃｏｎｓｔｒａｉｎｅｄＢｉｎａｒｙＯｐｔｉｍｉｚａｔｉｏｎ）形式となっている、新たな基本情報を取得することができる。
If it is determined in step S18 that a travel cost y equal to or less than the reference value has not been acquired, the coefficient matrix for Bayesian estimation is estimated (step S19). In step S19, the learning unit 34 estimates the coefficient matrix A of the following equation (1) using Bayesian estimation. As a result, the learning unit 34 acquires new basic information that has learned the capacity constraints. The new basic information acquired by the learning unit 34 also includes information about combinations that were not subject to the determination in steps S13 to S17. In this embodiment, the learning unit 34 can acquire new basic information in the form of quadratic unconstrained binary optimization (QUBO).

次に、充電ステーションの新たな位置ｘを決定する（ステップＳ２０）。具体的には、データ取得部３１は、ステップＳ１９において推定された係数行列Ａを用いて、充電ステーションの新たな位置ｘを決定する。 Next, a new position x of the charging station is determined (step S20). Specifically, the data acquisition unit 31 determines the new position x of the charging station using the coefficient matrix A estimated in step S19.

ステップＳ２０の後、ステップＳ２０で決定された充電ステーションの新たな位置ｘにおいて、容量制約なしの最適経路問題を解く（ステップＳ１２）。ステップＳ１２では、データ取得部３１は、新たに決定された充電ステーションの位置ｘにおいて、ステップＳ２０で取得した新たな基本情報を用いて電気自動車の移動経路問題を解く。本実施形態では、２回目以降のステップＳ１２においては、イジングマシンなどの二次制約なし二値変数最適化ソルバーを用いて、最適化された組合せのデータを取得する。これにより、データ取得部３１は、学習部３４が取得する、容量制約を学習した新たな基本情報を用いて、最適化された組合せのデータを新たに取得する。 After step S20, an optimal route problem without capacity constraints is solved at the new charging station position x determined in step S20 (step S12). In step S12, the data acquisition unit 31 solves the electric vehicle travel route problem at the newly determined charging station position x using the new basic information acquired in step S20. In this embodiment, in the second and subsequent iterations of step S12, optimized combination data is acquired using a binary variable optimization solver without quadratic constraints, such as an Ising machine. As a result, the data acquisition unit 31 acquires new optimized combination data using the new basic information acquired by the learning unit 34 that has learned the capacity constraints.

本実施形態の最適化方法では、このようにして、ステップＳ１２からステップＳ２０までを繰り返し行い、ステップＳ１８において、移動コストｙが基準値以下となる最適移動経路ｚを、最終的な最適移動経路ｚとして決定する。決定された電気自動車の最適移動経路は、出力部４０を用いて出力される。 In this manner, the optimization method of this embodiment repeats steps S12 to S20, and in step S18, the optimal travel route z for which the travel cost y is equal to or less than the reference value is determined as the final optimal travel route z. The determined optimal travel route for the electric vehicle is output using the output unit 40.

図６は、本実施形態の最適化方法の効果を説明する図である。図６には、図５で説明した最適化方法における、電気自動車の移動コストと、充電ステーションの位置の探索回数との関係について、充電ステーションを３カ所に置いた場合の計算結果を示している。図６の横軸には、充電ステーションの位置の探索回数（ステップＳ２０を実行した回数）を示し、縦軸には、それぞれの回数の探索を行ったときに求められた、電気自動車の移動コストを示している。図６には、ステップＳ１４での判定において、容量制約を違反した場合には、評価値に罰則値として１０を加えた場合の結果を示している（ステップＳ１６参照）。図６に示すように、おおよそ５０回程度の探索で容量制約に違反しない解が見つかることがわかる。さらに、１３０回程度の探索で容量制約の違反がない移動コストとして、７．６９の解が見つかることが分かる。 Figure 6 is a diagram illustrating the effects of the optimization method of this embodiment. Figure 6 shows the calculation results for the relationship between the travel cost of an electric vehicle and the number of searches for charging station locations, using the optimization method described in Figure 5, when three charging stations are located. The horizontal axis of Figure 6 shows the number of searches for charging station locations (the number of times step S20 was executed), and the vertical axis shows the travel cost of the electric vehicle calculated for each search. Figure 6 also shows the results when a penalty value of 10 is added to the evaluation value if the capacity constraint is violated in the determination made in step S14 (see step S16). As shown in Figure 6, it can be seen that a solution that does not violate the capacity constraint can be found after approximately 50 searches. Furthermore, it can be seen that a solution with a travel cost of 7.69 that does not violate the capacity constraint can be found after approximately 130 searches.

図７は、本実施形態の最適化方法を説明する概略模式図である。図７に示す概略模式図は、ＢａｙｅｓｉａｎＯｐｔｉｍｉｚａｔｉｏｎｏｆＣｏｍｂｉｎａｔｏｒｉａｌＳｔｒｕｃｔｕｒｅ（以下、「ＢＯＣＳ」という）を利用した学習プロセスの模式図を示している。図７の概略模式図で表されている最適化方法では、組合せを表す二値変数のビット列である入力ｘと出力ｚとの関係が不明となっている。図７では、入力ｘに対する出力ｚの関係が不明な系をｚ＝ｆ（ｘ）として示しており、図３～図６で説明した電気自動車の移動経路問題がこれに当たる。 Figure 7 is a schematic diagram illustrating the optimization method of this embodiment. The schematic diagram shown in Figure 7 illustrates a learning process using Bayesian Optimization of Combinatorial Structure (hereinafter referred to as "BOCS"). In the optimization method illustrated in the schematic diagram in Figure 7, the relationship between input x, which is a bit string of binary variables representing combinations, and output z is unknown. In Figure 7, a system in which the relationship between input x and output z is unknown is shown as z = f(x), and this corresponds to the electric vehicle travel route problem described in Figures 3 to 6.

図７に示す最適化方法では、入力ｘに対する出力ｚの関係が不明な系に対して、最初に、ある入力ｘに対する出力ｚのデータの組を取得する（図７に示す矢印ＰＲ１）。図３～図６で説明した電気自動車の移動経路問題では、入力ｘは、充電ステーションの位置ｘに対応し、出力ｚは、電気自動車の最適移動経路ｚに対応する。 In the optimization method shown in Figure 7, for a system where the relationship between input x and output z is unknown, a data set of output z for a given input x is first obtained (arrow PR1 in Figure 7). In the electric vehicle travel route problem described in Figures 3 to 6, input x corresponds to the location x of the charging station, and output z corresponds to the optimal travel route z for the electric vehicle.

次に、取得した入力ｘ（充電ステーションの位置）に対する出力ｚ（電気自動車の最適移動経路）のデータの組を用いて、入力ｘと出力ｚとの関係をＱＵＢＯ（ＱｕａｄｒａｔｉｃＵｎｃｏｎｓｔｒａｉｎｅｄＢｉｎａｒｙＯｐｔｉｍｉｚａｔｉｏｎ）形式で学習する（図７に示す矢印ＰＲ２）。入力ｘと出力ｚの関係をＱＵＢＯ形式で学習するとき、出力ｚが制約条件を充足している否かを判定する。出力ｚが制約条件を充足していない場合、制約条件に対応する罰則値を用いて入力ｘと出力ｚとの関係に更新することで、制約条件を学習することができる。図３～図６で説明した電気自動車の移動経路問題では、電気自動車が備えるバッテリの電気容量に関する容量制約の違反があった場合に評価値と罰則値との合計を電気自動車の移動コストとすることが、容量制約を学習することにあたる。 Next, using the acquired data set of input x (location of charging stations) and output z (optimal electric vehicle travel route), the relationship between input x and output z is learned in QUBO (Quadratic Unconstrained Binary Optimization) format (arrow PR2 in Figure 7). When learning the relationship between input x and output z in QUBO format, it is determined whether output z satisfies the constraint. If output z does not satisfy the constraint, the constraint can be learned by updating the relationship between input x and output z using a penalty value corresponding to the constraint. In the electric vehicle travel route problem described in Figures 3 to 6, learning the capacity constraint means that if there is a violation of the capacity constraint related to the electric capacity of the battery equipped in the electric vehicle, the sum of the evaluation value and the penalty value is set as the travel cost of the electric vehicle.

次に、学習された関係の下、最適な出力ｚを与える次の入力ｘの候補を探索する（図７に示す矢印ＰＲ３）。最適な入力候補を探索する場合、入力ｘと出力ｚの関係をＱＵＢＯ形式で学習しているため、シミュレーテッドアニーリング（ＳＡ：ＳｉｍｕｌａｔｅｄＡｎｎｅａｌｉｎｇ）や量子アニーリング（ＱＡ：ＱｕａｎｔｕｍＡｎｎｅａｌｉｎｇ）を利用することができる。図３～図６で説明した電気自動車の移動経路問題では、ステップＳ２０での充電ステーションの新たな位置の探索がこれにあたる。このように、ＢＯＣＳを利用した制限条件の学習プロセスでは、矢印ＰＲ１から矢印ＰＲ３までのサイクルを探索回数１回として探索を繰り返し、入力ｘに対する出力ｚのデータの組が得られるたびに係数行列Ａを更新する。これにより、より高い精度で最適な出力ｚを与える入力ｘを見つけることができる。なお、ＢＯＣＳを利用した制限条件の学習プロセスでは、事前に設定した回数の探索を逐次的に繰り返す。 Next, based on the learned relationship, a search is performed for the next input x candidate that provides the optimal output z (arrow PR3 in Figure 7). When searching for the optimal input candidate, simulated annealing (SA) or quantum annealing (QA) can be used because the relationship between input x and output z is learned using the QUBO format. In the electric vehicle routing problem described in Figures 3 to 6, this corresponds to the search for a new charging station location in step S20. In this way, in the constraint condition learning process using BOCS, the search is repeated, with the cycle from arrow PR1 to arrow PR3 counting as one search, and the coefficient matrix A is updated each time a data set of output z for input x is obtained. This makes it possible to find the input x that provides the optimal output z with greater accuracy. Note that in the constraint condition learning process using BOCS, searches are sequentially repeated a predetermined number of times.

以上説明した、本実施形態の最適化装置１によれば、補正部３２は、データ取得部３１が取得する最適化された組合せのデータと制約情報とを用いて、基本情報を補正する。学習部３４は、補正部３２が補正した基本情報を用いて、制約条件を学習した新たな基本情報を取得する。データ取得部３１は、制約条件を学習した新たな基本情報を用いて、最適化された組合せのデータを新たに取得する。これにより、定式化が困難であり、最適化された組合せのデータを取得する際に基本情報として情報を含めにくい制約条件が課されている組合せ最適化問題であっても、制約条件を満たす組合せの中から最適化された組合せを求めることができる。また、補正部３２は、データ取得部３１が取得する、最適化された組合せの新たなデータと制約情報とを用いて、新たな基本情報を補正し、学習部３４は、補正部３２が補正した新たな基本情報を用いて、制約条件をさらに学習した、さらに新たな基本情報を取得する。このように、データ取得部３１による最適化された組合せのデータの取得と、学習部３４による制約条件を学習した新たな基本情報の取得とを逐次的に繰り返すことで、制約条件を満たす組合せの中でも、さらに最適化された組合せを求めることができる。したがって、定式化が困難な制約条件が課せられている組合せ最適化問題において最適化された組合せを求めることができる。 According to the optimization device 1 of this embodiment described above, the correction unit 32 corrects the basic information using the optimized combination data and constraint information acquired by the data acquisition unit 31. The learning unit 34 uses the basic information corrected by the correction unit 32 to acquire new basic information that has learned the constraint conditions. The data acquisition unit 31 acquires new data of the optimized combination using the new basic information that has learned the constraint conditions. This makes it possible to find an optimized combination from among combinations that satisfy the constraint conditions, even for a combinatorial optimization problem that is difficult to formulate and has constraint conditions that make it difficult to include information as basic information when acquiring data of the optimized combination. Furthermore, the correction unit 32 corrects the new basic information using the new data of the optimized combination and constraint information acquired by the data acquisition unit 31, and the learning unit 34 uses the new basic information corrected by the correction unit 32 to acquire new basic information that has further learned the constraint conditions. In this way, by sequentially repeating the acquisition of data on optimized combinations by the data acquisition unit 31 and the acquisition of new basic information that has learned the constraints by the learning unit 34, it is possible to find even more optimized combinations from among the combinations that satisfy the constraints. Therefore, it is possible to find optimized combinations for combinatorial optimization problems that are subject to constraints that are difficult to formulate.

また、本実施形態の最適化装置１によれば、補正部３２は、データ取得部３１が取得する最適化された組合せが制約条件を満たすか否かによって基本情報を更新し、学習部３４は、データ取得部３１が最適化された組合せの新たなデータを取得するため、制約条件を学習した新たな基本情報を取得する。これにより、制約条件を満たす組合せの中でも、さらに最適化された組合せを求めることができる。 Furthermore, according to the optimization device 1 of this embodiment, the correction unit 32 updates the basic information depending on whether the optimized combination acquired by the data acquisition unit 31 satisfies the constraint conditions, and the learning unit 34 acquires new basic information that has learned the constraint conditions in order to acquire new data for the combination optimized by the data acquisition unit 31. This makes it possible to find a more optimized combination from among the combinations that satisfy the constraint conditions.

また、本実施形態の最適化装置１によれば、情報更新部３２ｂは、最適化された組合せが制約条件を満たしていないと条件判定部３２ａが判定する場合、制約条件に対応する罰則値を用いて最適化された組合せの評価値を変更するため、新たな基本情報に、制約情報を確実に含めることができる。これにより、制約条件を満たす組合せの中でも、さらに最適化された組合せを求めることができる。 Furthermore, according to the optimization device 1 of this embodiment, when the condition determination unit 32a determines that the optimized combination does not satisfy the constraint conditions, the information update unit 32b changes the evaluation value of the optimized combination using a penalty value corresponding to the constraint conditions, thereby ensuring that the constraint information is included in the new basic information. This makes it possible to find a more optimized combination from among the combinations that satisfy the constraint conditions.

また、本実施形態の最適化装置１によれば、学習部３４は、評価値に罰則値を加算した値が、あらかじめ設定されている基準点以下であるか否かの判定に従って、制約条件を学習した新たな基本情報を取得し、データ取得部３１は、新たな基本情報を用いて、最適化された組合せの新たなデータを取得する。これにより、最適化が一定の水準となった組合せのデータが取得されると、最適化装置１による組合せ最適化問題の解決を終了することができるため、最適化された組合せのデータの取得に要する時間を短くすることができる。 Furthermore, according to the optimization device 1 of this embodiment, the learning unit 34 acquires new basic information that learns the constraint conditions in accordance with a determination of whether the value obtained by adding the penalty value to the evaluation value is equal to or less than a preset reference point, and the data acquisition unit 31 acquires new data on optimized combinations using the new basic information. As a result, once data on combinations for which optimization has reached a certain level has been acquired, the optimization device 1 can end its solution of the combinatorial optimization problem, thereby shortening the time required to acquire data on optimized combinations.

また、本実施形態の最適化装置１によれば、学習部３４は、二次制約なし二値変数最適化形式となっている、新たな基本情報を取得する。これにより、最適化された組合せのデータの取得において、シミュレーテッドアニーリングや量子アニーリングなどのイジングマシンを用いることができる。 Furthermore, according to the optimization device 1 of this embodiment, the learning unit 34 acquires new basic information in the form of binary variable optimization without quadratic constraints. This makes it possible to use Ising machines such as simulated annealing and quantum annealing to acquire data on optimized combinations.

また、本実施形態の最適化装置１によれば、データ取得部３１は、イジングマシンなどの二次制約なし二値変数最適化ソルバーを用いて、最適化された組合せのデータを取得する。これにより、最適化された組合せのデータの取得に要する時間を短くすることができる。 Furthermore, according to the optimization device 1 of this embodiment, the data acquisition unit 31 acquires data on optimized combinations using a binary variable optimization solver without quadratic constraints, such as an Ising machine. This reduces the time required to acquire data on optimized combinations.

また、本実施形態の最適化方法によれば、図２に示すように、ステップＳ４からステップＳ７までにおいて、ステップＳ２において取得される最適化された組合せのデータと制約情報とを用いて、基本情報を補正する。ステップＳ９において、ステップＳ４からステップＳ７までにおいて補正された基本情報を用いて、制約条件を学習した新たな基本情報を取得し、ステップＳ１０およびステップＳ１０の次のステップＳ２において、制約条件を学習した新たな基本情報を用いて、最適化された組合せのデータを新たに取得する。これにより、例えば、定式化が困難な制約条件を満たす組合せの中から最適化された組合せを求めることができる。また、２回目以降のステップＳ９では、ステップＳ１０および２回目以降のステップＳ２において取得する最適化された組合せの新たなデータと、制約情報とを用いて補正された新たな基本情報を用いて、制約条件をさらに学習した、さらに新たな基本情報を取得する。このように、ステップＳ１０およびステップＳ１０の次のステップＳ２における最適化された組合せのデータの取得と、ステップＳ９における制約条件を学習した基本情報の取得とを逐次的に繰り返すことで、制約条件を満たす組合せの中でも、さらに最適化された組合せを求めることができる。 Furthermore, according to the optimization method of this embodiment, as shown in FIG. 2, in steps S4 to S7, basic information is corrected using the optimized combination data and constraint information acquired in step S2. In step S9, new basic information with constraint conditions learned is acquired using the basic information corrected in steps S4 to S7. Then, in step S10 and the step S2 following step S10, new data for the optimized combination is acquired using the new basic information with constraint conditions learned. This makes it possible to determine, for example, optimized combinations from combinations that satisfy constraint conditions that are difficult to formulate. Furthermore, in the second and subsequent steps of step S9, new basic information with further constraint conditions learned is acquired using the new data for the optimized combination acquired in step S10 and the step S2 following step S10 and new basic information corrected using the constraint information. In this way, by sequentially repeating the acquisition of optimized combination data in step S10 and the step S2 following step S10, and the acquisition of basic information with constraint conditions learned in step S9, it is possible to determine even more optimized combinations from among combinations that satisfy constraint conditions.

また、本実施形態のコンピュータプログラムによれば、コンピュータは、補正機能が、第１データ取得機能が取得する最適化された組合せのデータと制約情報とを用いて、基本情報を補正する。学習機能は、補正機能によって補正された基本情報を用いて、制約条件を学習した新たな基本情報を取得し、第２データ取得機能は、制約条件を学習した新たな基本情報を用いて、最適化された組合せのデータを新たに取得する。これにより、例えば、定式化が困難な制約条件を満たす組合せの中から最適化された組合せを求めることができる。また、第２データ取得機能によって取得する、最適化された組合せの新たなデータと制約情報とを用いて、学習機能では、制約条件をさらに学習した、さらに新たな基本情報を取得する。このように、最適化された組合せのデータの取得と、制約条件を学習した基本情報の取得とを逐次的に繰り返すことで、制約条件を満たす組合せの中でも、さらに最適化された組合せを求めることができる。 Furthermore, according to the computer program of this embodiment, the correction function corrects the basic information using the data of the optimized combination acquired by the first data acquisition function and the constraint information. The learning function uses the basic information corrected by the correction function to acquire new basic information that has learned the constraint conditions, and the second data acquisition function uses the new basic information that has learned the constraint conditions to acquire new data of the optimized combination. This makes it possible, for example, to determine an optimized combination from among combinations that satisfy constraint conditions that are difficult to formulate. Furthermore, using the new data of the optimized combination acquired by the second data acquisition function and the constraint information, the learning function acquires further new basic information that has further learned the constraint conditions. In this way, by sequentially repeating the acquisition of data of the optimized combination and the acquisition of basic information that has learned the constraint conditions, it is possible to determine a further optimized combination from among combinations that satisfy the constraint conditions.

＜本実施形態の変形例＞
本発明は上記の実施形態に限られるものではなく、その要旨を逸脱しない範囲において種々の態様において実施することが可能であり、例えば次のような変形も可能である。 <Modification of this embodiment>
The present invention is not limited to the above-described embodiment, and can be embodied in various forms without departing from the spirit of the invention. For example, the following modifications are also possible.

［変形例１］
上述の実施形態では、補正部３２は、データ取得部３１が取得する最適化された組合せが制約情報に含まれる制約条件を満たすか否かを判定する条件判定部３２ａと、条件判定部３２ａにおける判定結果に応じて基本情報を更新する情報更新部３２ｂと、を有するとした。補正部３２の構成は、これに限定されない。データ取得部３１によって取得される最適化された組合せのデータと制約情報とを用いて、制約条件を考慮して基本情報を補正できればよい。 [Modification 1]
In the above-described embodiment, the correction unit 32 includes a condition determination unit 32a that determines whether the optimized combination acquired by the data acquisition unit 31 satisfies the constraint conditions included in the constraint information, and an information update unit 32b that updates the basic information in accordance with the determination result of the condition determination unit 32a. The configuration of the correction unit 32 is not limited to this. It is sufficient if the basic information can be corrected in consideration of the constraint conditions using the data of the optimized combination acquired by the data acquisition unit 31 and the constraint information.

［変形例２］
上述の実施形態では、図２に示すように、評価判定部３２ｃによる判定の結果を受けて、ステップＳ９において、ベイズ推定によって入力ｘと評価値ｙとの関係を学習し、ステップＳ１０において、ステップＳ９で推定された関係のもと、最適な入力と推定される新たな入力ｘを推定するとした。新たな入力ｘを推定する方法は、これに限定されない。 [Modification 2]
2, in the above-described embodiment, in response to the result of the determination by the evaluation/determination unit 32c, the relationship between the input x and the evaluation value y is learned by Bayesian estimation in step S9, and in step S10, a new input x that is estimated to be the optimal input is estimated based on the relationship estimated in step S9. The method of estimating the new input x is not limited to this.

［変形例３］
上述の実施形態では、学習部３４は、二次制約なし二値変数最適化形式となっている、新たな基本情報を取得するとした。しかしながら、学習部３４による学習形式は、これに限定されない。 [Modification 3]
In the above embodiment, the learning unit 34 acquires new basic information in the form of binary variable optimization without quadratic constraints. However, the learning format of the learning unit 34 is not limited to this.

［変形例４］
上述の実施形態では、データ取得部３１は、イジングマシンなどの二次制約なし二値変数最適化ソルバーを用いて、最適化された組合せのデータを取得するとした。しかしながら、データ取得部３１による組合せのデータを取得する方法は、これに限定されない。 [Modification 4]
In the above-described embodiment, the data acquisition unit 31 acquires data of optimized combinations using a binary variable optimization solver without quadratic constraints, such as an Ising machine. However, the method for acquiring data of combinations by the data acquisition unit 31 is not limited to this.

以上、実施形態、変形例に基づき本態様について説明してきたが、上記した態様の実施の形態は、本態様の理解を容易にするためのものであり、本態様を限定するものではない。本態様は、その趣旨並びに特許請求の範囲を逸脱することなく、変更、改良され得るとともに、本態様にはその等価物が含まれる。また、その技術的特徴が本明細書中に必須なものとして説明されていなければ、適宜、削除することができる。 This aspect has been described above based on embodiments and variations, but the above-described embodiments are intended to facilitate understanding of this aspect and are not intended to limit this aspect. This aspect may be modified or improved without departing from its spirit or the scope of the claims, and equivalents are included in this aspect. Furthermore, if a technical feature is not described as essential in this specification, it may be deleted as appropriate.

＜適用例１＞
制約条件が課されている組合せ最適化問題を解くための最適化装置であって、
組合せの基本構成要素に関する基本情報と、前記制約条件に関する制約情報とが入力される入力部と、
前記基本情報を用いて、最適化された組合せのデータを取得するデータ取得部と、
前記データ取得部が取得する最適化された組合せのデータと前記制約情報とを用いて、前記基本情報を補正する補正部と、
前記補正部が補正した前記基本情報を用いて、前記制約条件を学習した新たな基本情報を取得する学習部と、を備え、
前記データ取得部は、前記学習部が取得する新たな基本情報を用いて、最適化された組合せのデータを新たに取得し、
前記補正部は、前記データ取得部によって取得された、最適化された組合せの新たなデータと、前記制約情報とを用いて、前記学習部が取得する新たな基本情報を補正し、
前記学習部は、前記補正部が補正した新たな基本情報を用いて、前記制約条件をさらに学習した、さらに新たな基本情報を取得する、
最適化装置。
＜適用例２＞
適用例１に記載の最適化装置であって、
前記補正部は、
前記データ取得部が取得する最適化された組合せが、前記制約条件を満たすか否かを判定する条件判定部と、
最適化された前記組合せが前記制約条件を満たしていないと前記条件判定部が判定する場合、前記基本情報を更新する情報更新部と、を含む、
最適化装置。
＜適用例３＞
適用例１または適用例２に記載の最適化装置であって、
前記データ取得部は、最適化された組合せの評価値を算出し、
前記情報更新部は、最適化された組合せが前記制約条件を満たしていないと前記条件判定部が判定する場合、前記制約条件に対応する罰則値を用いて前記評価値を変更するすることで、前記基本情報を更新する、
最適化装置。
＜適用例４＞
適用例１から適用例３のいずれか一例に記載の最適化装置は、さらに、
前記評価値に前記罰則値を加算した修正評価値が、あらかじめ設定されている基準点以下であるか否かを判定する評価判定部を備え、
前記学習部は、前記修正評価値が前記基準点より大きいと前記評価判定部が判定する場合、前記情報更新部によって更新された基本情報を用いて、前記制約条件を学習した新たな基本情報を取得する、
最適化装置。
＜適用例５＞
適用例１から適用例４のいずれか一例に記載の最適化装置であって、
前記学習部は、二次制約なし二値変数最適化形式となっている、新たな基本情報を取得する、
最適化装置。
＜適用例６＞
適用例１から適用例５のいずれか一例に記載の最適化装置であって、
前記データ取得部は、二次制約なし二値変数最適化ソルバーを用いて、最適化された組合せのデータを取得する、
最適化装置。
＜適用例７＞
最適化装置を用いて制約条件が課されている組合せ最適化問題を解く最適化方法であって、
組合せの基本構成要素に関する基本情報と、前記制約条件に関する制約情報とが入力される入力工程と、
前記基本情報を用いて、最適化された組合せのデータを取得する第１データ取得工程と、
前記第１データ取得工程において取得された、最適化された組合せのデータと、前記制約情報とを用いて、前記基本情報を補正する補正工程と、
前記補正工程において補正された基本情報を用いて、前記制約条件を学習した新たな基本情報を取得する学習工程と、
前記学習工程において取得された新たな基本情報を用いて、最適化された組合せのデータを新たに取得する第２データ取得工程と、を備える、
最適化方法。
＜適用例８＞
制約条件が課されている組合せ最適化問題の解決をコンピュータに実行させるコンピュータプログラムであって、
組合せの基本構成要素に関する基本情報と、前記制約条件に関する制約情報とが入力される入力機能と、
前記基本情報を用いて、最適化された組合せのデータを取得する第１データ取得機能と、
前記第１データ取得機能が取得する最適化された組合せのデータと、前記制約情報とを用いて、前記基本情報を補正する補正機能と、
前記補正機能によって補正された基本情報を用いて、前記制約条件を学習した新たな基本情報を取得する学習機能と、
前記学習機能が取得する新たな基本情報を用いて、最適化された組合せのデータを新たに取得する第２データ取得機能と、をコンピュータに実行させる、
コンピュータプログラム。 <Application Example 1>
An optimization device for solving a combinatorial optimization problem with constraints, comprising:
an input unit into which basic information regarding basic components of a combination and constraint information regarding the constraint conditions are input;
a data acquisition unit that acquires data of an optimized combination using the basic information;
a correction unit that corrects the basic information using the optimized combination data acquired by the data acquisition unit and the constraint information;
a learning unit that acquires new basic information that has learned the constraint conditions using the basic information corrected by the correction unit,
the data acquisition unit acquires new data of an optimized combination using the new basic information acquired by the learning unit;
the correction unit corrects the new basic information acquired by the learning unit using the new data of the optimized combination acquired by the data acquisition unit and the constraint information;
the learning unit further learns the constraint conditions using the new basic information corrected by the correction unit, and acquires further new basic information.
Optimizer.
<Application Example 2>
The optimization device according to Application Example 1,
The correction unit
a condition determination unit that determines whether the optimized combination acquired by the data acquisition unit satisfies the constraint condition;
an information update unit that updates the basic information when the condition determination unit determines that the optimized combination does not satisfy the constraint condition,
Optimizer.
<Application Example 3>
The optimization device according to Application Example 1 or Application Example 2,
the data acquisition unit calculates an evaluation value of the optimized combination;
when the condition determination unit determines that the optimized combination does not satisfy the constraint condition, the information update unit updates the basic information by changing the evaluation value using a penalty value corresponding to the constraint condition.
Optimizer.
<Application Example 4>
The optimization device according to any one of Application Examples 1 to 3 further comprises:
an evaluation determination unit that determines whether a corrected evaluation value obtained by adding the penalty value to the evaluation value is equal to or less than a predetermined reference score;
When the evaluation determination unit determines that the corrected evaluation value is greater than the reference point, the learning unit uses the basic information updated by the information update unit to obtain new basic information that has learned the constraint condition.
Optimizer.
<Application Example 5>
The optimization device according to any one of Application Examples 1 to 4,
The learning unit acquires new basic information in the form of a binary variable optimization without quadratic constraints.
Optimizer.
<Application Example 6>
The optimization device according to any one of Application Examples 1 to 5,
the data acquisition unit acquires data of the optimized combination using a binary variable optimization solver without quadratic constraints;
Optimizer.
<Application Example 7>
An optimization method for solving a combinatorial optimization problem with constraints using an optimization device, comprising:
an input step of inputting basic information on basic components of a combination and constraint information on the constraint conditions;
a first data acquisition step of acquiring data of an optimized combination using the basic information;
a correction step of correcting the basic information using the optimized combination data acquired in the first data acquisition step and the constraint information;
a learning step of acquiring new basic information by learning the constraint conditions using the basic information corrected in the correction step;
a second data acquisition step of acquiring new data of an optimized combination using new basic information acquired in the learning step,
Optimization methods.
<Application Example 8>
A computer program that causes a computer to solve a combinatorial optimization problem subject to constraints,
an input function for inputting basic information regarding the basic components of the combination and constraint information regarding the constraint conditions;
a first data acquisition function that acquires data of an optimized combination using the basic information;
a correction function that corrects the basic information using the optimized combination data acquired by the first data acquisition function and the constraint information;
a learning function that acquires new basic information that has learned the constraints using the basic information corrected by the correction function; and
a second data acquisition function that acquires new data of an optimized combination using new basic information acquired by the learning function;
Computer program.

１…最適化装置
１０…入力部
５０…送受信部
３１…データ取得部
３２…補正部
３２ａ…条件判定部
３２ｂ…情報更新部
３３…評価判定部
３４…学習部 DESCRIPTION OF SYMBOLS 1... Optimization device 10... Input unit 50... Transmitting/receiving unit 31... Data acquisition unit 32... Correction unit 32a... Condition determination unit 32b... Information update unit 33... Evaluation determination unit 34... Learning unit

Claims

An optimization device for solving a combinatorial optimization problem with constraints, comprising:
an input unit into which basic information regarding basic components of a combination and constraint information regarding the constraint conditions are input;
a data acquisition unit that acquires data of an optimized combination using the basic information;
a correction unit that corrects the basic information using the optimized combination data acquired by the data acquisition unit and the constraint information;
a learning unit that acquires new basic information that has learned the constraint conditions using the basic information corrected by the correction unit,
the data acquisition unit acquires new data of an optimized combination using the new basic information acquired by the learning unit;
the correction unit corrects the new basic information acquired by the learning unit using the new data of the optimized combination acquired by the data acquisition unit and the constraint information;
the learning unit further learns the constraint conditions using the new basic information corrected by the correction unit, and acquires further new basic information.
Optimizer.

2. The optimization device according to claim 1,
The correction unit
a condition determination unit that determines whether the optimized combination acquired by the data acquisition unit satisfies the constraint condition;
an information update unit that updates the basic information when the condition determination unit determines that the optimized combination does not satisfy the constraint condition,
Optimizer.

3. The optimization device according to claim 2,
the data acquisition unit calculates an evaluation value of the optimized combination;
when the condition determination unit determines that the optimized combination does not satisfy the constraint condition, the information update unit updates the basic information by changing the evaluation value using a penalty value corresponding to the constraint condition.
Optimizer.

The optimization device according to claim 3 further comprises:
an evaluation determination unit that determines whether a corrected evaluation value obtained by adding the penalty value to the evaluation value is equal to or less than a predetermined reference score;
When the evaluation determination unit determines that the corrected evaluation value is greater than the reference point, the learning unit uses the basic information updated by the information update unit to obtain new basic information that has learned the constraint condition.
Optimizer.

3. The optimization device according to claim 1 or 2,
The learning unit acquires new basic information in the form of a binary variable optimization without quadratic constraints.
Optimizer.

3. The optimization device according to claim 1 or 2,
the data acquisition unit acquires data of the optimized combination using a binary variable optimization solver without quadratic constraints;
Optimizer.

An optimization method for solving a combinatorial optimization problem with constraints using an optimization device, comprising:
an input step of inputting basic information on basic components of a combination and constraint information on the constraint conditions;
a first data acquisition step of acquiring data of an optimized combination using the basic information;
a correction step of correcting the basic information using the optimized combination data acquired in the first data acquisition step and the constraint information;
a learning step of acquiring new basic information by learning the constraint conditions using the basic information corrected in the correction step;
a second data acquisition step of acquiring new data of an optimized combination using new basic information acquired in the learning step,
Optimization methods.

A computer program that causes a computer to solve a combinatorial optimization problem subject to constraints,
an input function for inputting basic information regarding the basic components of the combination and constraint information regarding the constraint conditions;
a first data acquisition function that acquires data of an optimized combination using the basic information;
a correction function that corrects the basic information using the optimized combination data acquired by the first data acquisition function and the constraint information;
a learning function that acquires new basic information that has learned the constraints using the basic information corrected by the correction function; and
a second data acquisition function that acquires new data of an optimized combination using new basic information acquired by the learning function;
Computer program.