JP7769271B2

JP7769271B2 - Imaging device, imaging method, and program

Info

Publication number: JP7769271B2
Application number: JP2024530193A
Authority: JP
Inventors: 稜白川; 陽光曽我部; 暁経三反崎; 正樹北原
Original assignee: Nippon Telegraph and Telephone Corp; NTT Inc USA
Current assignee: NTT Inc; NTT Inc USA
Priority date: 2022-06-30
Filing date: 2022-06-30
Publication date: 2025-11-13
Anticipated expiration: 2042-06-30
Also published as: WO2024004121A1; JPWO2024004121A1

Description

本発明は、撮像装置、撮像方法、およびプログラムの技術に関する。 The present invention relates to imaging devices, imaging methods, and program technologies.

従来、圧縮スペクトル撮像という圧縮センシング理論に基づくハイパースペクトル画像の測定技術がある。その実装手法の１つとして符号化開口マスクと分散光学素子とを組み合わせたＣＡＳＳＩ（Coded Aperture Snapshot Spectral Imaging）と呼ばれる技術がある（例えば非特許文献１参照。） Conventionally, there is a hyperspectral image measurement technology based on compressed sensing theory called compressed spectral imaging. One implementation method is a technology called CASSI (Coded Aperture Snapshot Spectral Imaging), which combines a coded aperture mask and a dispersive optical element (see, for example, Non-Patent Document 1).

また、このようなＣＡＳＳＩに関し、ハイパースペクトル画像の測定精度を向上させるための手法として、ＣＡＳＳＩ光学系内の符号化開口マスクのデザイン方法が提案されている（例えば非特許文献２参照）。 In addition, with regard to such CASSI, a method for designing a coded aperture mask within the CASSI optical system has been proposed as a technique for improving the measurement accuracy of hyperspectral images (see, for example, non-patent document 2).

また、一方で、ＣＡＳＳＩにおける符号化開口マスクのデザイン及び圧縮信号から元信号への再構成処理における最適化問題の定式化を工夫して、取得する波長域の情報を限定することにより測定精度を向上させる技術が提案されている（例えば非特許文献３参照）。 On the other hand, a technology has been proposed to improve measurement accuracy by limiting the wavelength range information acquired through devising the design of the coded aperture mask in CASSI and the formulation of the optimization problem in the reconstruction process from the compressed signal to the original signal (see, for example, non-patent document 3).

Compressive Coded Aperture Spectral Imaging: An Introduction [Arce et al., 2013]Compressive Coded Aperture Spectral Imaging: An Introduction [Arce et al., 2013] Compressive spectral imaging approach using adaptive coded apertures [Zhang et al., 2020]Compressive spectral imaging approach using adaptive coded apertures [Zhang et al., 2020] Rank minimization code aperture design for spectrally selective compressive imaging[Arguello et al., 2013]Rank minimization code aperture design for spectrally selective compressive imaging[Arguello et al., 2013]

一般に、上述のＣＡＳＳＩは（１）式のように定式化される。 Generally, the above-mentioned CASSI is formulated as equation (1).

（１）式において、ｇは圧縮信号を表し、ｆは元信号を表し、ΦはＣＡＳＳＩの観測過程を表す観測行列（圧縮行列ともいう）である。このように、圧縮センシングにおいて圧縮信号を得るための観測過程（観測行列）は、後段の元信号推定の性能と密接に関係する。 In equation (1), g represents the compressed signal, f represents the original signal, and Φ is an observation matrix (also called a compression matrix) that represents the CASSI observation process. In this way, the observation process (observation matrix) for obtaining the compressed signal in compressed sensing is closely related to the performance of the subsequent original signal estimation.

しかしながら、ＣＡＳＳＩの観測過程における設計の自由度は、符号化開口マスクのデザインのみである。そのため、元信号再構成のための理想的な観測行列を設計することは困難である。ＣＡＳＳＩの観測過程（Φ）は、符号化→分散→積算の処理であるが、分散と積算の処理は光学素子に固有の処理であり、設計の自由度はないためである。However, the only degree of design freedom in the CASSI observation process is the design of the coded aperture mask. Therefore, it is difficult to design an ideal observation matrix for reconstructing the original signal. The CASSI observation process (Φ) is a process of encoding → dispersion → accumulation, but the dispersion and accumulation processes are inherent to optical elements and there is no degree of design freedom.

また、取得する波長域を限定する従来技術では、各波長域を取得するか否かの２択でありそれぞれの重要度の大小を考慮することができなかった。加えて、波長域の限定は入力信号に対して一様に行われるため、空間的に異なる領域について異なる波長域の情報を取得することはできなかった。 Furthermore, conventional technologies that limit the wavelength range to be acquired only allow for the choice of whether or not to acquire each wavelength range, and are unable to consider the importance of each. In addition, because the wavelength range is limited uniformly for the input signal, it is not possible to acquire information on different wavelength ranges for spatially different regions.

また、従来技術では、取得する波長域を限定するために、限定した波長域の情報のみを含む画素を圧縮信号から抽出して再構成問題を解くため、利用可能な圧縮信号の要素数が削減され、再構成精度が低下することが想定される。そのため、従来は、複数回の撮影により圧縮信号の情報を補足することで再構成精度の低下を抑えるというアプローチがとられているが、そのデメリットとして測定にかかるコストが増加してしまう。 In addition, in conventional technology, in order to limit the wavelength range to be acquired, pixels containing only information from the limited wavelength range are extracted from the compressed signal to solve the reconstruction problem, which reduces the number of available compressed signal elements and is expected to reduce reconstruction accuracy. Therefore, a conventional approach has been to suppress the decline in reconstruction accuracy by supplementing compressed signal information through multiple captures, but the disadvantage is that the cost of measurement increases.

上記事情に鑑み、本発明は、ＣＡＳＳＩによるハイパースペクトル画像の測定精度を向上させることができる技術の提供を目的としている。 In consideration of the above circumstances, the present invention aims to provide technology that can improve the measurement accuracy of hyperspectral images using CASSI.

本発明の一態様は、圧縮センシングによるハイパースペクトル画像の測定を行う撮像装置であって、入力信号に符号化開口マスクを作用させることで入力信号を符号化して出力する符号化部と、前記符号化部によって符号化された前記入力信号を波長分散して出力する分散部と、前記分散部により波長分散された前記入力信号を撮像する計測部と、を備えるＣＡＳＳＩ観測系と、前記ＣＡＳＳＩ観測系が撮像する画素ごとに注目度合いの重みを設定する注目重み設定部と、前記注目重み設定部によって設定された前記画素ごとの注目度合いの重みに基づいて前記符号化開口マスクを生成するマスク生成部と、を備える撮像装置である。 One aspect of the present invention is an imaging device that measures hyperspectral images using compressed sensing, and includes a CASSI observation system that includes: an encoding unit that encodes and outputs an input signal by applying a coded aperture mask to the input signal; a dispersing unit that wavelength-disperses and outputs the input signal coded by the encoding unit; and a measurement unit that captures the input signal wavelength-dispersed by the dispersing unit; an attention weight setting unit that sets an attention weight for each pixel captured by the CASSI observation system; and a mask generation unit that generates the coded aperture mask based on the attention weight for each pixel set by the attention weight setting unit.

本発明の一態様は、圧縮センシングによるハイパースペクトル画像の測定を行う撮像方法であって、入力信号に符号化開口マスクを作用させることで入力信号を符号化して出力する符号化部と、前記符号化部によって符号化された前記入力信号を波長分散して出力する分散部と、前記分散部により波長分散された前記入力信号を撮像する計測部と、を備えるＣＡＳＳＩ観測系により入力信号を撮像する撮像ステップと、前記ＣＡＳＳＩ観測系の出力信号を乗算マスクによって変調する変調ステップと、を有する撮像方法である。 One aspect of the present invention is an imaging method for measuring hyperspectral images using compressed sensing, comprising: an imaging step of imaging an input signal using a CASSI observation system that includes: an encoding unit that encodes the input signal by applying a coded aperture mask to the input signal and outputs the encoded signal; a dispersion unit that wavelength-disperses the input signal encoded by the encoding unit and outputs the wavelength-dispersed input signal; and a measurement unit that images the input signal wavelength-dispersed by the dispersion unit; and a modulation step of modulating the output signal of the CASSI observation system using a multiplication mask.

本発明の一態様は、コンピューターを、上記の撮像装置として機能させるためのプログラムである。 One aspect of the present invention is a program for causing a computer to function as the above-mentioned imaging device.

本発明により、ＣＡＳＳＩによるハイパースペクトル画像の測定精度を向上させることが可能となる。 This invention makes it possible to improve the measurement accuracy of hyperspectral images using CASSI.

第１実施形態の撮像装置１Ａの構成を示すブロック図である。1 is a block diagram showing a configuration of an imaging apparatus 1A according to a first embodiment. 第１実施形態の撮像装置１Ａによる処理の流れを示すイメージ図である。FIG. 2 is a conceptual diagram showing a processing flow by the imaging device 1A of the first embodiment. 第１実施形態の撮像装置１Ａが奏する効果の一例を示す図である。1A and 1B are diagrams illustrating an example of an effect achieved by the imaging device 1A of the first embodiment. 第２実施形態の撮像装置１Ｂの構成を示すブロック図である。FIG. 10 is a block diagram showing the configuration of an imaging device 1B according to a second embodiment. 第２実施形態の撮像装置１Ｂによる処理の流れを示すイメージ図である。FIG. 10 is a conceptual diagram showing a processing flow by an imaging device 1B according to a second embodiment. 第２実施形態の撮像装置１Ｂが符号化開口マスクおよび乗算マスクを生成する場合の処理の流れを示すイメージ図である。FIG. 10 is a conceptual diagram showing the flow of processing when the imaging device 1B of the second embodiment generates a coded aperture mask and a multiplication mask. 第２実施形態の撮像装置１Ｂにおいて、マスク生成部１６０が乗算マスクを生成し、再構成処理部１４０が再構成モデルを学習する流れを示すイメージ図である。FIG. 10 is a conceptual diagram showing a flow in which the mask generation unit 160 generates a multiplication mask and the reconstruction processing unit 140 learns a reconstruction model in the imaging device 1B of the second embodiment.

以下、本発明の実施形態について、図面を参照して詳細に説明する。
＜第１実施形態＞
図１は、第１実施形態の撮像装置１Ａの構成を示すブロック図である。撮像装置１Ａは、光学系１１０と、ＣＡＳＳＩ観測系１２０と、変調部１３０と、再構成処理部１４０とを備える。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.
First Embodiment
1 is a block diagram showing the configuration of an image pickup apparatus 1A according to the first embodiment. The image pickup apparatus 1A includes an optical system 110, a CASSI observation system 120, a modulation unit 130, and a reconstruction processing unit 140.

撮像装置１Ａは、ＣＰＵ（Central Processing Unit）等のプロセッサーとメモリーとを用いて構成される。撮像装置１Ａは、プロセッサーがプログラムを実行することによって、入力信号としての光を入力して、ハイパースペクトル画像の推定信号を出力する装置として機能する。撮像装置１Ａが備える各部のうち、ＣＡＳＳＩ観測系１２０の一部と、変調部１３０と、再構成処理部１４０とは、プロセッサーがプログラムを実行することによって実現される。なお、撮像装置１Ａの機能のうち電気的信号処理を実行する機能の一部または全部は、ＡＳＩＣ（Application Specific Integrated Circuit）やＰＬＤ（Programmable Logic Device）やＦＰＧＡ（Field Programmable Gate Array）等のハードウェアを用いて実現されても良い。上記のプログラムは、コンピューター読み取り可能な記録媒体に記録されても良い。コンピューター読み取り可能な記録媒体とは、例えばフレキシブルディスク、光磁気ディスク、ＲＯＭ、ＣＤ－ＲＯＭ、半導体記憶装置（例えばＳＳＤ：Solid State Drive）等の可搬媒体、コンピューターシステムに内蔵されるハードディスクや半導体記憶装置等の記憶装置である。上記のプログラムは、電気通信回線を介して送信されてもよい。The imaging device 1A is configured using a processor such as a CPU (Central Processing Unit) and memory. The imaging device 1A functions as a device that receives light as an input signal and outputs an estimated signal for a hyperspectral image by having the processor execute a program. Of the components of the imaging device 1A, part of the CASSI observation system 120, the modulation unit 130, and the reconstruction processing unit 140 are implemented by having the processor execute a program. Note that some or all of the functions of the imaging device 1A that perform electrical signal processing may be implemented using hardware such as an ASIC (Application Specific Integrated Circuit), a PLD (Programmable Logic Device), or an FPGA (Field Programmable Gate Array). The above program may be recorded on a computer-readable recording medium. Examples of computer-readable recording media include portable media such as flexible disks, magneto-optical disks, ROMs, CD-ROMs, and semiconductor storage devices (e.g., SSDs: Solid State Drives), as well as storage devices such as hard disks and semiconductor storage devices built into computer systems. The above program may also be transmitted via a telecommunications line.

光学系１１０は、画像を結像するレンズで構成される。光学系１１０は、複数のレンズを備えてもよい。 The optical system 110 is composed of a lens that forms an image. The optical system 110 may include multiple lenses.

ＣＡＳＳＩ観測系１２０は、ＣＡＳＳＩにより入力信号を符号化して計測する機能を有する。より具体的には、ＣＡＳＳＩ観測系１２０は、例えば、符号化部１２１と、分散部１２２と、計測部１２３とを備える。以下、ＣＡＳＳＩ観測系１２０の構成について説明するが、これは、第１実施形態および第２実施形態のＣＡＳＳＩ観測系１２０が、従来のＣＡＳＳＩ観測系と同様であってよいことを意図したものである。 The CASSI observation system 120 has the function of encoding and measuring an input signal using CASSI. More specifically, the CASSI observation system 120 includes, for example, an encoding unit 121, a dispersing unit 122, and a measuring unit 123. The configuration of the CASSI observation system 120 is described below, but this is intended to mean that the CASSI observation system 120 of the first and second embodiments may be similar to conventional CASSI observation systems.

符号化部１２１は、入力信号に符号化開口マスクを作用させることで符号化された入力信号を出力する。符号化開口マスクは、ＬＣＯＳ（Liquid Crystal On Silicon）やＤＭＤ（Digital Mirror Device）などである。符号化開口マスクは、０および１の離散値（以下｛０，１｝と表す場合がある）で構成されてもよいし、０から１までの連続値（以下［０，１］と表す場合がある）によって構成されてもよい。例えば、符号化開口マスクは、符号化開口を可変のパラメータとしてモデル化し、再構成モデルを含めたエンド・トゥ・エンドの入出力の最適化により決定する、というデータ駆動的なアプローチにより生成され得る。一方、このようなデータ駆動的なアプローチとは逆に、符号化開口マスクは、画像のスパース変換時の基底に対するインコヒーレンス性を利用した圧縮センシングの理論に基づく理論的なアプローチによって生成されてもよい。The encoding unit 121 applies a coded aperture mask to the input signal to output a coded input signal. The coded aperture mask may be a liquid crystal on silicon (LCOS) or digital mirror device (DMD). The coded aperture mask may be composed of discrete values of 0 and 1 (hereinafter sometimes represented as {0,1}) or continuous values ranging from 0 to 1 (hereinafter sometimes represented as [0,1]). For example, the coded aperture mask may be generated using a data-driven approach in which the coded aperture is modeled as a variable parameter and determined by end-to-end input/output optimization, including a reconstruction model. Alternatively, the coded aperture mask may be generated using a theoretical approach based on compressed sensing theory, which exploits the incoherence of the basis during sparse image transformation.

分散部１２２は、例えばプリズムである。分散部１２２は、符号化部１２１の出力信号を入力し、入力信号を波長分散して出力する。 The dispersion unit 122 is, for example, a prism. The dispersion unit 122 receives the output signal from the encoding unit 121, disperses the input signal by wavelength, and outputs the result.

計測部１２３は、例えば、ＦＰＡ（Focal Plane Array：焦点面アレイ）アレイセンサである。計測部１２３は、画素ごとの各センサが分散部１２２の出力信号を入力し、波長方向に積算して出力する。計測部１２３は、計測結果としての圧縮信号（画像）を出力する。 The measurement unit 123 is, for example, an FPA (Focal Plane Array) array sensor. The measurement unit 123 receives the output signal of the dispersion unit 122 from each sensor for each pixel, integrates it in the wavelength direction, and outputs it. The measurement unit 123 outputs a compressed signal (image) as the measurement result.

変調部１３０は、ＣＡＳＳＩ観測系１２０の出力信号を入力し、それに乗算マスクによる変調を施して出力する。例えば、乗算マスクは、上述の符号化開口マスクと同様に可変のパラメータとしてモデル化し、符号化開口マスクと再構成モデルを含めたエンド・トゥ・エンドの入出力の最適化により決定する、というデータ駆動的なアプローチにより生成され得る。The modulation unit 130 receives the output signal of the CASSI observation system 120, modulates it with a multiplication mask, and outputs the result. For example, the multiplication mask can be generated using a data-driven approach, in which it is modeled as a variable parameter, similar to the coded aperture mask described above, and determined by end-to-end input/output optimization including the coded aperture mask and the reconstruction model.

再構成処理部１４０は、変調部１３０の出力信号を入力し、それに再構成処理を施すことにより、ハイパースペクトル画像の推定結果（推定信号）を出力する。 The reconstruction processing unit 140 inputs the output signal of the modulation unit 130 and performs reconstruction processing on it, thereby outputting an estimated result (estimated signal) of the hyperspectral image.

図２は、撮像装置１Ａによる処理の流れを示すイメージ図である。図２は、各ステップの信号変換をパラメータ化（モデル化）した様子を表している。なお、図２の信号変換において、実線は、光学素子（プリズムやセンサ等）の固有のパラメータ値（固定）に基づく変換を表し、破線は、設計可能な可変パラメータに基づく変換を表している。 Figure 2 is an image diagram showing the processing flow by the imaging device 1A. Figure 2 shows how the signal conversion at each step is parameterized (modeled). Note that in the signal conversion in Figure 2, solid lines represent conversion based on the inherent parameter values (fixed) of optical elements (prisms, sensors, etc.), while dashed lines represent conversion based on designable variable parameters.

まず、撮像装置１Ａが、光学系１１０を介して、撮像対象空間内の光の波長データ（Spectral Data Cube）Ｄ０をＣＡＳＳＩ観測系１２０に入力する（ステップＳ１）。 First, the imaging device 1A inputs wavelength data (Spectral Data Cube) D0 of light in the imaging target space to the CASSI observation system 120 via the optical system 110 (step S1).

続いて、ＣＡＳＳＩ観測系１２０において、符号化部１２１が、入力した波長データ（入力信号）に対して符号化開口マスクＭ１を作用させることにより、符号化された波長データＤ１を分散部１２２に出力する（ステップＳ２）。この工程は、３Ｄ（空間および波長）の入力信号に対して各画素を符号化開口マスクで被覆することを意味する。Next, in the CASSI observation system 120, the encoding unit 121 applies a coded aperture mask M1 to the input wavelength data (input signal) and outputs the coded wavelength data D1 to the dispersion unit 122 (step S2). This process means that each pixel of the 3D (spatial and wavelength) input signal is covered with a coded aperture mask.

続いて、ＣＡＳＳＩ観測系１２０において、分散部１２２が、符号化された波長データＤ１を波長分散して計測部１２３に出力する（ステップＳ３）。この工程は、分散光学素子（プリズム）により波長軸を傾斜させることを意味するものである。 Next, in the CASSI observation system 120, the dispersion unit 122 wavelength-disperses the encoded wavelength data D1 and outputs it to the measurement unit 123 (step S3). This process involves tilting the wavelength axis using a dispersion optical element (prism).

続いて、ＣＡＳＳＩ観測系１２０において、計測部１２３が、波長分散された波長データＤ２を入力し、各画素が波長方向に波長データＤ２を積算することにより、対象空間の計測結果としての圧縮信号データＤ３を生成して変調部１３０に出力する（ステップＳ４）。この工程は、傾斜された各波長信号をピクセル毎に積算することを意味するものである。ここまでの工程により、２Ｄの圧縮信号が取得される。 Next, in the CASSI observation system 120, the measurement unit 123 inputs the wavelength-dispersed wavelength data D2 and accumulates the wavelength data D2 in the wavelength direction for each pixel, thereby generating compressed signal data D3 as the measurement result of the target space and outputting it to the modulation unit 130 (step S4). This process involves accumulating each tilted wavelength signal for each pixel. Through these steps, a 2D compressed signal is obtained.

続いて、変調部１３０が、ＣＡＳＳＩ観測系１２０から入力した２Ｄの圧縮信号データＤ３に対して乗算マスクによる変調を施すことによって変調後圧縮信号データＤ４を生成する（ステップＳ５）。変調部１３０は、生成した変調後圧縮信号データＤ４を再構成処理部１４０に出力する。Next, the modulation unit 130 generates modulated compressed signal data D4 by modulating the 2D compressed signal data D3 input from the CASSI observation system 120 using a multiplication mask (step S5). The modulation unit 130 outputs the generated modulated compressed signal data D4 to the reconstruction processing unit 140.

続いて、再構成処理部１４０が、変調部１３０から入力した変調後圧縮信号データＤ４に対して再構成処理を施すことにより、推定結果としてのハイパースペクトル画像ＩＭＧを生成して出力する（ステップＳ６）。例えば、再構成処理部１４０は、変調後圧縮信号データＤ４を再構成モデルに入力することにより再構成処理を実施する。Next, the reconstruction processing unit 140 performs reconstruction processing on the modulated compressed signal data D4 input from the modulation unit 130, thereby generating and outputting a hyperspectral image IMG as an estimation result (step S6). For example, the reconstruction processing unit 140 performs reconstruction processing by inputting the modulated compressed signal data D4 into a reconstruction model.

ここで、再構成モデルは、例えば、所望の信号データと、変調後圧縮信号データとの相関（すなわち観測行列）をニューラルネットワークなどの機械学習手法により予め学習することによって生成される。例えば、再構成モデルは、ＡＤＭＭ（Alternating Direction Method of Multiplier）やＩＳＴＡ（Iterative Shrinkage Thresholding Algorithm）等に代表される繰り返し最適化アルゴリズムを深層学習によって実装したＤＵＮ（Deep Unrolled Network）で構築され、符号化部１２１と変調部１３０を含めた可変パラメータをエンド・トゥ・エンドで最適化することにより学習され得る。 Here, the reconstruction model is generated by, for example, learning in advance the correlation between the desired signal data and the modulated compressed signal data (i.e., the observation matrix) using a machine learning method such as a neural network. For example, the reconstruction model can be constructed using a DUN (Deep Unrolled Network), which implements an iterative optimization algorithm such as ADMM (Alternating Direction Method of Multiplier) or ISTA (Iterative Shrinkage Thresholding Algorithm) using deep learning, and can be learned by end-to-end optimization of variable parameters including those of the encoding unit 121 and the modulation unit 130.

図３は、第１実施形態の撮像装置１Ａが奏する効果の一例を示す図である。図３は、乗算マスクを用いない（符号化開口マスクのみ）従来構成による画像の推定精度と、上記の乗算マスク（符号化開口マスクと乗算マスクの組み合わせ）を用いた本実施形態の構成による画像の推定精度との比較を示す。図３に示すグラフの縦軸はＰＳＮＲ（Peak Signal to-Noise Ratio：ピーク信号対雑音比）の強度である。図３からも明らかなように、本実施形態の推定結果の方が、従来構成よりもＰＳＮＲの値が大きく、画質が向上していることが分かる。 Figure 3 is a diagram showing an example of the effect achieved by the imaging device 1A of the first embodiment. Figure 3 shows a comparison between the image estimation accuracy using a conventional configuration that does not use a multiplication mask (only a coded aperture mask) and the image estimation accuracy using the configuration of this embodiment that uses the above-mentioned multiplication mask (a combination of a coded aperture mask and a multiplication mask). The vertical axis of the graph shown in Figure 3 represents the intensity of the PSNR (Peak Signal-to-Noise Ratio). As is clear from Figure 3, the estimation results of this embodiment have a larger PSNR value than the conventional configuration, indicating improved image quality.

このように構成された第１実施形態の撮像装置１Ａによれば、ハイパースペクトル画像の測定を行うＣＡＳＳＩ観測系において、計測された信号に対して乗算マスクによる変調処理を行うことにより、ＣＡＳＳＩによるハイパースペクトル画像の測定精度を向上させることができる。 According to the imaging device 1A of the first embodiment configured in this manner, the accuracy of measuring hyperspectral images by CASSI can be improved by performing modulation processing using a multiplication mask on the measured signal in the CASSI observation system that measures hyperspectral images.

より具体的には、ＣＡＳＳＩ観測系で用いられる符号化開口マスクの各画素は入力信号の対応する画素情報を遮断するか否か（［０，１］の場合は減衰）であるために、従来構成では、入力信号における同一画素内の各波長信号に対する信号変換は画素で共通した信号変換（｛０，１｝または［０，１］のスカラー倍にする変換）に限られた。これに対して、第１実施形態の撮像装置１Ａは、分散処理前の入力信号に直接的に作用する符号化開口マスクに加えて、分散処理および積算処理を行った結果の計測信号に対してマスクを乗算して変調する変調部１３０を有することにより、入力信号の同一画素内の各波長信号に対して異なる信号変換を可能とするものである。More specifically, because each pixel of the coded aperture mask used in the CASSI observation system either blocks or attenuates the corresponding pixel information in the input signal (attenuating in the case of [0,1]), in conventional configurations, signal conversion for each wavelength signal within the same pixel in the input signal was limited to a signal conversion common to the pixel (a conversion that multiplies {0,1} or [0,1] by a scalar). In contrast, the imaging device 1A of the first embodiment includes not only a coded aperture mask that acts directly on the input signal before dispersion processing, but also a modulation unit 130 that multiplies and modulates the measurement signal resulting from dispersion processing and integration processing by the mask, thereby enabling different signal conversions for each wavelength signal within the same pixel in the input signal.

このため、ハイパースペクトル画像の測定に関して、自由度の高い観測行列の設計が可能となり、符号化開口マスクおよび乗算マスクの２つのマスクを適切に設計することで最終的に得られる元信号の精度を向上させることが可能となる。 This makes it possible to design observation matrices with a high degree of freedom when measuring hyperspectral images, and by appropriately designing the two masks, the coded aperture mask and the multiplication mask, it is possible to improve the accuracy of the original signal that is ultimately obtained.

また、ＣＡＳＳＩの観測行列の設計は各種光学素子を用いたものでありハードウェア上での実装である。これに対して、本実施形態の変調部１３０は信号圧縮後の信号処理であり、ソフトウェア上の実装が可能である。従って、変調部１３０の実装において物理的な制約が少なく、変換の速度や連続値の表現において扱いやすい等のメリットがある。 Furthermore, the design of the CASSI observation matrix uses various optical elements and is implemented in hardware. In contrast, the modulation unit 130 of this embodiment processes signals after signal compression and can be implemented in software. Therefore, there are fewer physical constraints on the implementation of the modulation unit 130, and advantages include ease of handling in terms of conversion speed and representation of continuous values.

さらに、第１実施形態の撮像装置１Ａにおいて、変調部１３０は、ＣＡＳＳＩによる従来の撮像プロセスに追加や変更等を加えるものではない。このため、本実施形態の撮像装置１Ａによれば、容易に、且つ、装置を複雑化させることなく元信号の推定精度を向上させることができる。 Furthermore, in the imaging device 1A of the first embodiment, the modulation unit 130 does not add or modify anything to the conventional imaging process using CASSI. Therefore, with the imaging device 1A of this embodiment, the estimation accuracy of the original signal can be improved easily and without complicating the device.

＜第２実施形態＞
図４は、第２実施形態の撮像装置１Ｂの構成を示すブロック図である。図４について、図１と同様の構成には図１と同じ符号を付すことによりここでの説明を省略する。撮像装置１Ｂは、変調部１３０を備えない点、注目重み設定部１５０およびマスク生成部１６０を備える点で第１実施形態の撮像装置１Ａと異なる。 Second Embodiment
4 is a block diagram showing the configuration of an image pickup device 1B according to the second embodiment. In FIG. 4, the same components as those in FIG. 1 are denoted by the same reference numerals, and a description thereof will be omitted. The image pickup device 1B differs from the image pickup device 1A according to the first embodiment in that it does not include a modulation unit 130, but includes a focus weight setting unit 150 and a mask generation unit 160.

注目重み設定部１５０は、ＣＡＳＳＩ観測系１２０が撮像する画素ごとに、観測データとして注目すべき度合いの重み（以下「注目重み」という。）を設定する。例えば、注目重み設定部１５０は、不図示の操作入力部から注目重みの設定操作の入力を受け付け、その入力情報をもとに、画素ごとの注目重みを示す注目重み情報を生成してマスク生成部１６０に出力する。 The attention weight setting unit 150 sets a weight (hereinafter referred to as "attention weight") indicating the degree to which the observation data should be focused on for each pixel captured by the CASSI observation system 120. For example, the attention weight setting unit 150 accepts input for setting the attention weight from an operation input unit (not shown), and based on the input information, generates attention weight information indicating the attention weight for each pixel and outputs it to the mask generation unit 160.

より、具体的には、注目重みは、連続値［０，１］の範囲内の値であり、注目重み設定部１５０は、画素ごとに、３Ｄ（空間および波長）の注目重みを設定する。 More specifically, the attention weight is a value within the continuous range of [0, 1], and the attention weight setting unit 150 sets a 3D (spatial and wavelength) attention weight for each pixel.

なお、注目重みの設定方法は、上記のような方法のほか、任意の方法によって設定されてよい。例えば、注目重み情報が、通信によって他の装置から受信されてもよいし、不図示の記憶部から予め記憶された注目重み情報が読み出されてもよい。 Note that the attention weight may be set by any method other than the method described above. For example, attention weight information may be received from another device via communication, or pre-stored attention weight information may be read from a storage unit (not shown).

マスク生成部１６０は、注目重み設定部１５０から入力する注目重み情報に基づいて符号化開口マスクを生成する。マスク生成部１６０は、生成した符号化開口マスクを、後段のＣＡＳＳＩ観測系１２０の符号化部１２１に設定する。 The mask generation unit 160 generates a coded aperture mask based on the attention weight information input from the attention weight setting unit 150. The mask generation unit 160 sets the generated coded aperture mask in the coding unit 121 of the downstream CASSI observation system 120.

図５は、撮像装置１Ｂによる処理の流れを示すイメージ図である。まず、撮像装置１Ｂにおいて、注目重み設定部１５０が、画素ごとの注目重みを設定する処理を行い（ステップＳ２０１）、その設定内容を示す注目重み情報をマスク生成部１６０に出力する。例えば、注目重み設定部１５０は、画像内の注目領域を矩形で選択する操作を受け付けてもよいし、画像内の各領域に注目領域か否かの情報を対応づけたアノテーションマップを入力する操作を受け付けてもよい。 Figure 5 is an image diagram showing the processing flow by imaging device 1B. First, in imaging device 1B, attention weight setting unit 150 performs processing to set attention weights for each pixel (step S201) and outputs attention weight information indicating the settings to mask generation unit 160. For example, attention weight setting unit 150 may accept an operation to select an attention area in an image with a rectangle, or may accept an operation to input an annotation map that associates information on whether each area in the image is an attention area or not.

続いて、マスク生成部１６０が、注目重み設定部１５０から注目重み情報を入力し、その注目重み情報をもとに符号化開口マスクを生成する（ステップＳ２０２）。例えば、マスク生成部１６０は、ＣＮＮ（Convolutional Neural Network）やＵ－ＮＥＴなどのＤＮＮ（Deep Neural Network）ベースの画像処理モデルを用いて符号化開口マスクを生成することができる。マスク生成部１６０は、生成した符号化開口マスクを後段のＣＡＳＳＩ観測系１２０の符号化部１２１に設定する。 Next, the mask generation unit 160 inputs the attention weight information from the attention weight setting unit 150 and generates a coded aperture mask based on the attention weight information (step S202). For example, the mask generation unit 160 can generate a coded aperture mask using an image processing model based on a CNN (Convolutional Neural Network) or a DNN (Deep Neural Network), such as U-NET. The mask generation unit 160 sets the generated coded aperture mask in the coding unit 121 of the downstream CASSI observation system 120.

以降の処理は、第１実施形態において乗算マスクを適用しない場合の処理の流れと同様である。具体的には、ＣＡＳＳＩ観測系１２０がステップＳ２０２で設定された符号化開口マスクを用いて観測データの取得（撮像）を行い、再構成処理部１４０が取得された観測データに再構成処理を施すことにより元信号の推定結果である画像を出力する。図５では、第１実施形態の撮像装置１Ａが実行する処理と同様の処理については図２と同じ符号を示している。 The subsequent processing is the same as the processing flow when the multiplication mask is not applied in the first embodiment. Specifically, the CASSI observation system 120 acquires (images) observation data using the coded aperture mask set in step S202, and the reconstruction processing unit 140 performs reconstruction processing on the acquired observation data to output an image that is an estimation result of the original signal. In Figure 5, processes similar to those performed by the imaging device 1A in the first embodiment are denoted by the same symbols as in Figure 2.

なお、図５では、マスク生成部１６０が注目重みの設定操作により符号化開口マスクのみを生成する場合について説明したが、マスク生成部１６０は、符号化開口マスクに加えて、第１実施形態の乗算マスクを生成してもよい。この場合、マスク生成部１６０が乗算マスクを生成する機能は、深層学習モデルによる機械学習によって実現され得る。 Note that Figure 5 describes a case in which the mask generation unit 160 generates only a coded aperture mask by setting the attention weights, but the mask generation unit 160 may also generate the multiplication mask of the first embodiment in addition to the coded aperture mask. In this case, the function of the mask generation unit 160 to generate the multiplication mask can be realized by machine learning using a deep learning model.

図６は、この場合の処理の流れを示すイメージ図である。この場合、撮像装置１Ｂは、第１実施形態の変調部１３０をさらに備え、ＣＡＳＳＩ観測系１２０の観測データを変調した上で再構成処理を実施するように構成されてもよい。 Figure 6 is an image diagram showing the processing flow in this case. In this case, the imaging device 1B may further include the modulation unit 130 of the first embodiment and be configured to modulate the observation data of the CASSI observation system 120 and then perform reconstruction processing.

図７は、マスク生成部１６０が乗算マスクを生成し、再構成処理部１４０が再構成モデルを学習する流れを示すイメージ図である。上述のとおり、マスク生成部１６０が乗算マスクを生成する機能および再構成処理部１４０は、深層学習モデルによって実現され得る。例えば、マスク生成部１６０および再構成処理部１４０は、それぞれ、画素単位で設定された注目重みによって重み付けされたＭＳＥ（Mean Squared Error）をロスとして学習することにより、注目領域の再構成に特化した乗算マスクの生成モデルおよび再構成モデルを構築することができる。これはすなわち、注目重みを実数値［０，１］で表現し、その値を利用したロスの設計を行うことで、重要度の大小を考慮したハイパースペクトル画像の計測を可能とするものである。例えば、ロスは以下の（２）式のように定義される。 Figure 7 is an image diagram showing the process by which the mask generation unit 160 generates a multiplication mask and the reconstruction processing unit 140 learns a reconstruction model. As described above, the function of the mask generation unit 160 to generate a multiplication mask and the reconstruction processing unit 140 can be realized by a deep learning model. For example, the mask generation unit 160 and the reconstruction processing unit 140 can construct a multiplication mask generation model and a reconstruction model specialized for reconstructing the region of interest by learning the MSE (Mean Squared Error) weighted by the attention weight set on a pixel-by-pixel basis as the loss. In other words, by expressing the attention weight as a real value [0, 1] and designing the loss using this value, it is possible to measure hyperspectral images taking into account the level of importance. For example, the loss is defined as in the following equation (2):

以上説明したように、第２実施形態の撮像装置１Ｂは、ハイパースペクトル画像の測定を行うＣＡＳＳＩ観測系について、撮像される画像内の画素ごとに注目度合いの重みを示す注目重みを設定し、設定した注目重み情報に基づいて符号化開口マスクを生成することにより、注目領域に特化した再構成モデルを構成することができる。このため、第２実施形態の撮像装置１Ｂによれば、ＣＡＳＳＩによるハイパースペクトル画像の測定精度を向上させることができる。 As described above, the imaging device 1B of the second embodiment can configure a reconstruction model specialized for the region of interest by setting attention weights indicating the weight of the degree of attention for each pixel in the captured image for the CASSI observation system that measures the hyperspectral image, and generating a coded aperture mask based on the set attention weight information. Therefore, the imaging device 1B of the second embodiment can improve the measurement accuracy of hyperspectral images using CASSI.

さらに、第２実施形態の撮像装置１Ｂは、第１実施形態の変調部１３０を備え、マスク生成部１６０が、注目重み設定部１５０により設定された注目重み情報に基づく符号化開口マスク含め、エンド・トゥ・エンドで最適化するように学習された生成モデル（ＤＮＮ）をもとに乗算マスクを生成し、変調部１３０が生成された乗算マスクを用いて観測データを変調することにより、ＣＡＳＳＩによるハイパースペクトル画像の測定精度をさらに向上させることができる。 Furthermore, the imaging device 1B of the second embodiment is equipped with the modulation unit 130 of the first embodiment, and the mask generation unit 160 generates a multiplication mask based on a generative model (DNN) trained to perform end-to-end optimization, including a coded aperture mask based on the attention weight information set by the attention weight setting unit 150, and the modulation unit 130 modulates the observation data using the generated multiplication mask, thereby further improving the measurement accuracy of hyperspectral images by CASSI.

より具体的には、第２実施形態の撮像装置１Ｂは、ＣＡＳＳＩ観測系１２０で用いられる符号化開口マスクを、画素単位で注目したい波長域について重みづけ（注目重みの設定）を行った上で、画素及び波長域ごとの注目重みを入力としてＤＮＮ（Deep Neural Network）により符号化開口マスクを生成する。これはすなわち、ピクセルワイズな重みの設計とそれを加味した符号化開口マスクの生成モデル及び再構成モデルの学習により、波長の注目領域を空間的に変動させた測定画像の取得が可能となるということである。 More specifically, the imaging device 1B of the second embodiment weights the coded aperture mask used in the CASSI observation system 120 for the wavelength range of interest on a pixel-by-pixel basis (setting the weight of interest), and then generates the coded aperture mask using a DNN (Deep Neural Network) with the weight of interest for each pixel and wavelength range as input. In other words, by designing pixel-wise weights and learning a coded aperture mask generation model and reconstruction model that take this into account, it is possible to acquire measurement images in which the wavelength region of interest is spatially varied.

これにより、各波長域に対して｛０，１｝の離散値ではなく［０，１］で連続する範囲内の値として重要度（重み）を設定して、画素ごとに異なる波長域の重要度を設定して撮像を行うことができる。第２実施形態の撮像装置１Ｂによれば、従来手法とは異なり、取得した圧縮信号の画素情報を全て利用することができるため、単一撮影での再構成の結果が十分な性能であることが期待できるので、計測コストを小さくすることができる。This allows the importance (weight) of each wavelength range to be set as a value within a continuous range of [0, 1] rather than as a discrete value of {0, 1}, making it possible to set different importance of wavelength ranges for each pixel during imaging. Unlike conventional methods, the imaging device 1B of the second embodiment can utilize all pixel information from the acquired compressed signal, so the reconstruction results from a single image can be expected to have sufficient performance, thereby reducing measurement costs.

以上、この発明の実施形態について図面を参照して詳述してきたが、具体的な構成はこの実施形態に限られるものではなく、この発明の要旨を逸脱しない範囲の設計等も含まれる。 The above describes in detail an embodiment of the present invention with reference to the drawings, but the specific configuration is not limited to this embodiment and also includes designs that do not deviate from the gist of the present invention.

本発明は、ＣＡＳＳＩ観測系を用いたハイパースペクトル画像の計測を行う撮像装置に適用可能である。 The present invention is applicable to imaging devices that measure hyperspectral images using the CASSI observation system.

１Ａ，１Ｂ…撮像装置、１１０…光学系、１２０…ＣＡＳＳＩ観測系、１２１…符号化部、１２２…分散部、１２３…計測部、１３０…変調部、１４０…再構成処理部、１５０…重み設定部、１６０…マスク生成部 1A, 1B...imaging device, 110...optical system, 120...CASSI observation system, 121...encoding unit, 122...dispersion unit, 123...measurement unit, 130...modulation unit, 140...reconstruction processing unit, 150...weight setting unit, 160...mask generation unit

Claims

An imaging device that measures hyperspectral images using compressed sensing,
a CASSI observation system including: an encoding unit that applies a coded aperture mask to an input signal to encode the input signal and output the encoded signal; a dispersing unit that wavelength-disperses the input signal encoded by the encoding unit and outputs the wavelength-dispersed input signal; and a measuring unit that captures the input signal wavelength-dispersed by the dispersing unit;
an attention weight setting unit that sets an attention weight for each pixel captured by the CASSI observation system;
a mask generation unit that generates the coded aperture mask based on the attention degree weight for each pixel set by the attention weight setting unit;
An imaging device comprising:

a modulation unit that modulates the output signal of the CASSI observation system using a multiplication mask;
The imaging device according to claim 1 .

the mask generation unit generates the multiplication mask in addition to the coded aperture mask based on the weight of the attention degree, and sets the generated multiplication mask in the modulation unit.
The imaging device according to claim 2 .

the attention weight setting unit is capable of setting an arbitrary value between 0 and 1 as the weight of the attention degree for each pixel.
The imaging device according to claim 1 .

a reconstruction processing unit that receives an output signal from the modulation unit, performs reconstruction processing on the output signal using a reconstruction model that is pre-constructed by a neural network , and outputs an estimation result of a hyperspectral image;
The imaging device according to claim 2 .

the mask generation unit constructs the coded aperture mask and the multiplication mask such that end-to -end input/ output of the mask generation unit, the CASSI observation system, the modulation unit, and the reconstruction processing unit is optimized by the reconstruction model .
The imaging device according to claim 5 .

An imaging method for measuring a hyperspectral image by compressed sensing, comprising:
an imaging step of imaging an input signal using a CASSI observation system including: an encoding unit that applies a coded aperture mask to the input signal to encode the input signal and output the encoded signal; a dispersing unit that wavelength-disperses the input signal encoded by the encoding unit and outputs the wavelength-dispersed input signal; and a measuring unit that images the input signal wavelength-dispersed by the dispersing unit;
a modulating step of modulating the output signal of the CASSI observation system by a multiplication mask;
An imaging method comprising:

A program for causing a computer to function as the imaging device described in any one of claims 1 to 6.