JP7207627B2

JP7207627B2 - High-tolerance digital watermarking method

Info

Publication number: JP7207627B2
Application number: JP2019087181A
Authority: JP
Inventors: 河村尚登
Original assignee: Kawamura Naoto
Current assignee: Kawamura Naoto
Priority date: 2019-05-03
Filing date: 2019-05-03
Publication date: 2023-01-18
Anticipated expiration: 2039-05-03
Also published as: JP2020184663A

Description

本発明は，デジタル画像データに著作権情報などの透かし情報を埋め込む電子透かし法で、特に、特にアルゴリズムを公開しても第三者からの攻撃に対して高耐性を示す不可視型で可逆の電子透かし方法に関するものである。 The present invention is an electronic watermarking method for embedding watermark information such as copyright information in digital image data. It relates to the watermarking method.

現在のデジタル情報社会において，情報を手軽に複製でき、多くの人が情報を共有することが可能となり，社会が大きく発展してきた。しかし，その利便性が，他人の著作物を違法に複製し流通させることにより，著作権侵害などの事件を引き起こすことにもなった。画像においては，近年のデジタルカメラやプリンタの高画質化により，原画と寸分も違わぬ複製が容易に得られるようになり，著作権を侵害した違反コピーだけでなく，紙幣や有価証券等の偽造行為という悪質な犯罪行為を助長させる結果となっている。 In today's digital information society, information can be easily duplicated and information can be shared by many people, and society has developed greatly. However, this convenience has also caused incidents such as copyright infringement due to the illegal duplication and distribution of other people's works. In the case of images, the high image quality of digital cameras and printers in recent years has made it possible to easily obtain exact reproductions of original images. It is a result of encouraging vicious criminal acts called acts.

そのような状況の中で，画像情報の中に別の情報，例えば，著作者情報等を埋め込み，著作権を保護したり、違法複写に警告を与えたりする不可視の電子透かし（Digital Watermark）技術が発展してきた。この電子透かし技術は、例えば、著作権保護の目的には、文書や画像の著作物の中に著作権者やURL、連絡先、利用条件などを埋め込み、利用者に注意を喚起するのみならず、不正利用された場合には、その追跡を可能とする。このように、著作権の管理・保護、ならびに不正複写の監視など、セキュリティ対策として電子透かしが広く使われ始めている。 In such a situation, invisible digital watermark technology embeds other information, such as author information, in image information to protect copyright or warn against illegal duplication. has developed. For example, for the purpose of copyright protection, this digital watermarking technology not only embeds the copyright owner, URL, contact information, terms of use, etc. , and in case of unauthorized use, it is possible to trace it. In this way, electronic watermarks are beginning to be widely used as security measures such as copyright management/protection and unauthorized copying monitoring.

このセキュリティ対策としての電子透かしは、頑強で簡単に消失あるいは除去できない高耐性のものでなければならない。画像の編集加工や簡単な操作で透かし情報を除去や消去ができるならば、不正使用して追跡や監視の目から逃れることができるからである。 The electronic watermark as a security measure must be robust and highly durable so that it cannot be lost or removed easily. This is because if the watermark information can be removed or erased by editing the image or by a simple operation, it can be used illegally and escaped from tracking and surveillance.

通常、耐性を強化するためには、電子透かしを画像データの深層に埋め込む必要がある。このため、埋め込みの強度（gain）を大きくして埋め込む。これは、原画像に対してダメージが増大し、画質が劣化するという弊害をもたらす。つまり、画質と耐性とはお互いにトレードオフの関係にあり、両方を満たすことは通常困難である。 Typically, to enhance robustness, a digital watermark needs to be embedded deep into the image data. Therefore, the embedding strength (gain) is increased for embedding. This increases damage to the original image, resulting in deterioration of image quality. In other words, image quality and durability are in a trade-off relationship, and it is usually difficult to satisfy both.

画質と耐性を同時に満たす電子透かし法として、特許文献1から６および非特許文献１から２に示されるグリーンノイズパターンを画像データに埋め込む手法（以降、グリーンノイズ拡散法と呼ぶ）をこれまで提案してきた。グリーンノイズパターンは，そのスペクトルが特定の帯域fmax～fminに閉じ込められ、その主周波数は、人の視覚特性で認識できにくい周波数帯に設定できる。このため、電子透かしに用いた場合、埋め込み画像を明視の距離で観察した場合、ドットパターンは低い応答性を示しアーティファクトは認識されにくい。また、プリンタやスキャナーでの書き込みや読み取りでは高い応答感度を有すため、印刷しても埋め込み情報は保持できる。信頼度を上げるため、埋め込み強度（gain）を上げてもアーティファクトが視認され難く，高画質性が保たれる特徴がある。 As a digital watermarking method that satisfies image quality and durability at the same time, we have proposed a method of embedding a green noise pattern in image data (hereinafter referred to as a green noise diffusion method), which is shown in Patent Documents 1 to 6 and Non-Patent Documents 1 to 2. rice field. The green noise pattern has its spectrum confined to a specific band fmax to fmin, and its main frequency can be set to a frequency band that is difficult to recognize by human visual characteristics. For this reason, when used for digital watermarking, when the embedded image is observed at a distance of clear vision, the dot pattern exhibits low responsiveness and artifacts are difficult to recognize. In addition, since it has high response sensitivity in writing and reading with printers and scanners, embedded information can be retained even after printing. In order to increase reliability, even if the embedding strength (gain) is increased, artifacts are less visible and high image quality is maintained.

グリーンノイズ拡散法はスペクトル拡散法と同じ様に編集や加工に強靭であるが、第三者が不正利用のために透かし情報を除去することに対しても、強固でなければならない。第三者が様々な方法、様々な画像、様々な情報で埋め込みアルゴリズムを解析・推定し攻撃する（以降、結託攻撃と呼ぶ）ことにより、埋め込み画像から透かし情報を除去し、原画に戻すことにより、不正利用を可能としてしまう。アルゴリズム公開型の電子透かしでは、アルゴリズムを知ることにより信号処理を逆解析することが容易で、このため攻撃を受けやすい。埋め込みのパラメータは、通常、“鍵”という形で保持される。この鍵は、通常、埋め込む画素の位置情報や、強さ（gain情報）、乱数列、ハッシュ関数のパスワードなど、各種パラメータを用い、鍵情報がなければ透かしの除去はできない。結託攻撃ではこの鍵の情報を推定することである。 Like the spread spectrum method, the green noise diffusion method is robust against editing and processing, but it must also be robust against removal of watermark information for unauthorized use by a third party. A third party analyzes, estimates, and attacks the embedding algorithm using various methods, various images, and various information (hereafter referred to as collusion attacks), thereby removing the watermark information from the embedded image and returning it to the original image. , makes unauthorized use possible. Algorithmic open watermarking makes it easy to reverse analyze the signal processing by knowing the algorithm and is therefore vulnerable to attacks. Embedding parameters are usually held in the form of a "key". This key usually uses various parameters such as position information of pixels to be embedded, strength (gain information), random number sequence, hash function password, etc. Without the key information, the watermark cannot be removed. A collusion attack involves guessing this key information.

ステガノグラフィとしての利用形態では、秘密情報をこっそり画像中に埋め込み、鍵なしでは透かし抽出が出来ないようになっている。このため、アルゴリズムは非公開である。しかし、著作権保護を目的とする電子透かし法は、利用者が誰でも著作権情報の有無や条件などを知ることが出来る必要があり、透かし抽出は誰でも簡単にできるようにする必要がある。また、広く普及するためには、業界で統一された手法として標準化が必要で、アルゴリズムが公開できることが望ましい。 In the form of steganography, secret information is secretly embedded in the image so that the watermark cannot be extracted without a key. For this reason, the algorithm is private. However, in the digital watermarking method for copyright protection, it is necessary for any user to be able to know the presence or absence of copyright information and conditions, etc., and it is necessary for anyone to be able to easily extract watermarks. . In addition, in order to spread widely, it is necessary to standardize it as a unified method in the industry, and it is desirable that the algorithm can be published.

この時、アルゴリズムを公開しても安全性が確保されるためには、通常、非対称型の電子透かし手法が望ましいとされている。対称型は、透かしの埋め込みと逆手順で抽出を行うので、埋め込み位置や方法などが発覚されやすく攻撃をうけやすいからである。
グリーンノイズ拡散法は埋め込みは実空間で，抽出はスペクトル空間で行われる非対称型で，かつ，グリーンノイズパターンはランダムドットが拡散したパターンのためスペクトラム拡散と同じ様に完全に除去することは一般に難しい．アルゴリズムを公開したとしても，埋め込みの各種パラメータは秘密情報として「秘密鍵」の形で受領者に渡される。秘密性はこの鍵に集約され、安全性が担保され、通常は、この鍵がなければ透かしの除去はできない様になっている。 At this time, in order to ensure security even if the algorithm is made public, an asymmetric digital watermarking method is generally considered desirable. This is because the symmetric type extracts watermarks in the reverse order of embedding, so the location and method of embedding can easily be discovered and attacked.
The green noise diffusion method is an asymmetric type in which the embedding is performed in real space and the extraction is performed in spectral space, and because the green noise pattern is a pattern in which random dots are diffused, it is generally difficult to completely remove it in the same way as spread spectrum. . Even if the algorithm is made public, the various embedded parameters are passed to the recipient as confidential information in the form of a "private key." Confidentiality is aggregated into this key, security is guaranteed, and normally watermark removal cannot be performed without this key.

さらに、画像ごとに埋め込みパターンなどのパラメータを変えるため、鍵情報は画像ごとに異なる。このため、鍵を紛失したり盗難されたりしても、その鍵で他の画像を操作し、透かしを除去することはできない。 Furthermore, since parameters such as an embedding pattern are changed for each image, the key information is different for each image. Therefore, even if the key is lost or stolen, the key cannot be used to manipulate other images to remove the watermark.

そのうえで、前述の結託攻撃に対する対策が必要である。結託攻撃では、鍵なしで原画像との差分をとることや、平坦部の多い画像、複数の画像から、埋め込みの特徴、画像の相関などから、埋め込みパラメータやパターンなどを解読し鍵情報を割り出す。このため、このような攻撃に対する対策が必要である。第三者からのこのような結託攻撃は、近年のAI技術を用いた機械学習を用いて種々のパターンを学習することにより、鍵情報を推定することが可能となりつつある。特に、ブロック型の電子透かし法では、ブロック毎に透かし情報を埋め込むため、1枚の画像からブロック数だけの多数の学習データが揃うため、埋め込みパラメータの推定が行いやすい。 In addition, countermeasures against the above-mentioned collusion attacks are necessary. In a collusion attack, the key information is determined by taking the difference from the original image without a key, decoding the embedding parameters and patterns from images with many flat parts, and embedding features and image correlations from multiple images. . Therefore, countermeasures against such attacks are required. Such collusion attacks from third parties are becoming possible to estimate key information by learning various patterns using machine learning using recent AI technology. In particular, in the block-type digital watermarking method, since watermark information is embedded in each block, as many pieces of learning data as the number of blocks are prepared from one image, embedding parameters can be easily estimated.

このため、特許文献５では、埋め込むグリーンノイズパターンに反転パターンを用いてその数を増やし、埋め込むビット情報（０，１）に対して複数のパターンの選択を乱数列として鍵情報に入れる方法を提案した。複数パターンはオリジナルパタ－ンの空間的反転パターンと輝度データの反転パターン（ネガ/ポジのパターン）から成り、一つのグリーンノイズパターンに対して８つの反転パターンを生成した。このため、パターンの種類と数が増え、同じパターンが繰り返し出てくるというリスクは減少し、安全性は改善される。 For this reason, Patent Document 5 proposes a method of increasing the number of green noise patterns to be embedded by using inverted patterns, and inserting the selection of a plurality of patterns into key information as a random number sequence for bit information (0, 1) to be embedded. bottom. The multiple patterns consisted of the spatially inverted pattern of the original pattern and the inverted pattern of the luminance data (negative/positive pattern), and eight inverted patterns were generated for one green noise pattern. As a result, the number and types of patterns are increased, the risk of the same pattern appearing repeatedly is reduced, and safety is improved.

しかし、反転パターンのみではパターンの位相（空間位置情報のシフト量）がなく常に同じ位相であるため、多くのパターン埋め込み情報からパターンの同定が可能であり、まだ、完全に安全であるとは言えない。 However, with only the inverted pattern, there is no pattern phase (spatial position information shift amount) and the phase is always the same, so it is possible to identify the pattern from a lot of pattern embedding information, and it is still completely safe. Absent.

特開2018-182471JP 2018-182471 特開2018-201134JP 2018-201134 特開2018-207247JP 2018-207247 特願2017-117460Patent application 2017-117460 特願2018-216362Patent application 2018-216362 特願2019-000178Patent application 2019-000178

N.Kawamura;”Digital Halftoning by Clustered Dither Screen Masks Representing Green-noise Characteristics”, Proceeding of the 1st ICAI 2015, T109-3, pp.708-711 (2015).N.Kawamura;”Digital Halftoning by Clustered Dither Screen Masks Representing Green-noise Characteristics”, Proceeding of the 1st ICAI 2015, T109-3, pp.708-711 (2015). N.Kawamura;”Digital Halftoning by Stochastic Dither Screen Mask Representing Green to Blue Noise Characteristics”, Trans. on IEVC, Vol.5, No.1, pp.34-41 (2017).N.Kawamura;”Digital Halftoning by Stochastic Dither Screen Mask Representing Green to Blue Noise Characteristics”, Trans. on IEVC, Vol.5, No.1, pp.34-41 (2017).

そこで、高画質と編集・加工に対する耐性を維持し、結託攻撃に対しても十分強固な電子透かし法を得ることが課題である。すなわちアルゴリズム公開型電子透かし法において、第三者が鍵なしに埋め込みパターンの解析により埋め込みパターンを割り出すことが出来ない強い耐性を示す電子透かし法を得ることが課題である。 Therefore, it is a challenge to obtain a digital watermarking method that maintains high image quality and resistance to editing and processing, and is sufficiently robust against collusion attacks. That is, in the algorithm public type digital watermarking method, it is a problem to obtain a digital watermarking method that exhibits strong resistance such that a third party cannot determine the embedding pattern by analyzing the embedding pattern without a key.

前記課題を解決するために、本発明は、画像をブロック分割し、ブロックごとに透かし情報を埋め込む電子透かし法において、グリーンノイズ特性を示す埋め込みパターンを原画像に重畳し埋め込む際、ブロック毎にパターンの位相をシフトあるいは変動させて埋め込む。 In order to solve the above problems, the present invention provides a digital watermarking method in which an image is divided into blocks and watermark information is embedded in each block. embedded by shifting or varying the phase of

すなわち、画像データI(x,y)をR× Rのブロックに分割し、ブロック毎に透かし情報ビットに対応した異なるパターンを埋め込む電子透かし法において、埋め込みパターンはグリーンノイズ特性を示す複数の異なる基本パターンPi(x,y)を、ブロック毎に異なる位相シフト量 (x0,y0)（ただし、 0≦ x0,y0＜ R ）を用いて、x方向にx0、y方向にy0だけシフトさせたもので、 P'i(x,y) = Pi((x+x0) mod R, (y+y0) mod R) （ここで、P’i(x,y)はシフト後のパターン、mod RはRを法とする剰余）なる演算式で求められ、透かしの埋め込みは、かかるパターンP’i(x,y)を用いて、W(x,y)=I(x,y)+ gain・P'i(x,y) (ここでgainは埋め込み強度) なる演算にて行い、前記位相シフト量（x0,y0）は、乱数発生器で発生し位相シフトリストL(n) (nはブロック番号)に記憶し、鍵として、基本パターンPi(x,y)、埋め込み強度（gain）及び位相シフトリストL(n)を保管し、透かし情報の抽出は、透かしの埋め込まれた画像データをブロックに分割し、ブロックのフーリエスペクトルパターンの形状から、埋め込みビット情報を抽出するもので、位相シフト量によらず同じ抽出ソフトウェアで実行可能であり、透かしの除去は、透かしの埋め込まれた画像W(x,y)に、鍵を用いて、I(x,y)=W(x,y)-gain・P'i(x,y) なる演算にて除去を行い原画に戻すことや新たな情報を埋め込むことが可能なことを特徴とする。
That is, in the digital watermarking method in which image data I(x,y) is divided into R×R blocks and different patterns corresponding to watermark information bits are embedded in each block, the embedding patterns are a plurality of different basic patterns exhibiting green noise characteristics. The pattern Pi(x,y) is shifted by x0 in the x direction and y0 in the y direction using a different phase shift amount (x0,y0) (where 0≤x0,y0<R) for each block. , P'i(x,y) = Pi((x+x0) mod R, (y+y0) mod R) (where P'i(x,y) is the shifted pattern and mod R is (R modulo R), and the watermark embedding is W(x,y)=I(x,y)+gain・P 'i(x, y) (here, gain is the embedding strength) The phase shift amount (x0, y0) is generated by a random number generator and obtained from the phase shift list L(n) (n is the block number ), and the basic pattern Pi(x,y), the embedding strength (gain), and the phase shift list L(n) are stored as keys. The embedded bit information is extracted from the shape of the Fourier spectrum pattern of each block. The same extraction software can be used regardless of the phase shift amount. ,y) is removed by the operation I(x,y)=W(x,y)-gain・P'i(x,y) using the key to return to the original image and new information It is characterized by being embeddable .

かかる位相をシフトさせたパターンP'i(x,y)は、R×Rの基本パターンPi(x,y)をベースに、(x0,y0)（ただし、0≦x0,y0＜R）だけシフトさせたもので、シフト量（x0,y0）はブロック毎に異なる値をとるが、透かしの埋め込まれたブロックのフーリエスペクトルパターンの形状は、位相シフト量によらず同じスペクトル形状を示すため、透かし情報の抽出は、この位相シフト量の変更に伴う抽出ソフトウェアの変更は不要であり、透かしを除去する時のみ、この鍵を用いて透かしを除去する。このようにすることにより、結託攻撃からパターンの推定を困難にすることが出来る。
Such a phase-shifted pattern P'i(x,y) is based on the basic pattern Pi(x,y) of R×R, by (x0,y0) (where 0≦x0,y0<R) Although the amount of shift (x0, y0) is different for each block, the shape of the Fourier spectrum pattern of the watermark-embedded block shows the same spectrum shape regardless of the amount of phase shift. Extraction of watermark information does not require a change in the extraction software accompanying a change in this phase shift amount, and the watermark is removed using this key only when removing the watermark. By doing so, it is possible to make it difficult to estimate patterns from collusion attacks.

かかる埋め込みパターンPi(x,y)は、画像ごとに異なる初期値から生成されたグリーンノイズパターンであり、シフトリストL(ｎ)は、画像ごとに異なるシフト列から構成され、これらが組み込まれた鍵で他の画像の透かしを除去できない。
The embedding pattern Pi(x,y) is a green noise pattern generated from different initial values for each image, and the shift list L( n ) is composed of different shift sequences for each image, and Unable to remove watermarks on other images with a key.

本発明により、編集加工による高耐性と、第三者からの結託攻撃に対する耐性が備わり、頑強で、鍵なしでは透かしの除去ができない安全な電子透かしを得ることができる。 According to the present invention, it is possible to obtain a secure digital watermark that has high resistance to editing processing and resistance to collusion attacks from a third party, is robust, and cannot be removed without a key.

また、鍵により元の画像に復元可能である。これは画像の購入者が、配信者から直接原画を配信してもらう手間を省くことができるし、原画での配信を行うことがないため、漏洩や盗難などの心配もなく、セキュリティ上も安全性が高い。さらに購入者はその鍵を用いて、編集加工した画像に第二著作者としての著作権情報を埋め込むことができる。 Also, the original image can be restored with a key. This saves the purchaser of the image the trouble of having the original image delivered directly from the distributor, and since the original image is not distributed, there is no need to worry about leaks or theft, and it is safe in terms of security. highly sexual. Furthermore, the purchaser can use the key to embed the copyright information as the second author in the edited image.

かかる鍵は、基本となるパターンPi(x,y)と、gain(埋め込み強度)および位相シフトリストL(n)のみでよく、鍵のサイズを小さくすることが可能で取り扱いが簡単である。 Such a key only needs the basic pattern Pi(x, y), gain (embedding strength) and phase shift list L(n), which makes it possible to reduce the size of the key and facilitate handling.

また、鍵に含まれる基本パターンPi(x,y)は、画像ごとに異なるグリーンノイズパターンを用い、鍵の紛失や盗難時にも、この鍵を用いて他の画像の透かし除去はできない。このため安全性は極めて高い。 Also, the basic pattern Pi(x, y) included in the key uses a different green noise pattern for each image, and even if the key is lost or stolen, this key cannot be used to remove watermarks from other images. Therefore, the safety is extremely high.

埋め込まれた画像は、シフトパターンにより、埋め込み画像から埋め込みパターンを推定する結託攻撃に対しても安全性は確保されている。このため、アルゴリズムを公開してもそのアルゴリズムから埋め込みパターンを推定することはできず、攻撃の助長とはならない。 Security of the embedded image is ensured by the shift pattern against a collusion attack in which the embedding pattern is estimated from the embedding image. For this reason, even if the algorithm is made public, the embedding pattern cannot be estimated from the algorithm, which does not encourage attacks.

本発明の電子透かしの埋め込みおよび抽出の装置の図、1 is a diagram of a watermark embedding and extraction apparatus of the present invention; 透かし入り画像のインターネットでの配信を示した図、A diagram showing the distribution of watermarked images over the Internet, 埋め込み用ドットパターン生成のフローを示す図、A diagram showing the flow of embedding dot pattern generation, ドットパターンのスペクトルの回転方向と埋め込みビットを表す図、A diagram showing the rotation direction and embedded bits of the spectrum of the dot pattern, 埋め込み用ドットパターン生成時におけるフィルタの形状を示す図Diagram showing the shape of the filter when generating a dot pattern for embedding 生成されたドットパターンとそのスペクトルおよび埋め込み情報を示す図、A diagram showing the generated dot pattern and its spectrum and embedded information, 透かし埋め込みの処理フローを表す図、Diagram showing the process flow of watermark embedding, 透かし抽出の処理フローを表す図,Figure showing the processing flow of watermark extraction, パターン識別の処理フローを表す図、A diagram showing the processing flow of pattern identification, マスクパターンを示す図、A diagram showing a mask pattern, 位相シフトパターンを示す図、Figures showing phase shift patterns, 乱数発生による位相シフトパターンを示す図、A diagram showing a phase shift pattern by random number generation, 基本パターンと位相シフトパターンおよびそれらのスペクトルを表す図で、(a)はR=64, (b)はR=32 の場合を示す。FIG. 10 is a diagram showing a basic pattern, a phase shift pattern and their spectra, (a) for R=64 and (b) for R=32. 256×256の画像に埋め込みを行ったときの画像で、（a）は原画像、(b)は埋め込み画像、(c)はそのスペクトルを表す画像。(a) is the original image, (b) is the embedded image, and (c) is the spectrum of the embedded image.

図１は本発明の情報埋め込みと抽出のための電子透かしシステムの構成図である。著作権保有者のコンピュータ３には、著作権を有す画像データが、例えばハードディスクなどのデータメモリ７に保管されている。画像データは、プログラムメモリ６にある画像処理プログラムにより、ＣＰＵ１１，ＲＯＭ４，ＲＡＭ５などを用いて画像処理され、モニター８に表示される。コンピュータ３にはスキャナー１、プリンタ２が接続され、処理された画像はプリンタから出力し、またスキャナー１から画像読み取りができる。かかる画像処理は大きなサイズの画像を取り扱う場合，演算負荷が増大するため，並列処理を行うことや、ＧＰＵなどの高速化を図るための処理ボードが入っている場合もある。 FIG. 1 is a configuration diagram of a digital watermarking system for information embedding and extraction according to the present invention. In the copyright holder's computer 3, copyrighted image data is stored in a data memory 7 such as a hard disk. The image data is image-processed by the image processing program in the program memory 6 using the CPU 11, ROM 4, RAM 5, etc., and is displayed on the monitor 8. FIG. A scanner 1 and a printer 2 are connected to the computer 3 , and the processed image can be output from the printer and the image can be read from the scanner 1 . Such image processing increases the computational load when handling images of a large size, so there are cases where parallel processing is performed and a processing board such as a GPU is included for speeding up.

画像中に著作権情報を埋め込むためには、キーボード９から、例えば、著作者の名前や日付、ＵＲLなどの文字情報を入れるが、埋め込む文字数は、最低１６文字は必要となる。これはASCIIコードの文字で埋め込むものとして16バイト（１２８ビット）が必要となる。 In order to embed the copyright information in the image, character information such as the author's name, date, URL, etc. is input from the keyboard 9, and at least 16 characters are required to be embedded. This requires 16 bytes (128 bits) to be embedded with ASCII code characters.

図２は、一つの運用形態を示すもので，インターネット配信の処理手順を示すものである。透かし情報を埋め込まれた画像は、配信者あるいは著作権者１６のコンピュータやサーバーからインターネット１２により配信や展示される（１３）。それを見た購入希望者１７は、購入希望を配信者に連絡する（１４）。所定の手続き後、配信者は透かしの埋め込みと除去のための秘密鍵を配信する（１５）。購入者は、配信者との契約に基づき、送付された画像の埋め込まれた電子透かしを除去し、画像の編集加工を行うことができる。 FIG. 2 shows one operation mode, showing a processing procedure for internet distribution. The image embedded with the watermark information is distributed or exhibited via the Internet 12 from the computer or server of the distributor or copyright holder 16 (13). The prospective purchaser 17 who sees it informs the distributor of his purchase desire (14). After a predetermined procedure, the distributor distributes the private key for watermark embedding and removal (15). Based on the contract with the distributor, the purchaser can remove the electronic watermark embedded in the sent image and edit the image.

購入者は、さらに編集加工された画像データを、２次著作者として自己の著作情報やＵＲＬなどの情報をこの秘密鍵を用いて埋め込むことができる。これらの透かし情報を埋め込まれた画像は、公開され市場に流通し様々な利用がされる。 The purchaser can embed information such as his/her own work information and URL as a secondary author in the edited image data using this secret key. Images embedded with such watermark information are released to the public, circulated on the market, and used in various ways.

その画像を見た第三者は、透かし抽出ソフトにより、この画像に著作権があることを確認できる。透かし抽出ソフトウェアには秘密性はなく、例えば，配信者や著作権者のホームページなどから自由にダウンロードできるようにして広く公開し、だれでも使用できる。第三者はこれにより画像の所有者や著作権情報、連絡先などを知ることができるため、不正使用の防止・警告にもなる。 A third party who sees the image can confirm that the image has the copyright by using watermark extraction software. The watermark extracting software is not confidential and can be freely downloaded from, for example, the homepages of distributors and copyright holders and made widely available for anyone to use. Third parties can know the owner of the image, copyright information, contact information, etc. by this, so it can also prevent unauthorized use and warn.

一方、著作権者や二次著作権者は、その透かしにより、著作物としての画像データの追跡と監視を行うことが可能である。もし、第三者が不正に使用した場合、著作権者や二次著作権者は、監視ソフトウェアにより画像から透かし情報を読みだし、自己の著作物であること、さらに第三者が無断で使用している場合には摘発が可能となる。 On the other hand, copyright holders and secondary copyright holders can track and monitor image data as copyrighted works by using the watermark. If a third party uses the image illegally, the copyright holder or secondary copyright holder will read the watermark information from the image using monitoring software, confirm that it is their own work, and that the third party will use it without permission. If so, it can be detected.

鍵は埋め込む画像ごとに１対１で対応する。このため、著作権者は悪用されないように画像ごとに異なる鍵を用いる。この鍵を用いて他の画像の透かしを除去することはできない。鍵の紛失や第三者による結託攻撃による鍵の取得が起きても、その被害は該当する1枚の画像のみで、他の画像に対しては透かし除去はできない。このとき、透かし読み出しのソフトウェアは、鍵が異なっても共通に使える。また。鍵の発行はほぼ無限に行えるため，同じ鍵となる確率は極めて低い。 A key corresponds to each embedded image on a one-to-one basis. For this reason, the copyright holder uses a different key for each image to prevent abuse. This key cannot be used to remove watermarks on other images. Even if the key is lost or the key is obtained through a collusion attack by a third party, the damage will be limited to one image, and the watermark cannot be removed from the other images. At this time, the watermark reading software can be used in common even if the keys are different. Also. Since keys can be issued almost infinitely, the probability of having the same key is extremely low.

以下、実施例に沿って詳しく説明する。
ここで，まず本発明に用いられるグリーンノイズ特性を示すドットパターンの生成について図３に沿って説明する．
今，マトリックスサイズをR画素×R画素 ( R=2^m ここで^はべき乗)として，その中に1画素サイズの黒ドットを埋め込むものとする。黒ドットの面積率をg（0≦g≦１：g =1が全黒，g =0 が全白）とする。点(x,y)におけるドットパターンを p(x,y)として、g=1/2のグリーンノイズ特性を示すドットパターンを以下の様にして求める。
(工程１) まず，疑似乱数発生器により、ランダムドットp(x,y)を生成する（２０）。この時，疑似乱数発生器のSEED値を変えることにより初期状態を変更可能である。
(工程２) p(x,y)を二次元フーリエ変換を行いスペクトルP(u,v)を得る（２１）。ここで、(x,y)は空間座標、(u,v)は空間周波数座標を表す。g=0.5 の場合、(R^2)/2個のランダムドット（初期状態はホワイトノイズ）を発生させ、p(x,y)とする．
(工程３) P(u,v)にフィルタD(u,v)を掛けて新たなP'(u,v)を得る（２２）。すなわち、
P’(u,v)=P(u,v)・D(u,v)
(工程４) P'(u,v)に逆フーリエ変換を行い，多値の点プロファイルp'(x,y)を得る（２３）。
(工程５）誤差関数e(x,y)=p'(x,y)-p(x,y)を求め（２４），各画素位置での誤差の大きい順に白，黒同数反転する（２５）。
(工程６）工程2から工程５までの操作を誤差が許容量以内になるまで繰り返し，最終的にg=1/2のドットパターンを得る。 A detailed description will be given below along with examples.
Here, the generation of the dot pattern showing the green noise characteristics used in the present invention will be explained with reference to FIG.
Assume that the matrix size is R pixels × R pixels (R=2^m where ^ is a power), and 1-pixel size black dots are embedded in it. Let the area ratio of black dots be g (0 ≤ g ≤ 1: g = 1 is all black, g = 0 is all white). Assuming that the dot pattern at the point (x, y) is p(x, y), the dot pattern showing the green noise characteristics of g=1/2 is obtained as follows.
(Step 1) First, a pseudo-random number generator generates random dots p(x,y) (20). At this time, the initial state can be changed by changing the SEED value of the pseudo-random number generator.
(Step 2) A two-dimensional Fourier transform is performed on p(x,y) to obtain a spectrum P(u,v) (21). Here, (x, y) represent spatial coordinates, and (u, v) represent spatial frequency coordinates. For g=0.5, generate (R^2)/2 random dots (initial state is white noise) and let p(x,y).
(Step 3) Multiply P(u,v) by filter D(u,v) to obtain new P'(u,v) (22). i.e.
P'(u,v)=P(u,v)・D(u,v)
(Step 4) Perform an inverse Fourier transform on P'(u, v) to obtain a multivalued point profile p'(x, y) (23).
(Step 5) Find the error function e(x, y)=p'(x, y)-p(x, y) (24), and invert white and black by the same number in descending order of error at each pixel position (25 ).
(Step 6) The operations from Step 2 to Step 5 are repeated until the error is within the allowable amount, and finally a dot pattern of g=1/2 is obtained.

ここで、フィルタD(u,v)のパターンの設定について説明する。今、D(u,v)が円形のリングパターンで、その最大ラジアル周波数をfmax、最小ラジアル周波数をfminであるとする。g=1/2での平均的ドット間隔による周波数f0は，
f0＝√g・fn＝√(1/2)・fn
で与えられ，fmax及びfminはf0を基準として，
a≡(fmax - f0)/fn
b≡(fmin - f0)/fn
として、パラメータ（a,b）を定義する。ここで，fnはナイキスト周波数を示す。かかる(a,b)を変化させて、視覚特性に合わせてクラスターサイズの異なるドットパターンを得ることができる。 Here, the setting of the pattern of the filter D(u,v) will be explained. Now let D(u,v) be a circular ring pattern with its maximum radial frequency fmax and minimum radial frequency fmin. The frequency f0 due to the average dot spacing at g=1/2 is
f0＝√g・fn＝√(1/2)・fn
and fmax and fmin are based on f0,
a ≡ (fmax - f0)/fn
b ≡ (fmin - f0)/fn
Define the parameters (a,b) as Here, fn indicates the Nyquist frequency. By changing (a, b), it is possible to obtain dot patterns with different cluster sizes according to visual characteristics.

透かし埋め込みは4値のパターンで行うものとする。このためには埋め込み2ビット(00,01,10,11)に対応した4種類のパターンが必要である。そのためフィルタD(u,v)のパターンが楕円リングパターンで、４５°づつ回転した4つのパターンとなるようにする。図４はフィルタパターンを模式的に示すもので、楕円リングの短軸の最小値をfmin、最大値をfmaxとし、長軸の最小値をβ・fmin、最大値をβ・fmaxとする。ここで、βは倍率を表す。正立したものを2ビットの（00）に対応するフィルタパターン、それを45°回転したものを（01）、90°回転したものをビット（10）、135°回転したものを（11）にそれぞれ対応するものとする。このようにして得られたの4つの透かしパターンをPi(x,y) （i=0,…,3）と表記するものとする。 Watermark embedding is performed using a quaternary pattern. For this purpose, four types of patterns corresponding to embedded 2 bits (00, 01, 10, 11) are required. Therefore, the pattern of the filter D(u,v) is an elliptical ring pattern, and four patterns are rotated by 45 degrees. FIG. 4 schematically shows a filter pattern, where the minimum value of the minor axis of the elliptical ring is fmin, the maximum value is fmax, and the minimum value of the major axis is β·fmin, and the maximum value is β·fmax. Here, β represents magnification. Filter pattern corresponding to 2-bit (00) upright, (01) rotated by 45°, bit (10) rotated by 90°, and (11) rotated by 135° shall correspond to each other. The four watermark patterns thus obtained shall be denoted as Pi(x,y) (i=0,...,3).

図５に、かかる4つのフィルタ（フィルタ0、フィルタ1、フィルタ2、フィルタ3）の形状を示す。図で黒領域は0、白領域は1を表す。図6に、このフィルタD(u,v)を用いて、前述の工程を経て作成されたパターンおよびスペクトルを示す。各パターンはR×Rのサイズで、パターン0は2ビットの埋め込み情報（00）に対応し、パターン1は（01）、パターン2は（10）、パターン3は（11）にそれぞれ対応する。 FIG. 5 shows the shapes of such four filters (Filter 0, Filter 1, Filter 2, Filter 3). In the figure, black areas represent 0 and white areas represent 1. FIG. 6 shows the pattern and spectrum created through the above steps using this filter D(u,v). Each pattern has a size of R×R, pattern 0 corresponds to 2-bit embedded information (00), pattern 1 corresponds to (01), pattern 2 corresponds to (10), and pattern 3 corresponds to (11).

続いて、画像データへの透かし埋め込み手法について説明する。透かし情報の埋め込みは、赤（R）,緑（G）、青（B）からなるカラー画像データの青（B）データに埋め込むか、あるいは、輝度、色差信号となる（Y,Cb,Cr）信号に変換し，輝度（Y）データに埋め込むことが考えられる。電子データでのみ透かし情報を取り扱う場合は、青（B）に埋め込むのがもっとも視覚的には認識されにくく理想的である。印刷画像やモニター画像をカメラやスキャナーで読み取るような応用を考えた場合は、青（B）データ（印刷時はイエロー）は精度よく色分離することが難しいため、輝度（Y）データを用いるのが良い。 Next, a method of embedding a watermark in image data will be described. Watermark information is embedded in blue (B) data of color image data consisting of red (R), green (G), and blue (B), or it becomes luminance and color difference signals (Y, Cb, Cr). It is conceivable to convert it into a signal and embed it in luminance (Y) data. When watermark information is handled only in electronic data, embedding it in blue (B) is ideal because it is the least visually recognizable. When considering applications such as reading printed images or monitor images with a camera or scanner, it is difficult to accurately separate blue (B) data (yellow when printing), so luminance (Y) data should be used. is good.

続いて、図７のフローに沿って処理手順を説明する。まず、埋め込み情報をビット列に変換し、wmとする（２０）。例えば、埋め込みたい情報が“0123456789abcdef”となるASCII16文字をビット列に展開すると、wm＝“0011000000110001…”となる128ビット列となる。先頭ビットからi番目のビットをwm(i)とする。 Next, a processing procedure will be described along the flow of FIG. First, the embedded information is converted into a bit string and set to wm (20). For example, if the information to be embedded is "0123456789abcdef" and the ASCII 16 characters are expanded into a bit string, it becomes a 128-bit string wm="0011000000110001...". Let wm(i) be the i-th bit from the first bit.

次に、画像データをR画素×R画素のブロックに分割する（２１）。これはドットパターンのサイズと同じサイズにする必要がある。ブロックサイズとしては、64画素×64画素を用いると高精度に抽出できるが、多くの情報を埋め込みたい場合は32画素×32画素をブロックサイズとすると、4倍の情報の埋め込みが可能となる。 Next, the image data is divided into blocks of R pixels×R pixels (21). It should be the same size as the dot pattern size. A block size of 64 pixels x 64 pixels enables highly accurate extraction, but if a large amount of information is to be embedded, a block size of 32 pixels x 32 pixels enables embedding of four times as much information.

埋め込み作業はブロック単位で行われる。ｎ番目のブロックに埋め込む（２３）として、まず、n=1からスタートする（２２）。埋め込み透かし情報を先頭から連続した２ビットを取り出し、これをb＝（wm(i) , wm(i+1)）とする。 The embedding work is done in block units. As embedding in the n-th block (23), first, start from n=1 (22). Consecutive 2 bits are extracted from the head of the embedded watermark information, and this is defined as b=(wm(i), wm(i+1)).

bの値により埋め込むパターンを選択する（２４）。b=(00)の場合はパターン0を用いて埋め込む（２５a）。同様に、b=(01)の場合はパターン1を用いて埋め込む（２５b）。b=(10)の場合はパターン2を用いて埋め込む（２５ｃ）。b=(11)の場合はパターン3を用いて埋め込む（２５d）。 The pattern to be embedded is selected (24) according to the value of b. If b=(00), embed using pattern 0 (25a). Similarly, when b=(01), pattern 1 is used for embedding (25b). If b=(10), pattern 2 is used for embedding (25c). If b=(11), pattern 3 is used for embedding (25d).

かかる透かし情報の埋め込みは実空間（画素空間）で行う。今、画像データをI(x,y)，埋め込み強度をgain、透かしの埋め込まれた画像データをW(x,y)とすると、
W(x,y)=I(x,y)+gain・Pi(x,y) (i=0,1,2,3) ------（１）
ただし、
b=(00) の時 Pi(x,y)はパターン0
b=(01) の時 Pi(x,y)はパターン1
b=(10) の時 Pi(x,y)はパターン2
b=(11) の時 Pi(x,y)はパターン3
となる。Piは（1,0）の二値であるが，平均輝度を保存するため（1/2,-1/2）とする。埋め込んだ結果、画素データがダイナミックレンジを超えてオーバーフローやアンダーフローが生じることがあるため、あらかじめ画像データに以下の線形変換を施した画像I’(x,y)にて行うこともある。
I’(x,y)=(1-gain)・I(x,y)+gain/2 -----（２）
この処理は後述の透かし情報を完全に除去する場合には必須であるが、除去を必要としない場合や、オーバーフローやアンダーフローの頻度が低い場合は省略しても構わない。
全ブロックが終了かを判断し（２６）、終了でない場合は次のブロックへ移る（２７）。 Such embedding of watermark information is performed in a real space (pixel space). Now, if the image data is I(x,y), the embedding strength is gain, and the watermark embedded image data is W(x,y),
W(x,y)=I(x,y)+gain・Pi(x,y) (i=0,1,2,3)-----(1)
however,
Pi(x,y) is pattern 0 when b=(00)
When b=(01) Pi(x,y) is pattern 1
When b=(10) Pi(x,y) is pattern 2
When b=(11) Pi(x,y) is pattern 3
becomes. Pi is a binary value of (1,0), but it is set to (1/2,-1/2) to preserve the average luminance. As a result of embedding, the pixel data may exceed the dynamic range and overflow or underflow may occur. Therefore, the image I'(x,y) is sometimes subjected to the following linear transformation of the image data in advance.
I'(x,y)=(1-gain)・I(x,y)+gain/2-----(2)
This process is essential for completely removing watermark information, which will be described later, but may be omitted if the removal is unnecessary or if the frequency of overflow or underflow is low.
It is determined whether all blocks are finished (26), and if not finished, the next block is moved to (27).

以上のようにして、埋め込みビット列wm(i)をすべて埋め込む。埋め込みブロック数が埋め込みビット数より多い場合は、繰り返し埋め込むことで信頼度を高めることができる。この場合は文字列の先頭を示す区切りマーカが必要である。
埋め込まれた画像は、Pi(x,y)はランダムドットであるため，ブロック境界は目立たない。グリーンノイズ特性を示すドットパターンは、印刷におけるFMスクリーンとしても用いられ、分散性ドットで均一性にすぐれているため視覚的にも一様で粒状性も感じさせない。 All embedding bit strings wm(i) are embedded as described above. If the number of embedding blocks is larger than the number of embedding bits, the reliability can be increased by repeatedly embedding. In this case, a delimiter marker indicating the beginning of the string is required.
In the embedded image, Pi(x,y) is a random dot, so block boundaries are not conspicuous. The dot pattern, which exhibits green noise characteristics, is also used as an FM screen in printing. Dispersive dots are excellent in uniformity, so they are visually uniform and do not give the impression of graininess.

続いて、透かし情報の抽出について図８の処理フローに沿って説明する。抽出作業は、埋め込み時と同様R×Rのブロックに分割し（３０）、ブロック単位で行われる。n番目のブロックから抽出するものとして、n=1からスタートする（３１）。n番目のブロックからR×Rの画像を取り出しFFT（高速フーリエ変換）を行い、スペクトル情報Wnに変換する（３２）。続いて、ヒストグラムイコライゼーションによる規格化を行う（３３）。これは、ブロック毎に輝度が異なるため、コントラストを向上させて識別精度を向上させることと、後述の信頼度を各ブロックで平等に評価するためである。続いて後述のパターンの識別を行い（４０）、識別パターンから埋め込み情報の２ビットデータを抽出する（３４）。ブロックがすべて終わったか否かを判断し（３５）、終了していない場合はnをn+1として（３６）、次のブロックへ移る。終了した場合は全ビットを取り出して、元の文字情報を復元する（３７）。 Next, extraction of watermark information will be described along the processing flow of FIG. The extraction work is performed in units of blocks by dividing into R×R blocks (30) as in the case of embedding. Starting from n=1 (31), we extract from the nth block. An R×R image is extracted from the n-th block, FFT (Fast Fourier Transform) is performed, and the image is converted into spectral information Wn (32). Subsequently, normalization by histogram equalization is performed (33). This is because the luminance differs for each block, so that the contrast is improved to improve the identification accuracy, and the degree of reliability, which will be described later, is equally evaluated for each block. Subsequently, the pattern is identified as described later (40), and 2-bit data of embedded information is extracted from the identification pattern (34). It is judged whether or not all the blocks have ended (35), and if not, n is set to n+1 (36), and the next block is started. When finished, all bits are taken out to restore the original character information (37).

図９はパターン識別処理（４０）の処理フローを示す。ブロック内のスペクトル情報Wnは、用意されたマスクパターンとのマッチングを行い、その出力Qiを求める（４１）。マスクパターンは図１０に示すようにフィルタパターンと同じで、45°毎に回転したマスク0（M0）、マスク1（M1）、マスク2（M2）、マスク3（M3）の4つが用意されている。
図９に戻って、i番目のマスクをMiとして、Wnとのマッチングの出力Qiは、
Qi=(1/Zi)ΣMi・Wn (i=0,1,2,3) --------（３）
ただし Zi=ΣMi
として求められる（４２）。ただし、Σはブロック内の各画素（今の場合は空間周波数に対応）について行われる。その結果、各マスクに対応した出力（Q0,Q1,Q2,Q3）が得られる。この4つの出力のうち最も大きいものをQmaxとして、
Qmax＝Q0 の時、b=00
Qmax＝Q1 の時、b=01
Qmax＝Q2 の時、b=10
Qmax＝Q3 の時、b=11
となる2ビットの透かし情報ｂを得る（４３および４４a、４４b、４４c、４４d）。 FIG. 9 shows the processing flow of pattern identification processing (40). Spectral information Wn in the block is matched with a prepared mask pattern to obtain its output Qi (41). As shown in Fig. 10, the mask pattern is the same as the filter pattern, and four masks, mask 0 (M0), mask 1 (M1), mask 2 (M2), and mask 3 (M3), rotated by 45° are prepared. there is
Returning to FIG. 9, with Mi as the i-th mask, the output Qi of matching with Wn is
Qi=(1/Zi)ΣMi・Wn (i=0,1,2,3) --------(3)
However, Zi = ΣMi
(42). However, Σ is performed for each pixel in the block (corresponding to spatial frequency in this case). As a result, outputs (Q0, Q1, Q2, Q3) corresponding to each mask are obtained. Letting the largest of these four outputs be Qmax,
When Qmax = Q0, b = 00
When Qmax = Q1, b = 01
When Qmax = Q2, b = 10
When Qmax = Q3, b = 11
2-bit watermark information b is obtained (43 and 44a, 44b, 44c, 44d).

次に、Qmaxの次に大きな出力をQnextとすると、信頼度relは
rel=Qmax-Qnext ------------（４）
で表される。すなわちQnextがQmaxに非常に近い値の場合は誤判定の確率が高く、QmaxとQnextとの差が大きい程信頼性は高い。この差を信頼度の指標とすることができる。
以上のようにして、2ビットの埋め込み情報ｂと信頼度relが求まる。 Next, if the next largest output after Qmax is Qnext, the reliability rel is
rel=Qmax-Qnext ------------(4)
is represented by That is, when Qnext is a value very close to Qmax, the probability of misjudgment is high, and the greater the difference between Qmax and Qnext, the higher the reliability. This difference can be used as an indicator of reliability.
As described above, the 2-bit embedded information b and the reliability rel are obtained.

透かしの除去は、埋め込みを減算することで可能となる。すなわち、式（１)より、
I(x,y)=W(x,y)-gain・Pi(x,y) (i=0,1,2,3) -------------(５)
となり、透かし埋め込み画像から埋め込みパターンにgainを乗じて引くことで、完全に原画に戻すことが可能である。ただし、埋め込み時にオーバーフローやアンダーフローを避けるために線形ダイナミックレンジ補正を行った場合は、逆補正をする必要がある。
以上のようにして透かし除去が可能であるが、埋め込みパターン（パターン0、パターン1、パターン2、パターン3）およびgainが必要となる。
Watermark removal is possible by subtracting the embedding. That is, from formula (1),
I(x,y)=W(x,y)-gain・Pi(x,y) (i=0,1,2,3) -------------(5)
By subtracting the embedding pattern from the watermark-embedded image by multiplying the gain, it is possible to completely restore the original image. However, if linear dynamic range correction is performed to avoid overflow or underflow during embedding, inverse correction must be performed.
Although watermark removal is possible as described above, an embedding pattern (pattern 0, pattern 1, pattern 2, pattern 3) and gain are required.

かかる秘密鍵の容量は、例えばブロックサイズが64× 64の場合、パターン0～パターン3までの4パターン分で16Kビット（ 2Kバイト）、それにgainと埋め込み情報の容量が加わる。32× 32の場合はパターン0～パターン3までの4パターン分で4Kビット（ 512バイト）となる。gainは、通常、100バイト以下であるので、鍵容量の大半は埋め込みパターン情報が占める。
For example, if the block size is 64×64, the capacity of the private key is 16 Kbits (2 Kbytes) for 4 patterns from pattern 0 to pattern 3, plus the capacity of gain and embedded information. In the case of 32x32, four patterns from pattern 0 to pattern 3 are 4K bits (512 bytes). Since gain is usually 100 bytes or less, the embedded pattern information occupies most of the key capacity.

鍵の容量はなるべく小さくした方が取り扱いが容易である。鍵容量を減らすため、パターン2、パターン3は、それぞれ、パターン0、パターン1から生成することができる。すなわち、パターン2はパターン0を90°回転したもの、パターン3はパターン1を90°回転したものを採用する。ドットパターンの回転はそのままスペクトルの回転となるため、他の処理の変更はない。 The smaller the capacity of the key, the easier it is to handle. To reduce the key size, pattern 2 and pattern 3 can be generated from pattern 0 and pattern 1, respectively. That is, pattern 2 is obtained by rotating pattern 0 by 90°, and pattern 3 is obtained by rotating pattern 1 by 90°. Since the rotation of the dot pattern is the same as the rotation of the spectrum, there is no change in other processing.

鍵の紛失や盗難に対しては、画像ごとに異なるドットパターンを用いることにより、同じ鍵で別の画像の透かしを除去することはできなくすることができる。これらは、グリーンノイズパターン生成時の、初期乱数のseed値を変えることによって得られる。スペクトル分布は同一であり，抽出ソフトウェアは共通であるが，ドットプロファイルは異なる。R=64の場合，1つのドットパターンにつき異なるパターン数は，4096 C 2048 個あり、複数のドットパターンを用いる場合は、その積となるため，同じパターンとなる確率は極めて低いため，安全性は高い。ここで、Cはcombinationを表す。著作権者は購入者に対してそれぞれ異なる乱数から求めたドットパターンを秘密鍵として提供する。購入者ごとにこの鍵は異なるため、他の購入者の画像から透かしを除去することはできない。 For loss or theft of a key, using a different dot pattern for each image makes it impossible to remove the watermark of another image with the same key. These are obtained by changing the seed value of the initial random number when generating the green noise pattern. The spectral distribution is the same, the extraction software is common, but the dot profile is different. In the case of R = 64, there are 4096 C 2048 different patterns per dot pattern, and when using multiple dot patterns, the product is the product, so the probability of having the same pattern is extremely low. high. Here, C represents a combination. The copyright holder provides each purchaser with a dot pattern obtained from different random numbers as a secret key. Since this key is different for each purchaser, it is not possible to remove the watermark from another purchaser's image.

次に位相シフトパターンの方法について説明する。
Pi(x,y)はグリーンノイズパターンであるため、境界がない。このため位相シフトしたパターンも目立たない特徴がある。そこでこのパターンの位相をシフトしたパターンPi’(x,y)を生成する。
今、R×Rの基本パターンPi(x,y)の位相をx方向にx0, y方向にy0だけシフトしたものをP’i(x,y)とする。ただし、シフト量は、0≦x0,y0＜R であるとする。P’i(x,y)は以下の式で表される。
P'i(x,y)=Pi((x+x0) mod R, (y+y0) mod R) ----------(６)
ここで、mod R はRを法とした剰余を表す。つまり、x+x0 ,y+y0 の値がRを超えた場合は先頭に戻り、位相シフトしたパターンが得られる。 Next, the phase shift pattern method will be described.
Since Pi(x,y) is a green noise pattern, it has no boundaries. For this reason, the phase-shifted pattern is also inconspicuous. Therefore, a pattern Pi'(x,y) is generated by shifting the phase of this pattern.
Let P'i(x,y) be obtained by shifting the phase of the R×R basic pattern Pi(x,y) by x0 in the x direction and by y0 in the y direction. However, the shift amount is assumed to be 0≤x0,y0<R. P'i(x,y) is represented by the following formula.
P'i(x,y)=Pi((x+x0) mod R, (y+y0) mod R) ----------(6)
where mod R represents the remainder modulo R. In other words, when the values of x+x0 and y+y0 exceed R, it returns to the beginning and a phase-shifted pattern is obtained.

位相シフトされた埋め込みパターンP'i(x,y)を用いて、埋め込みは、位相シフトのない場合と同様に以下のように行われる。
W(x,y)=I(x,y)+gain・P'i(x,y) -------(７)
ただし、P'i(x,y)は式(6)で示されたものである With the phase-shifted embedding pattern P'i(x,y), the embedding is done as follows, as in the case without phase shift.
W(x,y)=I(x,y)+gain・P'i(x,y) -------(7)
where P'i(x,y) is given by equation (6)

図11は、位相シフトされた埋め込みパターンP'i(x,y)を示したもので、(x0.y0)=(0,0）のパターンをベースにx,y方向にR/4,R/2位相シフトした時のパターンである。R=32の時、シフト量ｓは、すべての組み合わせで１６パターンあるが、そのうちの（0,0）、（8,8）、（16,16）、（24,24）、（0,24）、（0,16）、（16,8）、（24,0）の８つのパターンを示す。実際の運用時は、シフト量が知られないように、乱数発生器でシフト量ｓを決める。図12は乱数で0～（W-1）までの整数値を発生させたもので、シフト量ｓは、1024パターンあるが、そのうちの（22,25）、（23,17）、（6,17）、（28,13）、（30,8）、（9,14）、（19,14）、（30,0）のs0からｓ7までの８パターンのみを示す。 Fig. 11 shows the phase-shifted embedding pattern P'i(x,y). Based on the pattern of (x0.y0)=(0,0), R/4,R This is the pattern when phase-shifted by /2. When R = 32, the shift amount s has 16 patterns in all combinations, of which (0, 0), (8, 8), (16, 16), (24, 24), (0, 24 ), (0,16), (16,8), and (24,0). During actual operation, the shift amount s is determined by a random number generator so that the shift amount is unknown. Fig. 12 shows random integer values from 0 to (W-1) generated. Shift amount s has 1024 patterns, of which (22,25), (23,17), (6, 17), (28,13), (30,8), (9,14), (19,14), (30,0) 8 patterns from s0 to s7 are shown.

これらのパターンはすべて同じスペクトル特性を示す。図13に位相シフトある場合とない場合のパターンとそのスペクトル分布を示す。一例として、シフト値(x0,y0)=(R/2,R/2)として、R=64の場合(32,32),R=32の場合（16,16）について示す。シフトしてもパターンの境界は分からないし、そのスペクトル形状は変化がない。透かし抽出は位相のシフト量に依存しないため、抽出ソフトは変更や修正は不要である。 All these patterns exhibit the same spectral characteristics. FIG. 13 shows patterns with and without phase shift and their spectral distributions. As an example, shift values (x0, y0)=(R/2, R/2) are shown for (32, 32) for R=64 and (16, 16) for R=32. The shift does not reveal the boundary of the pattern, and its spectral shape does not change. Since watermark extraction does not depend on the amount of phase shift, the extraction software does not need to be changed or modified.

図14に画像に埋め込んだ時の図を示す。画像1は256×256のLENNAの画像に32×32のブロック毎にgain=0.1875で埋め込んだもので、4値のグリーンノイズパターンに対して乱数の位相シフトパターンを用いて埋め込んだもので有る。全部で64個のブロックがあり、16文字（ASCII）が埋め込まれている。同図(b)は埋め込み画像で、(c)はスペクトル画像で埋め込んだ１６文字が正しく抽出出来ている。画像２は輝度値が192の一様画像に埋め込んだもので、埋め込みパターンが識別できるようにgain=0.5の大きな値で意図的に埋め込み、パターンを強調している。これらの図から分かるように、パターンの境界が目立たない。これはグリーンノイズパターンがランダムパターンで境界が無いからである。このため、x0,y0の値をどのように変えても、その境界が目立たない。 Fig. 14 shows a diagram when embedded in an image. Image 1 is a 256×256 LENNA image embedded with gain=0.1875 for each 32×32 block, and a 4-level green noise pattern is embedded using a random number phase shift pattern. There are 64 blocks in total, padded with 16 characters (ASCII). In the same figure, (b) is an embedded image, and (c) is a spectral image in which the 16 embedded characters can be correctly extracted. Image 2 is embedded in a uniform image with a luminance value of 192, and is intentionally embedded with a large value of gain=0.5 to emphasize the pattern so that the embedded pattern can be identified. As can be seen from these figures, the pattern boundaries are inconspicuous. This is because the green noise pattern is a random pattern and has no boundaries. Therefore, no matter how the values of x0 and y0 are changed, the boundary is not conspicuous.

位相変化量はx、y両方向に1画素刻みでR個可能なため全部でR^2 個か可能である。ブロックサイズが32×32の場合、1024個のパターンが可能である。64×64の場合は
4096個のパターンが可能である。しかし、細かなシフト量は攻撃に弱いため、状態数を減らして大きなシフト量で構成することが好ましい。 Since R phase shifts are possible in both the x and y directions in increments of one pixel, a total of R^2 phase shifts is possible. If the block size is 32x32, 1024 patterns are possible. For 64x64
4096 patterns are possible. However, since fine shift amounts are vulnerable to attacks, it is preferable to reduce the number of states and configure a large shift amount.

このP'i(x,y)の位相シフト量（x0,y0）はブロック毎に変え、位相シフトリストL(n)（nはブロック番号）として記憶され鍵に格納される。埋め込まれた透かしの除去は、埋め込みパターンP'i(x,y)、埋め込み強度gain、および位相シフトリストL(n)から、
I(x,y)=W(x,y)-gain・P'i(x,y) --------(８)
として、除去が可能である。完全に原画に戻すためには整数型の演算で丸め誤差を考慮した演算が必要である。埋め込みパターンP'i(x,y)は、式(6)に示すように基本パターンPi(x,y)と位相シフト量から求まるため、鍵には基本パターンPi(x,y)、埋め込み強度gain、および位相シフトリストL(n)を格納する。 The phase shift amount (x0, y0) of this P'i(x, y) is changed for each block and stored as a phase shift list L(n) (where n is the block number) and stored in the key. Removal of the embedded watermark is from the embedding pattern P'i(x,y), the embedding strength gain, and the phase shift list L(n),
I(x,y)=W(x,y)-gain・P'i(x,y) --------(8)
can be removed as In order to completely return to the original image, it is necessary to perform an integer-type operation in consideration of rounding errors. Since the embedding pattern P'i(x,y) is obtained from the basic pattern Pi(x,y) and the phase shift amount as shown in Equation (6), the keys are the basic pattern Pi(x,y) and the embedding strength store gain, and phase shift list L(n).

次に、位相シフトの効果について説明する。第三者からの結託攻撃では、ブロック毎の画像を合成することで埋め込みパターンを推定することが考えられる。すなわち、埋め込みの情報に応じたN個の埋め込みパターンの集合から、Pi(x,y)に対応するブロック別に埋め込み画像パターンCi(x,y)を収集する。続いて、埋め込みパターンCi(x,y)を加算し平均値を求めると、位相シフトを行わない場合は、
Ci(x,y)=(１/N)ΣWi(x,y)
= (１/N)ΣIi(x,y)+gain・(１/N)ΣPi(x,y) ----(8)
となる。ここで、NはPi(x,y)に対応するブロックの数、ΣWi(x,y)は透かしの埋め込まれた画像データの加算、ΣIi(x,y)はブロック分割された画像データの加算を示す。ここでNを無限にすると、
(１/N)ΣIi(x,y)→const（画像のDC成分） ------(9)
(１/N)ΣPi(x,y) →Pi(x,y) -----(10)
となり、したがって、Ci(x,y)は、
Ci(x,y)=gain・Pi(x,y)＋const -----(11)
となり、Pi(x,y)が抽出されてしまう。
一方、位相シフトを行った埋め込み画像においては,P'i(x,y)は位相がランダムに変わるため、
(１/N)ΣP'i(x,y) →const -------(12)
となり、埋め込みパターンPi(x,y)は抽出できない。 Next, the effect of phase shift will be described. In a collusion attack from a third party, it is conceivable to estimate an embedding pattern by synthesizing images for each block. That is, from a set of N embedding patterns corresponding to embedding information, embedding image patterns Ci(x,y) are collected for each block corresponding to Pi(x,y). Next, when the embedding pattern Ci(x,y) is added and the average value is calculated, if the phase shift is not performed,
Ci(x,y)=(1/N)ΣWi(x,y)
= (1/N)ΣIi(x,y)+gain・(1/N)ΣPi(x,y) ----(8)
becomes. where N is the number of blocks corresponding to Pi(x,y), ΣWi(x,y) is the addition of watermarked image data, and ΣIi(x,y) is the addition of block-divided image data. indicates If N is infinite here,
(1/N)ΣIi(x,y)→const (DC component of image) ------(9)
(1/N)ΣPi(x,y) →Pi(x,y) -----(10)
and therefore Ci(x,y) is
Ci(x,y)=gain・Pi(x,y)＋const-----(11)
As a result, Pi(x,y) is extracted.
On the other hand, in the phase-shifted embedded image, the phase of P'i(x,y) changes randomly, so
(1/N)ΣP'i(x,y) → const -------(12)
As a result, the embedded pattern Pi(x,y) cannot be extracted.

式(12)が成立するためにはNが大きくなければならない。今、画像サイズがW×HであるとしてR×Rのブロック数は[W/R]×[H/R]となる。ここで[・]は少数以下の切り捨てを表す。これを埋め込みパターンPi(x,y)で埋め込むものとすると、一つの埋め込みパターンPi(x,y)の個数は、最大[W/R]×[H/R] となる。例えば256×256画素の画像を32×32サイズのブロックで埋め込むとするとPi(x,y)は、最大の出現回数は６４となる。Pi(x,y)は4値の場合、4つのパターンがあるので、平均出現回数は16となる。したがって、位相シフトのパターンP'i(x,y)が１つのパターンにつき64個以上あれば、画像の中に同じパターンが出現する確率は極めて低い。 N must be large for Equation (12) to hold. Now, assuming that the image size is W×H, the number of R×R blocks is [W/R]×[H/R]. Here, [・] represents rounding down to the nearest decimal place. Assuming that this is embedded with an embedding pattern Pi(x,y), the maximum number of one embedding pattern Pi(x,y) is [W/R]×[H/R]. For example, if an image of 256×256 pixels is embedded in blocks of size 32×32, Pi(x,y) can appear up to 64 times at maximum. If Pi(x,y) has 4 values, there are 4 patterns, so the average number of appearances is 16. Therefore, if there are 64 or more phase shift patterns P'i(x,y) per pattern, the probability of the same pattern appearing in the image is extremely low.

様々な画像サイズに対応するため、一つの埋め込みパターンに対して、256個の位相シフトパターンを用意する。すなわち、x0,y0をそれぞれ W/16, 2W/16, 3W/16, …, 15W/16の位相シフト量を与えた256パターンを生成し、０から255までの番号をつけ、位相シフトリストに埋め込み順のリストを作る。512×512サイズの画像では32×32のブロック分割では256ブロックあり、位相シフトリストの容量は256バイトでよい。さらに大きな画像に対してはこの位相シフトリストのデータを繰り返して用いる。 To accommodate various image sizes, 256 phase shift patterns are prepared for one embedding pattern. That is, 256 patterns are generated by giving x0 and y0 a phase shift amount of W/16, 2W/16, 3W/16, …, 15W/16 respectively, numbered from 0 to 255, and put in the phase shift list Make a list in embedding order. An image of 512×512 size has 256 blocks when divided into 32×32 blocks, and the capacity of the phase shift list can be 256 bytes. Data in this phase shift list are used repeatedly for larger images.

鍵のサイズは、32×32のブロックサイズでは、パターン０および１の容量がそれぞれ1024ビット（128バイト）、位相シフトリストが256バイト、それにgain が数バイトの合計520バイト程度である。埋め込み文字情報は抽出時に得ることが出来るので鍵に含ませる必要はない。位相シフトリストLは、パターン数を減らすとさらに小さくできる。さらには、固定された乱数列を用いる場合は位相リストは不要とすることもできる。この場合は固定乱数の先頭位置のみの指定でよいため数バイトでよい。 With a block size of 32×32, the key size is 1024 bits (128 bytes) for patterns 0 and 1, 256 bytes for the phase shift list, and several bytes for the gain, for a total of about 520 bytes. Embedded character information can be obtained at the time of extraction, so it does not need to be included in the key. The phase shift list L can be made even smaller by reducing the number of patterns. Furthermore, if a fixed random number sequence is used, the phase list may not be required. In this case, it is sufficient to specify only the head position of the fixed random number, so a few bytes will suffice.

埋め込み情報量Ｖは、画像サイズをＷ×Ｈとすると、４値であるので、Ｖ＝2×[W/R]×[H/R] ビットであたえられる。例えば、540×530の画像でR=64のブロックで埋め込むとすると、Ｖ＝128 ビットとなり、ASCCI 文字コードで16文字が埋め込み可能である。32×32のブロックで埋め込むとすると、64文字が埋め込み可能である。著作権保護の用途で使用する場合、少なくとも16文字以上が必要となるため、64×64のブロックサイズで埋め込む場合は512×512の画像サイズ以上、32×32のブロックの場合は256×256の画像サイズ以上が必要となる。 Assuming that the image size is W.times.H, the embedded information amount V is four values, so V=2.times.[W/R].times.[H/R] bits. For example, if a 540×530 image is embedded in R=64 blocks, V=128 bits and 16 characters can be embedded in ASCII character code. 64 characters can be embedded if it is embedded in a 32×32 block. When used for copyright protection purposes, at least 16 characters or more are required. Therefore, when embedding in a block size of 64 x 64, the image size is 512 x 512 or more, and when embedding in a block of 32 x 32, it is 256 x 256. Image size or larger is required.

以上のように、本手法は、埋め込みパターンに位相シフトを与えることにより、第三者からの結託攻撃に強靭な電子透かしが得られたものである。この手法は、特許文献5に紹介された反転パターンと組み合わせて用いることによりさらに強靭なものとなる。
また、元々グリーンノイズ拡散法は、各種編集や加工に対する耐性が高いため、本手法による結託攻撃に対する耐性も加わり、あらゆる攻撃に対して強靭であるといえる。 As described above, in this method, by giving a phase shift to the embedding pattern, a digital watermark that is strong against collusion attacks from third parties is obtained. This technique becomes even more robust when used in combination with the reversal pattern introduced in Patent Document 5.
In addition, since the green noise diffusion method is inherently highly resistant to various types of editing and processing, it can be said that this method is strong against all kinds of attacks because it is also resistant to collusion attacks by this method.

以上、本発明の電子透かし法について説明したが、本手法は視認されにくいグリーンノイズ特性を示すドットパターンを埋め込むため，視覚的な画質劣化が少なく，画像の編集や加工などの攻撃に対して強い耐性がある。その理由は、本手法はスペクトル拡散型透かし手法であるからである。通常のスペクトル拡散ではホワイトノイズを重畳するが、本発明ではグリーンノイズを重畳するため、画像のスペクトルとの分離が容易であるからである。 The digital watermarking method of the present invention has been described above, but since this method embeds a dot pattern that exhibits green noise characteristics that are difficult to see, there is little visual deterioration in image quality and it is resistant to attacks such as image editing and processing. Tolerant. The reason is that this method is a spread spectrum watermarking method. While white noise is superimposed in normal spectrum spreading, green noise is superimposed in the present invention, so that it can be easily separated from the spectrum of the image.

そのうえで、位相シフトパターンを用いることにより、埋め込みドットが空間的に拡散され、パターンの解読が極めて困難で、結託攻撃に対しても強い耐性を示す。このため、あらゆる攻撃に対して耐性が強く安全性が高い。このような特徴から、著作権保護の利用形態では、強靭な透かし埋め込みが可能で、様々な環境において、違法使用やコピー、改ざんなどの検出が可能である。 In addition, by using the phase shift pattern, the embedded dots are spatially diffused, making it extremely difficult to decipher the pattern and exhibiting strong resistance to collusion attacks. Therefore, it is highly resistant to all kinds of attacks and has high security. Due to such features, in the form of use for copyright protection, robust watermark embedding is possible, and illegal use, copying, falsification, etc. can be detected in various environments.

透かしの埋め込みは，ブロック単位で実空間にて埋め込むため、大きなサイズの画像など画像サイズに制限がなく、ディザ法並みに高速に処理可能である。このため動画像のリアルタイム埋め込みも可能で、ドライブレコーダや監視カメラなどの利用が可能である。 Since the watermark is embedded in the real space in units of blocks, there is no restriction on the size of the image such as a large size image, and processing can be performed at a high speed equivalent to that of the dither method. For this reason, real-time embedding of moving images is also possible, enabling the use of drive recorders and surveillance cameras.

また本手法は完全可逆型であるので改ざん検知としても利用が可能である。すなわち、画像データのハッシュ値を，署名情報として透かしに埋め込む。改ざん検知を行う場合は、鍵により透かしを除去し完全に原画に戻した後、画像のハッシュ値を求め、両者が一致した時改ざんがなかったことを証明する。
この改ざん検知手法は，電子透かし技術が完全に可逆な手法に限る。埋め込んだ結果，画像データが変わるとHash値が変化するからである。結託攻撃による鍵の解読は、このようなシステムを根底から覆すもので有り、それを防ぐためにも本手法が有効である。 In addition, since this method is completely reversible, it can also be used for tampering detection. That is, the hash value of the image data is embedded in the watermark as signature information. When tampering detection is performed, after removing the watermark with the key and completely restoring the original image, the hash value of the image is obtained, and if both match, it is proved that there has been no tampering.
This tampering detection method is limited to a completely reversible electronic watermark technology. This is because the Hash value changes when the image data changes as a result of embedding. Decryption of the key by a collusion attack overturns such a system from the ground up, and this method is also effective in preventing it.

1はスキャナー、2はプリンタ、3はコンピュータシステム、4はROM, 5はRAM,6はプログラムメモリ、７はデータメモリ、８はモニター、9はキーボード、10は通信機能、11はCPU、１２はインターネット、１３は画像データの配信、１４は購入希望の連絡、１５は秘密鍵の送付、１６は著作権者のコンピュータ、１７は購入者のコンピュータを表す。 1 is scanner, 2 is printer, 3 is computer system, 4 is ROM, 5 is RAM, 6 is program memory, 7 is data memory, 8 is monitor, 9 is keyboard, 10 is communication function, 11 is CPU, 12 is Internet, 13 image data distribution, 14 purchase request notification, 15 secret key delivery, 16 copyright holder's computer, and 17 purchaser's computer.

Claims

In a digital watermarking method in which image data I(x,y) is divided into R×R blocks and different patterns corresponding to watermark information bits are embedded in each block,
The embedding pattern uses multiple different basic patterns Pi(x,y) that exhibit green noise characteristics, and different phase shift amounts (x0,y0) (where 0≤x0,y0<R) for each block. is shifted by x0 in the y direction and y0 in the y direction,
P'i(x,y) = Pi((x+x0) mod R, (y+y0) mod R)
(where P'i(x,y) is the pattern after shift, mod R is the remainder modulo R)
is obtained by the following arithmetic expression,
Watermark embedding uses such a pattern P'i(x,y) as follows:
W(x,y)=I(x,y)+ gain・P'i(x,y) (where gain is embedding strength)
and
The phase shift amount (x0, y0) is generated by a random number generator and stored in a phase shift list L(n) (n is a block number),
Store the basic pattern Pi(x,y), the embedding strength (gain) and the phase shift list L(n) as keys,
Extraction of watermark information involves dividing the watermarked image data into blocks and extracting the embedded bit information from the shape of the Fourier spectrum pattern of the block. The same extraction software can be used regardless of the amount of phase shift. Yes ,
To remove the watermark, the watermarked image W(x,y) is processed using the key
I(x,y)=W(x,y)-gain・P'i(x,y)
A reversible electronic watermarking method that can restore the original image by removing it by the following calculation and embed new information .