JP4131049B2

JP4131049B2 - Signal processing apparatus and signal processing method

Info

Publication number: JP4131049B2
Application number: JP28562398A
Authority: JP
Inventors: 哲二郎近藤; 秀雄中屋; 俊彦浜松; 寿一白木
Original assignee: Sony Corp
Current assignee: Sony Corp
Priority date: 1998-10-07
Filing date: 1998-10-07
Publication date: 2008-08-13
Anticipated expiration: 2018-10-07
Also published as: JP2000115721A

Description

【０００１】
【発明の属する技術分野】
本発明は、予測係数生成方法及び信号処理装置に関し、特に、クラス分類してクラス毎に補間処理等を行う場合に用いられる予測係数を生成するための予測係数生成方法及び生成された予測係数を用いて処理を行う信号処理装置に関するものである。
【０００２】
【従来の技術】
近年において、映像信号や音声信号の解像度あるいはサンプリング周波数等を向上させる技術が種々提案されている。例えば、標準あるいは低解像度に相当する標準テレビジョン信号から、高解像度のいわゆるＨＤＴＶ信号にアップコンバージョンを行う場合や、サブサンプル補間を行う場合には、従来の線形補間による手法よりも、例えば特開平６−１７８２７７号公報や特開昭７−７９４１８号公報に開示されるようなクラス分類を伴う変換処理を行う方が、性能的に良好な結果を得られることが知られている。
【０００３】
このクラス分類を伴う変換処理とは、例えば、標準あるいは低解像度に相当する標準テレビジョン信号（ＳＤ信号）を高解像度の信号（ＨＤ信号）に変換する場合に、入力されたＳＤ信号をクラス分類してクラスコードを発生し、このクラスコードを用いてクラス毎に予め設定されている予測係数を選んで上記入力ＳＤ信号を予測演算処理することにより、ＨＤ信号を得るものである。上記予測演算処理には例えば線形１次式が用いられる。
【０００４】
上記クラス分類を伴う変換処理は、上述したようなＳＤ信号からＨＤ信号へのアップコンバージョン以外にも、映像信号の走査方式についてインターレース／プログレッシブ変換を行う場合や、走査線数を変換する場合や、ぼけた画像を鮮明な画像に変換する場合や、電子ズーム処理や、フレーム数変換処理（例えば１秒間に２４フレームと３０フレームとの間での変換）や、フォーマット変換処理等の種々の信号変換処理に適用して好ましい結果が得られている。
【０００５】
【発明が解決しようとする課題】
ところで、上述したようなクラス分類を伴う変換処理により、例えば原信号の解像度を向上する場合に、ぼけの小さい部分の改善度に比べてぼけの大きい部分での改善度が低いということがある。このぼけの大きい部分の改善度を上げるには、クラス分類の際に予め画素を間引いておいたり、予測演算の際の予測タップの間隔を間引いて広くとることが考えられる。
【０００６】
しかしながら、上記間引きを施すと、ぼけの小さい部分についての解像度向上が十分に行えなくなるという欠点がある。
【０００７】
本発明は、上述したような実情に鑑みてなされたものであり、ぼけの小さい部分と大きい部分とを有する映像信号等のような入力信号に対して、それぞれの部分に最適な解像度向上等の変換処理を施すことができるような信号処理装置及び信号処理方法の提供を目的とする。
【０００８】
【課題を解決するための手段】
本発明は、上述した課題を解決するために、入力信号の処理対象となる注目位置のデータ及びその周辺の複数データをクラスタップとし、該クラスタップからクラス分類情報を求めて上記入力信号をクラス分類するクラス分類手段と、上記クラス分類手段により分類されたクラスに応じてクラス毎に予め設定された予測係数を出力する予測係数出力手段と、上記入力信号の処理対象となる上記注目位置のデータ及びその周辺の複数データを予測タップとし、該予測タップを上記予測係数出力手段からの予測係数を用いて予測演算して目的とする出力信号のデータの予測値を出力する予測演算手段と、上記入力信号に応じて上記クラス分類手段への入力データ及び上記予測演算手段への入力データの各間引き間隔を制御する間引き制御手段とを有し、上記間引き制御手段は、予め設定された複数の間引き間隔毎に、間引かれたデータを補間したときの値と真値との誤差の絶対値和をアクティビティとして求める演算手段と、この演算手段により求められたアクティビティを上記間引き間隔が小さいものから順に所定の閾値と比較して該所定の閾値を初めて超えたときの間引き間隔を最適間引き率に決定する間引き率決定手段とを有する。
【０００９】
上記間引き制御は、予め設定された複数の間引き間隔についての変化の度合いを表すアクティビティを求め、求められたアクティビティに基づき上記間引き間隔の最適のものを決定することが好ましい。
【００１０】
また上記間引き制御は、上記間引き間隔が小さいものから順に上記アクティビティを求め、各間引き間隔毎のアクティビティが所定の閾値を初めて超えたときの間引き間隔を最適間引き率に決定することが好ましい。
【００１１】
具体的には、上記クラス分類のためにクラスタップを生成し、上記予測演算のために予測タップを生成しており、これらのクラスタップ及び予測タップについての間引き率を制御する。
【００１２】
入力信号の処理対象となる注目位置におけるクラスタップ及び予測タップの間隔を最適に間引いて、原入力信号のぼけの小さい部分では短いタップ間隔で、ぼけの大きい所では長いタップ間隔でそれぞれ処理を行う。
【００１３】
【発明の実施の形態】
以下、本発明に係る実施の形態について図面を参照しながら説明する。
先ず、本発明に係る信号処理装置として、上述したクラス分類を伴う変換処理を行う信号変換装置の具体例を、図１を参照しながら説明する。この具体例では、特に、画像のＳＤ信号をＨＤ信号に解像度変換する画像情報変換装置を想定している。
【００１４】
ところで、上述のクラス分類を伴う変換処理におけるクラス分類のためには、現在の処理対象となる注目位置のデータ及びその周辺の複数データが使用され、これらのデータは、フィルタのタップに対応することから、クラスタップと称される。また、上記予測演算処理についても、注目位置のデータ及びその周辺の複数データが使用され、これらのデータは予測フィルタのタップであることから、予測タップと称される。
【００１５】
図１において、入力端子１１には標準解像度のＳＤ信号が供給され、間引き制御部２０を介して、予測タップ生成回路１２及びクラスタップ生成回路１５に送られている。予測タップ生成回路１２から出力される予測タップは、予測演算回路１３に送られて予測演算される。また、クラスタップ生成回路１５からのクラスタップは、クラス分類回路１６に供給されてクラス分類が行われ、決定されたクラスのインデックスをアドレスとして予測係数ＲＯＭ１７がアクセスされて予測係数が読み出され、予測演算回路１３に送られる。
【００１６】
予測演算回路１３は、推定演算回路あるいは補間値生成回路とも称され、クラス毎に適応的に選択されて予測係数ＲＯＭ１７から供給された予測係数を用いて、予測タップ生成回路１２からの予測タップに基づいて予測演算を行うことで、入力されたＳＤデータに対するＨＤデータを算出するものである。予測演算回路１３にて予測演算されることにより得られたＨＤ信号は、出力端子１４より取り出される。
【００１７】
クラス分類回路１６は、クラスタップ生成回路１５からのクラスタップを、例えばＡＤＲＣ（適応型ダイナミックレンジ符号化）することによりクラス情報を出力する。このクラス分類回路１６における適応型ダイナミックレンジ符号化とは、特開昭６１−１４４９８９号公報等に開示されるように、本来ビデオ信号の高能率圧縮符号化のために開発されたものであるが、ビデオ信号のレベル分布のパターンを短い語長で効率的に表現できることから、このＡＤＲＣ符号化データに基づいてクラス分類のための情報を得ているものである。すなわち、ビデオ信号の符号化単位となるブロック内のダイナミックレンジＤＲは、当該ブロック内の最大値ＭＡＸ及び最小値ＭＩＮにより、
ＤＲ＝ＭＡＸ−ＭＩＮ＋１
と表せ、ブロック内の各画素データのレベルＬをｎビットに再量子化するときの再量子化コードＱは、
Ｑ＝［（Ｌ−ＭＩＮ＋0.５）・２ⁿ／ＤＲ］
ただし、［］は切り捨て処理を意味する
となる。上記ブロック内の再量子化コードＱを用いて、当該ブロックのビデオ信号をクラス分類するための情報を求める。
【００１８】
このクラス情報は、クラスコードあるいはクラスのインデックスとして予測係数ＲＯＭ１７に送られ、クラスに応じた予測係数が読み出され、予測演算回路１３に送られる。
【００１９】
次に、間引き制御部２０について説明する。
間引き制御部２０は、入力信号の変化の程度、例えば変化がなだらかか否か、あるいは画像信号の場合のぼけ具合等に応じて、上記クラスタップや予測タップの間引き率を変化させるような適応的な制御を行うものである。具体的に、図１の間引き制御部２０では、入力端子１１からの上記ＳＤ信号がアクティビティ演算回路２１に送られてアクティビティが演算され、間引き率決定回路２２はこのアクティビティに応じて間引き率を決定する。決定された間引き率により、予測タップ生成回路１２及びクラスタップ生成回路１５のタップの間引き率が制御される。間引き制御部２０の遅延回路２３は、アクティビティ演算回路２１及び間引き率決定回路２２での処理時間だけ入力信号を遅延するものである。
【００２０】
このような入力信号のアクティビティに基づく間引き率の制御について、図２〜図４を参照しながら説明する。ここでは、簡単のために、１次元のクラスタップ及び予測タップを用いて画像の解像度変換処理を行うものと仮定するが、２次元、３次元へも容易に拡張できることは勿論である。
【００２１】
図２の（Ａ）に示すように、入力画素が１次元方向にサンプリングされているものとし、現在処理の対象となる注目画素を中心として５個のクラスタップと１３個の予測タップをとるものとする。注目画素の近傍に高域成分がある場合には、タップの間隔を詰めて解像度変換処理を行うと解像度向上の改善度が高いが、注目画素の近傍に高域成分が少なく、なだらかに変化している場合には、改善度が少ないことが分かっている。その場合、図２の（Ｂ）のように間引き間隔を開けて面積的に広い部分をカバーするようにタップをとることにより、解像度が向上する。この場合、入力画像信号のアクティビティと間引き率と改善度との間には最適な関係に着目し、本発明の実施の形態においては、入力画像信号のアクティビティを演算して求め、求められたアクティビティに基づいて、クラスタップ及び予測タップの間引き率を決定している。
【００２２】
なお、図２の（Ａ）、（Ｂ）中の斜線を施した画素が、入力画素から取り出されて次段に送られるクラスタップ及び予測タップを示し、（Ａ）は高域成分が多いとき、（Ｂ）は高域成分が少ないときをそれぞれ示している。
【００２３】
ここでアクティビティの具体例としては、図２の（Ａ）のような密な（間引きなしの）１次元のクラスタップを基本とし、図２の（Ｂ）のような１以上の画素を間引いたクラスタップにおける補間誤差の絶対値和をアクティビティとみなしている。このようにして得られたアクティビティを所定の閾値ｔｈと比較することで、最適の間引き間隔を決定する。
【００２４】
図３の（Ａ）は、１次元の入力信号を、縦軸をレベル方向として示したものであり、１間引きでクラスタップをとる例を示している。間引きされた後に得られる５個のクラスタップの画素値をＸ₁〜Ｘ₅とし、これらのタップ間の間引かれる画素値をＹ₁〜Ｙ₄とするとき、中央のＸ₃ が注目画素である。このときのアクティビティ（すなわちこの実施の形態での補間誤差の絶対値和の平均）Ａ₁ は、
Ａ₁ ＝｛Σ｜Ｙ_i −（Ｘ_i−Ｘ_i+1）／２｜｝／４
ただし、１≦ｉ≦４
の式で表される。
【００２５】
また、図３の（Ｂ）は、同様に２間引きの場合の例を示したものである。この場合、５個のクラスタップ（画素値Ｘ₁〜Ｘ₅）に対して、タップ間毎にそれぞれ２個ずつの画素が間引かれ、これらを順にＹ₁,Ｙ₂、Ｙ₃,Ｙ₄、Ｙ₅,Ｙ₆、Ｙ₇,Ｙ₈とする。この図３の（Ｂ）のときのアクティビティ（補間誤差の絶対値和の平均）Ａ₂ は、
Ａ₂ ＝｛Σ｜Ｙ_2i-1 −（２Ｘ_i＋Ｘ_i+1）／３｜＋
｜Ｙ_2i −（Ｘ_i＋２Ｘ_i+1）／３｜｝／８
ただし、１≦ｉ≦４
の式で表される。
【００２６】
以下、同様にして、間引き間隔を増やしたときの補間誤差の絶対値和の平均値（Ａ₃、Ａ₄、・・・）を求めていく。なお、間引き無しのときは、アクティビティＡ₀ として、
Ａ₀ ＝ α｛Σ｜Ｘ_i−Ｘ_i+1｜｝／４
ただし、α＜１，１≦ｉ≦４
の式で示される隣接画素差分の絶対値和の平均値を用いる。
【００２７】
以上のようにして求めたアクティビティをＡ₀ から順次、所定の閾値ｔｈと比較し、閾値より初めて大きくなったときのアクティビティが示す間引き率を最適間引き率と決定する。
【００２８】
これを論理式のように表すと、
ｉｆ（Ａ₀ ＞ｔｈ）間引き無し
else if （Ａ₁ ＞ｔｈ）１間引き
else if （Ａ₂ ＞ｔｈ）２間引き
else if （Ａ₃ ＞ｔｈ）３間引き
else if （Ａ₄ ＞ｔｈ）４間引き
else if （Ａ₅ ＞ｔｈ）５間引き
・・・
となる。
【００２９】
図４は、図１の間引き率決定回路２２における、上記アクティビティに基づく間引き率の決定の手順を説明するためのフローチャートであり、８間引きまで行う場合の具体例を示している。
【００３０】
この図４において、最初のステップＳ４１では、上記隣接画素差分の絶対値和の平均値により求められたアクティビティＡ₀ が、所定の閾値ｔｈよりも大きいか否かを判別している。ＹＥＳのときにはステップＳ４２に進み、間引き率決定回路２２は間引き無しコードを発生して、図１の予測タップ生成回路１２及びクラスタップ生成回路１５に送っている。ステップＳ４１でＮＯと判別されたときには、ステップＳ４３に進み、上記１間引きのときの補間誤差の絶対値和の平均値として求められたアクティビティＡ₁ が、所定の閾値ｔｈよりも大きいか否かを判別している。このステップＳ４３で、ＹＥＳと判別されたときにはステップＳ４４に進んで１間引きコードを発生し、ＮＯと判別されたときには次のステップＳ４５に進んでいる。ステップＳ４５では、上記２間引きのときのアクティビティＡ₂ が、所定の閾値ｔｈよりも大きいか否かを判別し、ＹＥＳと判別されたときにはステップＳ４６に進んで２間引きコードを発生し、ＮＯと判別されたときには次のステップに進んでいる。このような処理を以下同様に３間引き〜６間引きについて順次行い、６間引きのときのアクティビティＡ₆ が所定の閾値ｔｈ以下のときにはステップＳ４７に進んで、上記７間引きのときのアクティビティＡ₇ が所定の閾値ｔｈよりも大きいか否かを判別する。このステップＳ４７で、ＹＥＳと判別されたときにはステップＳ４８に進んで７間引きコードを発生し、ＮＯと判別されたときには次のステップＳ４９に進んで８間引きコードを発生している。なお、上記間引き率決定回路２２にて発生された上記１間引きコード〜８間引きコードは、図１の予測タップ生成回路１２及びクラスタップ生成回路１５に送られることは勿論である。
【００３１】
次に、上記図１に示したクラス分類を伴う変換処理のクラス毎の予測係数を最小自乗法によって求めるための学習装置について、図５を参照しながら説明する。この具体例では、上記図１に示す信号処理装置として、標準あるいは低解像度の映像信号に相当する標準テレビジョン信号（ＳＤ信号）を、高解像度の映像信号であるいわゆるＨＤＴＶ信号（ＨＤ信号）に変換する装置を想定し、このＳＤ信号をＨＤ信号に変換する変換装置に用いられる予測係数を求めるための学習を行うための学習装置を図５に示している。
【００３２】
図５の入力端子５１には、教師信号となる高解像度のＨＤ信号が供給されており、このＨＤ信号をダウンコンバータ５２に送ってローパスフィルタリング処理やダウンサンプリング処理等を施すことにより、ＨＤ信号に対応する擬似的な標準解像度のＳＤ信号を得ている。これらのＨＤ信号及びＳＤ信号は、正規方程式生成回路５３に送られる。また、ダウンコンバータ５２からのＳＤ信号は、間引き制御部６０を介し、予測タップ生成回路５４及びクラスタップ生成回路５５に送られている。
【００３３】
間引き制御部６０は、上記図１の間引き制御部２０と同様な構成を有し、上記ダウンコンバートされたＳＤ信号がアクティビティ演算回路６１に送られてアクティビティが演算され、このアクティビティに基づいて間引き率決定回路６２が予測タップ及びクラスタップの間引き率を決定している。また、遅延回路６３は、アクティビティ演算回路６１及び間引き率決定回路６２での処理時間分だけ入力信号を遅延させている。これらのアクティビティ演算回路６１、間引き率決定回路６２及び遅延回路６３は、図１のアクティビティ演算回路２１、間引き率決定回路２２及び遅延回路２３にそれぞれ対応するものであり、動作は上述した図１の各部と同様であるため、説明を省略する。
【００３４】
間引き制御部６０の遅延回路６３を介した上記ＳＤ信号が供給される予測タップ生成回路５４及びクラスタップ生成回路５５には、間引き制御部６０の間引き率決定回路６２からの間引き制御信号あるいは間引きコードが入力され、各タップの間引き間隔が制御されるようになっている。予測タップ生成回路５４からの予測タップは、正規方程式生成回路５３に送られ、クラスタップ生成回路５５からのクラスタップはクラス分類回路５６でクラス分類されて、クラス情報が正規方程式生成回路５３に送られる。予測タップ生成回路５４、クラスタップ生成回路５５及びクラス分類回路５６は、動作は上述した図１の各予測タップ生成回路１２、クラスタップ生成回路１５及びクラス分類回路１６とそれぞれ同様であるため、説明を省略する。
【００３５】
正規方程式生成回路５３は、クラス毎にそれぞれ正規方程式を生成する回路であり、学習素材が入力し終わった後に得られた各クラス毎の正規方程式を連立方程式解法回路５７に送り、各クラス毎に正規方程式から連立方程式を解いて各クラス毎の予測係数を求め、これらの各クラス毎の予測係数を予測係数メモリ５８に記憶する。この予測係数メモリ５８の記憶内容が、図１の予測係数ＲＯＭ１７に書き込まれて用いられる。
【００３６】
以上説明したような本発明の実施の形態によれば、入力信号の処理対象となる注目位置におけるクラスタップ及び予測タップの間隔を最適に間引いて、原入力信号のぼけの小さい所では短いタップ間隔で、ぼけの大きい所では長いタップ間隔でそれぞれ処理を行うことにより、ぼけの小さい部分でも、ぼけの大きい部分でも、改善度の低下がなく、より一層の解像度の向上が図れる。
【００３７】
なお、本発明は上述した実施の形態のみに限定されるものではなく、例えば、アクティビティ演算回路２１、間引き率決定回路２２及び遅延回路２３は、ハードウェア的に表しているが、ソフトウェアにより実現できることは勿論である。また、標準解像度の映像信号（ＳＤ信号）を高解像度の映像信号（ＨＤ信号）に変換する場合のみに限定されず、所定解像度の映像信号を他の解像度の映像信号に変換する場合や、所定のサンプリング周波数あるいはサンプルビット数のオーディオ信号を、他のサンプリング周波数やサンプルビット数のオーディオ信号に変換する場合等にも適用できることは勿論である。
【００３８】
【発明の効果】
本発明によれば、入力信号をクラス分類し、分類されたクラスに応じた予測係数予測係数を用いて予測演算して予測値を出力する際に、上記入力信号に応じて上記クラス分類及び上記予測演算の対象となるデータの各間引き間隔を制御することにより、入力信号の各部に応じた最適の間引き率での信号処理が行われ、より一層の品質向上が図れる。
【００３９】
また、上記間引き制御として、予め設定された複数の間引き間隔についての変化の度合いを表すアクティビティを求め、上記間引き間隔が小さいものから順に上記アクティビティを求め、各間引き間隔毎のアクティビティが所定の閾値を初めて超えたときの間引き間隔を最適間引き率に決定することにより、最適な間引き間隔を合理的に求めることができる。
【００４０】
また、上記クラス分類のためにクラスタップを生成し、上記予測演算のために予測タップを生成しており、これらのクラスタップ及び予測タップについての間引き率を制御し、入力信号の処理対象となる注目位置におけるクラスタップ及び予測タップの間隔を最適に間引いて、原入力信号のぼけの小さい部分では短いタップ間隔で、ぼけの大きい部分では長いタップ間隔でそれぞれ処理を行うことにより、ぼけの小さい部分でも、ぼけの大きい部分でも、改善度の低下がなく、より一層の品質向上が図れる。
【図面の簡単な説明】
【図１】本発明の実施の形態として、クラス分類を伴う変換処理を行う画像情報変換装置の一例を示すブロック図である。
【図２】入力信号から取り出されるクラスタップ及び予測タップについて、高域成分が多いときと少ないときとをそれぞれ示す図である。
【図３】入力信号から取り出されるクラスタップの信号変化について、１間引きのときと２間引きのときとをそれぞれ示す図である。
【図４】入力信号のアクティビティに基づく間引き率決定の手順を説明するためのフローチャートである。
【図５】本発明の実施の形態を説明するための、クラス分類を伴う変換処理の予測係数を求めるための学習装置の一例を示すブロック図である。
【符号の説明】
１２予測タップ生成回路、１３予測演算回路、１５クラスタップ生成回路、１６クラス分類回路、１７予測係数ＲＯＭ、２０間引き制御部、２１アクティビティ演算回路、２２間引き率決定回路[0001]
BACKGROUND OF THE INVENTION
The present invention relates to a prediction coefficient generation method and a signal processing apparatus, and more particularly to a prediction coefficient generation method and a generated prediction coefficient for generating a prediction coefficient used when classifying and performing interpolation processing or the like for each class. The present invention relates to a signal processing apparatus that performs processing by using the signal processing apparatus.
[0002]
[Prior art]
In recent years, various techniques for improving the resolution or sampling frequency of video and audio signals have been proposed. For example, in the case of up-conversion from a standard television signal corresponding to standard or low resolution to a so-called HDTV signal of high resolution, or in the case of performing sub-sample interpolation, for example, Japanese Patent Laid-Open It is known that a better result can be obtained by performing the conversion process with classification as disclosed in JP-A-6-178277 and JP-A-7-79418.
[0003]
The conversion process accompanied by the class classification is, for example, when converting a standard television signal (SD signal) corresponding to a standard or low resolution into a high resolution signal (HD signal), and classifying the input SD signal into a class classification. Then, a class code is generated, and a prediction coefficient preset for each class is selected using this class code, and the input SD signal is subjected to a predictive calculation process to obtain an HD signal. For the prediction calculation process, for example, a linear linear expression is used.
[0004]
In addition to the up-conversion from the SD signal to the HD signal as described above, the conversion process with the above classification is performed when performing interlace / progressive conversion for the scanning method of the video signal, converting the number of scanning lines, Various signal conversions such as converting a blurred image into a clear image, electronic zoom processing, frame number conversion processing (for example, conversion between 24 and 30 frames per second), format conversion processing, etc. Favorable results have been obtained when applied to processing.
[0005]
[Problems to be solved by the invention]
By the way, in the case of improving the resolution of the original signal by the above-described conversion process with classification, for example, the degree of improvement in the portion with large blur may be lower than that in the portion with small blur. In order to increase the degree of improvement in the portion where the blur is large, it is conceivable that pixels are thinned out in advance during classification, or that the interval between prediction taps during prediction calculation is thinned out.
[0006]
However, when the above thinning is performed, there is a drawback that the resolution cannot be sufficiently improved for a portion with a small blur.
[0007]
The present invention has been made in view of the above-described circumstances. For an input signal such as a video signal having a small blur portion and a large blur portion, an optimal resolution improvement for each portion is performed. It is an object of the present invention to provide a signal processing apparatus and a signal processing method that can perform conversion processing.
[0008]
[Means for Solving the Problems]
In order to solve the above-described problem, the present invention uses the data of the target position to be processed of the input signal and a plurality of data around it as a class tap, obtains class classification information from the class tap, and classifies the input signal as a class. Class classification means for classifying, prediction coefficient output means for outputting a prediction coefficient preset for each class according to the class classified by the class classification means, and data on the target position to be processed by the input signal And a plurality of data around it as a prediction tap, a prediction calculation means for predicting and calculating the prediction tap using a prediction coefficient from the prediction coefficient output means, and outputting a prediction value of the data of the target output signal; and in response to an input signal and a decimation control means for controlling each sampling interval of the input data to the input data and the prediction calculation means to said classification means The thinning-out control means includes an arithmetic means for obtaining, as an activity, an absolute value sum of errors between a value obtained by interpolating the thinned data and a true value at a plurality of preset thinning intervals, The activity obtained by the means is compared with a predetermined threshold value in order from the smallest thinning interval, and the thinning rate determining unit determines the thinning interval as the optimum thinning rate when the predetermined threshold value is exceeded for the first time .
[0009]
In the thinning control, it is preferable to obtain an activity representing the degree of change for a plurality of preset thinning intervals, and to determine the optimum thinning interval based on the obtained activity.
[0010]
In the thinning-out control, it is preferable that the activities are obtained in order starting from the smallest thinning-out interval, and the thinning-out interval when the activity for each thinning-out interval exceeds a predetermined threshold for the first time is determined as the optimum thinning-out rate.
[0011]
Specifically, class taps are generated for the class classification, prediction taps are generated for the prediction calculation, and the thinning rate for these class taps and prediction taps is controlled.
[0012]
Optimal thinning of the class tap and prediction tap intervals at the target position where the input signal is to be processed, and processing is performed with a short tap interval when the blur of the original input signal is small and with a long tap interval when the blur is large .
[0013]
DETAILED DESCRIPTION OF THE INVENTION
Embodiments according to the present invention will be described below with reference to the drawings.
First, as a signal processing apparatus according to the present invention, a specific example of a signal conversion apparatus that performs the conversion process with the above-described class classification will be described with reference to FIG. In this specific example, an image information conversion device that converts the resolution of an image SD signal into an HD signal is assumed.
[0014]
By the way, for the class classification in the conversion process with the class classification described above, the data of the current position to be processed and a plurality of data around it are used, and these data correspond to the taps of the filter. Therefore, it is called a class tap. The prediction calculation process is also referred to as a prediction tap because it uses the data of the position of interest and a plurality of data around it, and these data are taps of the prediction filter.
[0015]
In FIG. 1, a standard resolution SD signal is supplied to the input terminal 11 and sent to the prediction tap generation circuit 12 and the class tap generation circuit 15 via the thinning control unit 20. The prediction tap output from the prediction tap generation circuit 12 is sent to the prediction calculation circuit 13 for prediction calculation. The class tap from the class tap generation circuit 15 is supplied to the class classification circuit 16 for class classification, and the prediction coefficient ROM 17 is accessed by using the determined class index as an address, and the prediction coefficient is read out. It is sent to the prediction calculation circuit 13.
[0016]
The prediction calculation circuit 13 is also referred to as an estimation calculation circuit or an interpolation value generation circuit. The prediction calculation circuit 13 uses the prediction coefficient adaptively selected for each class and supplied from the prediction coefficient ROM 17 as a prediction tap from the prediction tap generation circuit 12. Based on the prediction calculation, HD data for the input SD data is calculated. The HD signal obtained by the prediction calculation in the prediction calculation circuit 13 is taken out from the output terminal 14.
[0017]
The class classification circuit 16 outputs class information by subjecting the class tap from the class tap generation circuit 15 to, for example, ADRC (adaptive dynamic range coding). The adaptive dynamic range coding in the class classification circuit 16 is originally developed for high-efficiency compression coding of video signals, as disclosed in Japanese Patent Application Laid-Open No. 61-144899. Since the level distribution pattern of the video signal can be efficiently expressed with a short word length, information for class classification is obtained based on this ADRC encoded data. That is, the dynamic range DR in the block that is a coding unit of the video signal is determined by the maximum value MAX and the minimum value MIN in the block,
DR = MAX-MIN + 1
The requantization code Q when the level L of each pixel data in the block is requantized to n bits is
Q = [(L−MIN + 0.5) · 2 ⁿ / DR]
However, [] means truncation processing. Information for classifying the video signal of the block is obtained using the requantization code Q in the block.
[0018]
This class information is sent to the prediction coefficient ROM 17 as a class code or a class index, and a prediction coefficient corresponding to the class is read out and sent to the prediction calculation circuit 13.
[0019]
Next, the thinning control unit 20 will be described.
The thinning control unit 20 is adapted to change the thinning rate of the class tap or the prediction tap according to the degree of change of the input signal, for example, whether the change is gentle or the degree of blur in the case of the image signal. Control. Specifically, in the thinning control unit 20 in FIG. 1, the SD signal from the input terminal 11 is sent to the activity calculation circuit 21 to calculate the activity, and the thinning rate determination circuit 22 determines the thinning rate according to this activity. To do. The tap thinning rate of the prediction tap generation circuit 12 and the class tap generation circuit 15 is controlled by the determined thinning rate. The delay circuit 23 of the thinning control unit 20 delays the input signal by the processing time in the activity calculation circuit 21 and the thinning rate determination circuit 22.
[0020]
The control of the thinning rate based on the activity of the input signal will be described with reference to FIGS. Here, for the sake of simplicity, it is assumed that the resolution conversion processing of an image is performed using a one-dimensional class tap and a prediction tap, but it goes without saying that it can be easily extended to two dimensions and three dimensions.
[0021]
As shown in FIG. 2A, it is assumed that the input pixel is sampled in a one-dimensional direction and takes five class taps and 13 prediction taps around the target pixel to be processed. And If there is a high frequency component in the vicinity of the target pixel, resolution improvement processing with a narrow tap interval will improve resolution, but there will be little high frequency component in the vicinity of the target pixel and it will change smoothly. If so, it is known that there is little improvement. In that case, the resolution is improved by taking a tap so as to cover a wide area with a thinning interval as shown in FIG. In this case, paying attention to the optimum relationship among the activity of the input image signal, the thinning rate, and the improvement level, in the embodiment of the present invention, the activity of the input image signal is obtained by calculation. Based on the above, the thinning rate of the class tap and the prediction tap is determined.
[0022]
2A and 2B, the hatched pixels indicate class taps and prediction taps that are extracted from the input pixels and sent to the next stage, and FIG. 2A is when there are many high-frequency components. , (B) respectively show a case where the high frequency component is small.
[0023]
Here, a specific example of the activity is based on a dense (without thinning) one-dimensional class tap as shown in FIG. 2A, and one or more pixels as shown in FIG. 2B are thinned out. The sum of absolute values of interpolation errors in class taps is regarded as activity. The optimum thinning-out interval is determined by comparing the activity thus obtained with a predetermined threshold th.
[0024]
FIG. 3A shows a one-dimensional input signal with the vertical axis as the level direction, and shows an example in which class taps are taken by one decimation. When the pixel values of the five class taps obtained after the thinning are X _{1 to} X ₅ and the pixel values thinned between these taps are Y ₁ to Y ₄ , the center X ₃ is the target pixel. is there. The activity at this time (that is, the average of the sum of absolute values of interpolation errors in this embodiment) A ₁ is
A ₁ = {Σ | Y _i − (X _i −X _{i + 1} ) / 2 |} / 4
However, 1 ≦ i ≦ 4
It is expressed by the following formula.
[0025]
FIG. 3B shows an example in the case of two thinning out similarly. In this case, for the five class taps (pixel values X _{1 to} X ₅ ), two pixels are thinned out between the taps, and these are sequentially assigned to Y ₁ , Y ₂ , Y ₃ , Y _4. , Y ₅ , Y ₆ , Y ₇ , Y ₈ . The activity (average of the sum of absolute values of interpolation errors) A ₂ at (B) in FIG.
A ₂ = {Σ | Y _2i-1 − (2X _i + X _{i + 1} ) / 3 | +
| Y _2i − (X _i + 2X _{i + 1} ) / 3 |} / 8
However, 1 ≦ i ≦ 4
It is expressed by the following formula.
[0026]
In the same manner, the average value (A ₃ , A ₄ ,...) Of the sum of absolute values of interpolation errors when the thinning interval is increased is obtained. When there is no decimation, the activity A ₀ is
A ₀ = α {Σ | X _i −X _{i + 1} |} / 4
However, α <1, 1 ≦ i ≦ 4
The average value of the sum of absolute values of adjacent pixel differences expressed by the following equation is used.
[0027]
The activities obtained as described above are sequentially compared with the predetermined threshold th from A ₀ , and the thinning rate indicated by the activity when it becomes larger than the threshold for the first time is determined as the optimum thinning rate.
[0028]
If this is expressed as a logical expression,
if (A ₀ > th) No decimation
else if (A ₁ > th) 1 decimation
else if (A ₂ > th) 2 decimation
else if (A ₃ > th) 3 decimation
else if (A ₄ > th) 4 thinning
else if (A ₅ > th) 5 decimation
It becomes.
[0029]
FIG. 4 is a flowchart for explaining the procedure for determining the thinning rate based on the activity in the thinning rate determination circuit 22 in FIG. 1, and shows a specific example in the case of performing the thinning rate up to eight.
[0030]
In FIG. 4, in the first step S41, it is determined whether or not the activity A ₀ obtained by the average value of the absolute value sum of the adjacent pixel differences is larger than a predetermined threshold th. If YES, the process proceeds to step S42, where the thinning rate determination circuit 22 generates a code without thinning and sends it to the prediction tap generation circuit 12 and the class tap generation circuit 15 in FIG. When NO is determined in the step S41, the process proceeds to a step S43, and it is determined whether or not the activity A ₁ obtained as an average value of the sum of absolute values of the interpolation errors at the time of the one thinning is larger than a predetermined threshold th. Judging. If it is determined as YES in step S43, the process proceeds to step S44 to generate one thinning code, and if it is determined NO, the process proceeds to the next step S45. In step S45, it is determined whether or not the activity A _{2 at} the time of the two thinning is greater than a predetermined threshold th. If YES, the process proceeds to step S46 to generate a two thinning code and determine NO. When it is done, proceed to the next step. In the same manner, the above processing is sequentially performed for 3 to 6 thinning out. When the activity A _{6 at} the time of 6 thinning is equal to or less than the predetermined threshold th, the process proceeds to step S47, and the activity A _{7 at} the time of the 7 thinning is determined as the predetermined. It is determined whether or not it is larger than the threshold th. If it is determined as YES in step S47, the process proceeds to step S48 to generate a 7 thinning code, and if it is determined to be NO, the process proceeds to the next step S49 to generate an 8 thinning code. Of course, the 1 to 8 decimation codes generated by the decimation rate determination circuit 22 are sent to the prediction tap generation circuit 12 and the class tap generation circuit 15 of FIG.
[0031]
Next, a learning apparatus for obtaining the prediction coefficient for each class of the conversion process with class classification shown in FIG. 1 by the least square method will be described with reference to FIG. In this specific example, as the signal processing apparatus shown in FIG. 1, a standard television signal (SD signal) corresponding to a standard or low resolution video signal is converted into a so-called HDTV signal (HD signal) which is a high resolution video signal. FIG. 5 shows a learning apparatus for performing learning for obtaining a prediction coefficient used in a conversion apparatus that converts this SD signal into an HD signal, assuming a conversion apparatus.
[0032]
The input terminal 51 in FIG. 5 is supplied with a high-resolution HD signal as a teacher signal. The HD signal is sent to the down-converter 52 and subjected to low-pass filtering processing, down-sampling processing, etc. A corresponding pseudo standard definition SD signal is obtained. These HD signal and SD signal are sent to the normal equation generation circuit 53. The SD signal from the down converter 52 is sent to the prediction tap generation circuit 54 and the class tap generation circuit 55 via the thinning control unit 60.
[0033]
The thinning control unit 60 has the same configuration as the thinning control unit 20 in FIG. 1, and the downconverted SD signal is sent to the activity calculation circuit 61 to calculate an activity. Based on this activity, the thinning rate is reduced. The determination circuit 62 determines the thinning rate of the prediction tap and the class tap. The delay circuit 63 delays the input signal by the processing time in the activity calculation circuit 61 and the thinning rate determination circuit 62. The activity calculation circuit 61, the thinning rate determination circuit 62, and the delay circuit 63 correspond to the activity calculation circuit 21, the thinning rate determination circuit 22, and the delay circuit 23 of FIG. 1, respectively. Since it is the same as each part, description is abbreviate | omitted.
[0034]
The prediction tap generation circuit 54 and the class tap generation circuit 55 to which the SD signal is supplied via the delay circuit 63 of the thinning control unit 60 have a thinning control signal or a thinning code from the thinning rate determination circuit 62 of the thinning control unit 60. Is input, and the thinning interval of each tap is controlled. The prediction tap from the prediction tap generation circuit 54 is sent to the normal equation generation circuit 53, the class tap from the class tap generation circuit 55 is classified by the class classification circuit 56, and class information is sent to the normal equation generation circuit 53. It is done. The operations of the prediction tap generation circuit 54, the class tap generation circuit 55, and the class classification circuit 56 are the same as those of the prediction tap generation circuit 12, the class tap generation circuit 15, and the class classification circuit 16 shown in FIG. Is omitted.
[0035]
The normal equation generation circuit 53 is a circuit that generates a normal equation for each class. The normal equation generation circuit 53 sends a normal equation for each class obtained after the learning material is input to the simultaneous equation solving circuit 57, and for each class. A simultaneous equation is solved from the normal equation to obtain a prediction coefficient for each class, and the prediction coefficient for each class is stored in the prediction coefficient memory 58. The stored contents of the prediction coefficient memory 58 are written and used in the prediction coefficient ROM 17 of FIG.
[0036]
According to the embodiment of the present invention as described above, the interval between the class tap and the prediction tap at the target position to be processed of the input signal is optimally thinned, and a short tap interval at a place where the blur of the original input signal is small. Thus, by performing the processing at a long tap interval in a place where the blur is large, there is no decrease in the degree of improvement in a portion where the blur is large or a portion where the blur is large, and the resolution can be further improved.
[0037]
The present invention is not limited to the above-described embodiment. For example, the activity calculation circuit 21, the thinning rate determination circuit 22, and the delay circuit 23 are represented by hardware, but can be realized by software. Of course. In addition, the present invention is not limited to the case of converting a standard resolution video signal (SD signal) to a high resolution video signal (HD signal). Of course, the present invention can also be applied to the case where an audio signal having a sampling frequency or the number of sample bits is converted into an audio signal having another sampling frequency or the number of sample bits.
[0038]
【The invention's effect】
According to the present invention, when the input signal is classified, and the prediction calculation is performed using the prediction coefficient prediction coefficient corresponding to the classified class and the prediction value is output, the classification and the classification are performed according to the input signal. By controlling each thinning interval of the data to be subjected to the prediction calculation, signal processing is performed at an optimum thinning rate corresponding to each part of the input signal, and the quality can be further improved.
[0039]
In addition, as the thinning control, an activity indicating the degree of change in a plurality of preset thinning intervals is obtained, the activities are obtained in order from the smallest thinning intervals, and the activity for each thinning interval has a predetermined threshold value. By determining the thinning interval when it exceeds the first time as the optimum thinning rate, the optimal thinning interval can be reasonably obtained.
[0040]
Further, class taps are generated for the class classification, and prediction taps are generated for the prediction calculation. The decimation rate for these class taps and prediction taps is controlled, and the input signal is processed. By optimizing the interval between the class tap and the prediction tap at the target position, processing is performed with a short tap interval in a portion where the blur of the original input signal is small, and a long tap interval in a portion where the blur is large. However, even in the part where the blur is large, the improvement degree does not decrease and the quality can be further improved.
[Brief description of the drawings]
FIG. 1 is a block diagram illustrating an example of an image information conversion apparatus that performs conversion processing with class classification as an embodiment of the present invention.
FIG. 2 is a diagram showing when a high frequency component is large and when a class tap and a prediction tap extracted from an input signal are large.
FIGS. 3A and 3B are diagrams showing a signal change of a class tap taken out from an input signal, when thinning out one time and when thinning out two times, respectively;
FIG. 4 is a flowchart for explaining a procedure for determining a thinning rate based on an activity of an input signal.
FIG. 5 is a block diagram showing an example of a learning apparatus for obtaining a prediction coefficient of conversion processing with class classification for explaining an embodiment of the present invention.
[Explanation of symbols]
12 prediction tap generation circuit, 13 prediction calculation circuit, 15 class tap generation circuit, 16 class classification circuit, 17 prediction coefficient ROM, 20 decimation control unit, 21 activity calculation circuit, 22 decimation rate determination circuit

Claims

Class classification means for classifying the input signal by obtaining class classification information from the data of the target position to be processed of the input signal and a plurality of data around it as class tap ,
Prediction coefficient output means for outputting a prediction coefficient preset for each class according to the class classified by the class classification means;
The target position data to be processed of the input signal and a plurality of data around it are set as prediction taps, and the prediction tap is predicted using the prediction coefficient from the prediction coefficient output means, and the target output signal is output. A prediction calculation means for outputting a predicted value of the data ;
In response to the input signal have a and decimation control means for controlling each sampling interval of the input data to the input data and the prediction calculation means to said classification means,
The decimation control means is
An arithmetic means for obtaining, as an activity, an absolute value sum of errors between a value obtained by interpolating the thinned data and a true value for each of a plurality of preset thinning intervals;
A decimation rate determination unit that compares the activity obtained by the calculation unit with a predetermined threshold value in order from the smallest decimation interval and determines the decimation interval when the predetermined threshold value is exceeded for the first time as an optimum decimation rate. Signal processing device.

The signal processing apparatus according to claim 1, wherein the class classification information is obtained from encoded data when the class tap is compression encoded .

The signal processing device according to claim 1, wherein the number of data of the prediction tap is larger than the number of data of the class tap .

A class tap generation means is provided in the preceding stage of the class classification means, and a prediction tap generation means is provided in the previous stage of the prediction calculation means, and the thinning-out control means includes each tap in the class tap generation means and the prediction tap generation means. the signal processing apparatus according to claim 1, wherein that controls the thinning ratio for obtaining.

The input signal is a video signal, each tap in the class-tap generating means and the prediction tap generating means, Ru 1D tap or two-dimensional tap der claim 4 signal processing apparatus according.

6. The signal processing apparatus according to claim 5, wherein the target output signal is a video signal having a higher resolution than the input signal.

A class classification step for classifying the input signal by obtaining class classification information from the data of the target position to be processed of the input signal and a plurality of data around the class tap, and class classification information from the class tap ;
A prediction coefficient output step for outputting a prediction coefficient preset for each class according to the class classified by the class classification step ;
The target position data to be processed of the input signal and a plurality of data around it are set as prediction taps, and the prediction tap is predicted using the prediction coefficient from the prediction coefficient output step, and the target output signal A prediction calculation step for outputting a predicted value of the data;
A decimation control step for controlling each decimation interval of the input data to the class classification step and the input data to the prediction calculation step according to the input signal ,
The decimation control step is
A calculation step for obtaining, as an activity, an absolute value sum of errors between a value obtained by interpolating the thinned data and a true value for each of a plurality of preset thinning intervals,
A decimation rate determination step of determining the decimation interval as the optimum decimation rate when the activity obtained by this calculation step is compared with a predetermined threshold in order from the one with the smallest decimation interval and the predetermined threshold is exceeded for the first time Signal processing method.