JP4161685B2

JP4161685B2 - Audio input / output device

Info

Publication number: JP4161685B2
Application number: JP2002326511A
Authority: JP
Inventors: 誠一古田; 一郎赤堀
Original assignee: Denso Corp
Current assignee: Denso Corp
Priority date: 2002-11-11
Filing date: 2002-11-11
Publication date: 2008-10-08
Anticipated expiration: 2022-11-11
Also published as: JP2004165775A

Description

【０００１】
【発明の属する技術分野】
本発明は、音声入出力装置に関する。
【０００２】
【従来の技術】
近年、通信技術の発達により、様々な通話形態によって通話を行う装置が知られている。
【０００３】
例えば、特許文献１では、三人の話者が同時に通話することが可能な携帯端末機が示されている。この携帯端末機は、ＰＤＣ方式とＰＨＳ方式の２つの通信方式によって通信が可能であり、各通信方式毎にエコーキャンセラが設けられている。三人が同時に通話を行う場合、話者Ａは話者Ｂと話者Ｃに対して、異なる通信方式を利用して回線を接続し、通話を行う。話者Ｂと話者Ｃとの間の通話は話者Ａの携帯端末機を介して行われるが、異なる通信方式を利用して通話を行っているため、各通信方式毎に設けられたエコーキャンセラによって話者Ｂと話者Ｃの通話音声が遮断されることなく通話が可能である。
【０００４】
【特許文献１】
特開２０００−１９６７３６号公報
【０００５】
【発明が解決しようとする課題】
一方、このような通話形態の多様化にともない、通話装置の構成も複雑化してきている。特に、通話装置の一機能を担う音声入出力装置はその代表例であり、ハウリング防止機能やエコーキャンセル機能等の搭載により、複雑化の程度も著しい。しかしながら、通話装置の使用に際して課される制約条件によっては、複雑な構成の音声入出力装置は使用できない場合がある。かといって、音声入出力装置の有する機能を省くことによって簡略化を行うと、通話品質が劣化して通話自体が困難となる。
【０００６】
本発明は、前記の問題点を鑑み、簡易な構成で高品質な通話が可能な音声入出力装置の提供を目的とする。
【０００７】
【課題を解決するための手段】
上記目的を達成するために、請求項１に記載の音声入出力装置は、話者の音声を入力して音声信号を生成する複数の音声入力手段の各々が、通話相手方の音声を出力する音声出力手段から等距離に配置され、複数の音声入力手段の各々が生成した各音声信号のレベルの偏差を算出し、これに基づいて送話信号を生成することで、話者の音声とともに音声入力手段に混入した、音声出力手段から出力された音声を、送話信号から除去する処理手段と、処理手段が生成した送話信号を通話相手方へ送信し、通話相手方から送信された受話信号を受信して音声出力手段へ出力する通信手段とを備える音声入出力装置であって、車両における車室の幅を車両の進行方向と平行に２等分する２等分線上に配置されて動作することを特徴とする。
【０００８】
このように、請求項１の音声入出力装置では、話者の音声は通話相手方の音声を出力する音声出力手段から等距離に配置された、複数の音声入力手段から入力される。このとき、複数の音声入力手段は、音声出力手段から等距離に配置されているので、音声出力手段からの音声が各音声入力手段に入力されたとき、等しいレベルの音声信号となる。このため、処理手段が、各音声入力手段が生成した各音声信号のレベルの偏差を算出することで、音声出力手段からの音声に基づく音声信号成分が除去され、話者の音声に基づく送話信号が生成される。このようにして生成された送話信号には、エコーやハウリングの原因となる音声出力手段の出力した信号成分が含まれていないため、送話信号が通信手段によって通話相手方へ送信されたとき、通話相手方はクリアな音声にて話者の音声を聞くことができる。
【０００９】
上述したように、請求項１の音声入出力装置によれば、エコーやハウリングの発生を防止することができ、簡易な構成で高品質な通話が可能となるのである。
なお、話者が各音声入力手段から等距離に位置する場所から音声を発すると、話者の音声から各音声入力手段が生成する音声信号の各々のレベルが全て等しくなり、処理手段が各音声信号のレベルの偏差を算出する際に、話者の音声に基づく音声信号成分が除去されてしまう。しかし、請求項１では、車両における車室の幅を車両の進行方向と平行に２等分する２等分線上に配置するので、話者が車内のどのシートに座って発話しても、話者から各音声入力手段までの距離が等しくならず、話者の音声に基づく音声信号成分が除去されることなく通話が可能となる。
【００１０】
請求項２に記載のように、音声入力手段は２つ設けられ、音声入力手段の各々および音声出力手段は一直線上に配置されることが望ましい。これにより、エコーやハウリングの発生を防止しつつ、本音声入出力装置を小型化することができ、設置が容易となる。
【００１１】
また、請求項３に記載の音声入出力装置は、話者の音声を入力して音声信号を生成する複数の音声入力手段と、通話相手方の音声を出力する音声出力手段と、音声出力手段から複数の音声入力手段に入力される音声による音声信号のレベルが等しくなるように、複数の音声入力手段が生成した音声信号の少なくとも一方を加工する加工手段と、加工手段による加工後に各音声信号のレベルの偏差を算出し、これに基づいて送話信号を生成することで、話者の音声とともに音声入力手段に混入した、音声出力手段から出力された音声を、送話信号から除去する処理手段と、処理手段が生成した送話信号を通話相手方へ送信し、通話相手方から送信された受話信号を受信して音声出力手段へ出力する通信手段とを備える音声入出力装置であって、車両における車室の幅を車両の進行方向と平行に２等分する２等分線上に配置されて動作することを特徴とする。
【００１２】
このように、請求項３の音声入出力装置では、話者の音声は複数の音声入力手段から入力されるが、各音声入力手段が音声出力手段から等距離に配置されていない場合には、各音声入力手段は音声出力手段からの音声が入力されたとき、等しいレベルの音声信号を生成することができない。すなわち、各音声入力手段が生成した音声信号をそのまま利用するだけでは、処理手段は音声出力手段からの音声に基づく音声信号成分を除去することができない。また、各音声入力手段が音声出力手段から等距離に配置されていても、各音声入力手段の有するレベル特性や位相特性のばらつきが大きい場合には、各音声入力手段は音声出力手段からの音声が入力されたとき、等しいレベルの音声信号を生成することができず、処理手段は音声出力手段からの音声に基づく音声信号成分を十分に除去することはできない。さらには、例えば車室内の音響特性によっても、処理手段は音声出力手段からの音声に基づく音声信号成分を除去することができない場合がある。
【００１３】
そこで、請求項３の音声入出力装置では、音声出力手段から各音声入力手段に入力される音声による音声信号のレベルが等しくなるように、各音声入力手段が生成した音声信号の少なくとも一方を加工手段によって加工し、加工後の各音声信号のレベルの偏差を算出して送話信号を生成する。これにより、各音声入力手段の配置、各音声入力手段の有するレベル特性や位相特性のばらつき、および車室内の音響特性によらず、音声出力手段からの音声に基づく音声信号成分を除去して、話者の音声に基づく送話信号を生成することが可能となる。
なお、話者が各音声入力手段から等距離に位置する場所から音声を発すると、話者の音声から各音声入力手段が生成する音声信号の各々のレベルが全て等しくなり、処理手段が各音声信号のレベルの偏差を算出する際に、話者の音声に基づく音声信号成分が除去されてしまう。しかし、請求項３では、車両における車室の幅を車両の進行方向と平行に２等分する２等分線上に配置するので、話者が車内のどのシートに座って発話しても、話者から各音声入力手段までの距離が等しくならず、話者の音声に基づく音声信号成分が除去されることなく通話が可能となる。
【００１４】
請求項４に記載のように、加工手段は、音声信号のレベルを調整する機能を有することが望ましい。各音声入力手段の生成した音声信号のレベルを増幅して調整することにより、各音声入力手段の配置や、各音声入力手段の有するレベル特性のばらつき、および車室内の音響特性による音声信号のレベルの変動を容易に補正することが可能となる。
【００１５】
請求項５に記載のように、加工手段は、音声信号の位相を調整する機能を有することが望ましい。これにより、各音声入力手段の配置、各音声入力手段の有する位相特性のばらつき、および車室内の音響特性による音声信号の位相の変動まで考慮した、高精度な補正が可能となる。
【００１６】
請求項６に記載のように、処理手段が生成した送話信号を検出する検出手段と、検出手段の検出結果に基づいて、加工手段の動作を制御する制御手段とを備えることが望ましい。生成された送話信号のレベルを検出手段によって検出し、その検出結果に基づいて制御手段が加工手段の動作を制御することで、より高精度な音声信号の補正が可能となる。
【００１９】
請求項７に記載のように、音声入出力装置は、車室内のルームライト、ルームミラー、マップランプ、もしくはオーバーヘッドモジュールのいずれかの車室の天井に取り付けられた機器に配置されることが望ましい。これにより、話者の音声に基づく音声信号成分が除去されることなく通話が可能であり、なおかつ、車室内の空間を有効に活用することができるのである。
【００２０】
【発明の実施の形態】
（第１実施形態）
図１は、本発明の第１実施形態である音声入出力装置の全体構成を示すブロック図である。
【００２１】
音声入力手段であるマイクＡ１およびマイクＢ２は、例えば小型のマイクロホンであり、話者の音声を入力して音声信号を生成する。音声出力手段であるスピーカ３は、例えば小型のスピーカであり、通話相手方の音声を出力する。なお、本実施形態におけるマイクＡ１とマイクＢ２とは、いずれもスピーカ３から距離ｄだけ離れて配置され、かつ、マイクＡ１とマイクＢ２とが、スピーカ３を間に挟んで一直線となるように配置されている。これにより、エコーやハウリングの発生を防止しつつ、本音声入出力装置を小型化することができ、設置が容易となるためである。
【００２２】
なお、本音声入出力装置は、車両のルームライトの上方に設置される。これにより、話者がどのシートに座っていても、話者と各マイクとの距離が等しくならず、話者の音声に基づく音声信号成分が除去されることなく通話が可能となるのである（詳細については後述する）。さらには、車室内の空間を有効に活用することもできる。本音声入出力装置の設置場所としては、話者と各マイクとの間の距離が等しくならないような場所、すなわち、車室の幅を車両の進行方向と平行に２等分する２等分線上に設置するのであれば、何処に設置してもよい。
【００２３】
処理手段である信号減算器４は、例えばオペアンプから構成され、マイクＡ１が生成した音声信号のレベルから、マイクＢ２が生成した音声信号のレベルを代数的に減算し、２つの音声信号におけるレベルの偏差を算出して送話信号を生成する。送話信号の生成に関しては、マイクＢ２が生成した音声信号のレベルからマイクＡ１が生成した音声信号のレベルを代数的に減算し、２つの音声信号におけるレベルの偏差を算出して送話信号を生成してもよい。また、音声信号のレベルの減算に関しては、例えばＡ／Ｄ変換器によってマイクＡ１およびマイクＢ２が生成した音声信号のレベルを一旦ディジタル化し、ディジタル減算回路によって減算を行うこととしても良い。また、信号減算器４によって、送信すべき話者の音声信号も減算され、その偏差のレベルが小さくなるため、信号減算器４の前段もしくは後段に増幅器を接続することが好ましい。
【００２４】
通信手段である通信装置５は、例えば車載用無線機であり、信号減算器４が生成した送話信号を通話相手方へ送信する。また、通話相手方から送信された受話信号を受信してスピーカ３へ出力する。通話相手方との通信に関しては、例えばナビゲーション装置に搭載された通信機能を利用することとしても良いし、携帯電話の通信機能を利用することとしても良い。
【００２５】
次に、第１実施形態の音声入出力装置が、送話信号を生成する処理を順に説明する。
【００２６】
話者が発した音声は、マイクＡ１およびマイクＢ２から入力され、それぞれ音声信号が生成される。次に、信号減算器４は、マイクＡ１が生成した音声信号のレベルから、マイクＢ２が生成した音声信号のレベルを代数的に減算して、２つの音声信号のレベルの偏差を算出し、これを送話信号として生成する。本音声入出力装置は、車両のルームライトの上方に設置されていることから、話者が車室内のどのシートに座っていても、話者と各マイクとの間の距離は等しくならない。そのため、各マイクが生成する音声信号の話者の音声に基づく音声信号成分のレベルも等しくならず、信号減算器４が２つの音声信号のレベルを代数的に減算しても、話者の音声に基づく音声信号成分は除去されない。一方、スピーカ３と各マイクとは等距離に配置されているため、各マイクが生成する音声信号のスピーカ３から出力された音声に基づく音声信号成分のレベルは概ね等しくなり、信号減算器４が２つの音声信号のレベルを代数的に減算することによって、スピーカ３から出力された音声に基づく音声信号成分が除去されるのである。
【００２７】
このように、本実施形態の音声入出力装置では、スピーカ３から等距離に配置されたマイクＡ１およびマイクＢ２が生成した各音声信号のレベルを代数的に減算し、２つの音声信号のレベルの偏差を送話信号として生成することにより、マイクＡ１およびマイクＢ２に混入した、スピーカ３から出力された音声を、送話信号から除去することができる。これにより、エコーやハウリングの発生を防止することができ、簡易な構成で高品質な通話が可能となるのである。
（第２実施形態）
図２は、本発明の第２実施形態である音声入出力装置の全体構成を示すブロック図である。図２に示すように、本実施形態では、上述の第１実施形態における音声入出力装置において、増幅器Ａ６および増幅器Ｂ７、レベル検出器８、リモコン９、スイッチング回路１０、ＥＣＵ１１を新たに設けた点が第１実施形態と異なる。
【００２８】
加工手段である増幅器Ａ６および増幅器Ｂ７は、増幅率を変更することが可能な可変増幅器であり、マイクＡ１およびマイクＢ２が生成した各音声信号のレベルを、設定された増幅率に従ってそれぞれ増幅する。各増幅器の増幅率は、ＥＣＵ１１によって変更される。音声信号のレベルの増幅に関しては、例えばＡ／Ｄ変換器によって一旦ディジタル信号に変換した後に増幅することとしても良い。
【００２９】
検出手段であるレベル検出器８は、例えば信号計測器であり、信号減算器４が生成した送話信号のレベルを検出してＥＣＵ１１へ出力する。送話信号のレベルの検出に関しては、例えばＡ／Ｄ変換器によってディジタル信号に変換して検出することとしても良い。
【００３０】
リモコン９は、例えばテンキーと機能キーとを備えた多機能リモコンであり、ＥＣＵ１１に対して増幅器Ａ６と増幅器Ｂ７の増幅率の調整を行うよう指示する。増幅率の調整の指示に関しては、コンピュータから通信ネットワークを介して指示することとしてもよい。また、各増幅器に調整スイッチを設け、ユーザーもしくはサービスマンが手動で調整することとしてもよい。
【００３１】
スイッチング回路１０は、例えばフリップフロップ等から構成され、ＥＣＵ１１の指示に従ってスピーカ３の接続先を通信装置５側もしくはＥＣＵ１１側へ切り換える。スピーカ３の接続先の切り換えに関しては、例えば無接点リレーを用いて行うこととしてもよい。
【００３２】
ＥＣＵ１１は、リモコン９から増幅器Ａ６と増幅器Ｂ７の増幅率の調整を行うよう指示されると、スイッチング回路１０に対してスピーカ３の接続先をＥＣＵ１１側へ切り換えるよう指示する。そして、テスト信号をスピーカ３へ出力し、レベル検出器８が検出した送話信号のレベルが所定値以下となるように、増幅器Ａ６と増幅器Ｂ７の増幅率を変更する。これらの動作は、車両に搭載されたカーナビゲーション装置や通信装置等のＣＰＵに行わせることとしてもよい。
【００３３】
なお、本実施形態のマイクＡ１はスピーカ３から距離ｄ１だけ離れて配置されており、マイクＢ２はスピーカ３から距離ｄ２だけ離れて配置されている。
【００３４】
その他の構成・動作については、前述の第１実施形態と同様なので、説明を省略する。
【００３５】
図３は、第２実施形態の音声入出力装置が、増幅器Ａ６と増幅器Ｂ７の増幅率を調整する処理のフローチャートである。本フローチャートの処理は、話者がリモコン９から各増幅器の増幅率の調整を行うよう指示するたびに実行される。
【００３６】
ステップ３０１では、スイッチング回路１０のスイッチをＥＣＵ１１側へ切り換える。ステップ３０２では、テスト信号の出力を開始する。テスト信号はＴＳＰ信号、正弦波、ホワイトノイズ、インパルスのいずれかでよく、スピーカ３から出力される。
【００３７】
ステップ３０３では、所定時間だけウェイトし、テスト信号から送話信号が生成されるのを待つ。ステップ３０４では、ステップ３０２で出力したテスト信号に応じて、生成された送話信号のレベルを検出する。ステップ３０５では、ステップ３０４で検出した送話信号のレベルの絶対値が所定値Ｋ以下であるか否かを判定する。送話信号のレベルの絶対値が所定値Ｋ以下であれば、ステップ３０７へ進み、テスト信号の出力を終了する。その後、ステップ３０８にてスイッチング回路１０のスイッチを通信装置５側へ切り換え、処理を終了する。一方、ステップ３０５で検出した送話信号のレベルの絶対値が所定値Ｋ以下でない場合は、ステップ３０６へ進み、増幅器Ａ６もしくは増幅器Ｂ７の増幅率を、送話信号のレベルの絶対値がゼロに近づくように変更する。その後、ステップ３０３へ戻り、送話信号のレベルの絶対値が所定値Ｋ以下となるまで、上述の手順を繰り返す。
【００３８】
このように、マイクＡ１およびマイクＢ２がスピーカ３から等距離に配置されていない場合、各マイクが生成する音声信号のうち、スピーカ３からの音声に基づく音声信号成分は等しいレベルとならず、信号減算器４はスピーカ３からの音声に基づく音声信号成分を送話信号から除去することができない。また、マイクＡ１およびマイクＢ２がスピーカ３から等距離に配置されていても、各マイクの有するレベル特性のばらつきが大きい場合には、スピーカ３からの音声に基づく音声信号成分は等しいレベルとならず、信号減算器４はスピーカ３からの音声に基づく音声信号成分を送話信号から十分に除去することはできない。さらには、例えば車室内の音響特性によっても、信号減算器４はスピーカ３からの音声に基づく音声信号成分を送話信号から除去することができない場合がある。そのため、スピーカ３からの音声に基づく音声信号成分のレベルが等しくなるように各増幅器の増幅率を調整し、増幅された各音声信号のレベルの偏差に基づいて送話信号を生成することで、各マイクの配置や、各マイクの有するレベル特性のばらつき、および車室内の音響特性によらず、スピーカ３からの音声に基づく音声信号成分を送話信号から除去することが可能となるのである。また、生成された送話信号をレベル検出器８によって検出し、検出結果に基づいてＥＣＵ１１が増幅器Ａ６および増幅器Ｂ７の増幅率を変更することで、マイクＡ１およびマイクＢ２の配置や、各マイクのレベル特性のばらつき、および車室内の音響特性による音声信号のレベルの変動を高精度に補正することが可能となるのである。
【００３９】
なお、本フローチャートの処理は、本音声入出力装置が使用されていない際に、一定時間毎に自動的に行うこととしても良い。また、本音声入出力装置を搭載した車両のディーラーや工場等で定期的に行うこととしてもよい。
【００４０】
次に、第２実施形態の音声入出力装置が、送話信号を生成する処理を順に説明する。
【００４１】
話者が発した音声は、マイクＡ１およびマイクＢ２から入力され、それぞれ音声信号が生成される。次に、増幅器Ａ６および増幅器Ｂ７は、マイクＡ１およびマイクＢ２によって生成された音声信号を、図３のフローチャートの処理を実行することによって設定された増幅率に従って増幅する。信号減算器４は、増幅器Ａ６によって増幅された音声信号のレベルから、増幅器Ｂ７によって増幅された音声信号のレベルを代数的に減算して、２つの音声信号のレベルの偏差を算出し、スピーカ３から出力された音声の音声信号成分を除去した送話信号を生成する。
【００４２】
このように、本実施形態の音声入出力装置では、マイクＡ１およびマイクＢ２が生成した音声信号のレベルを増幅器Ａ６および増幅器Ｂ７によってそれぞれ増幅し、増幅された各音声信号のレベルの偏差を信号減算器４によって算出して、送話信号を生成する。これにより、各マイクの配置や、各マイクのレベル特性のばらつき、および車室内の音響特性によらず、スピーカ３から出力された音声を送話信号から除去することが可能となるのである。なお、各マイクとスピーカとの距離は等距離であっても良い。また、増幅器は１つだけ設けることとし、どちから一方の音声信号を増幅することとしても良い。
（第３実施形態）
図４は、本発明の第３実施形態である音声入出力装置の全体構成を示すブロック図である。図４に示すように、本実施形態では、上述の第２実施形態における音声入出力装置において、増幅器Ａ６および増幅器Ｂ７に代えて波形成形器Ａ１２および波形成形器Ｂ１３をそれぞれ設け、さらに、レベル検出器８に代えて波形検出器１４を設けた点が第２実施形態と異なる。
【００４３】
波形成形器Ａ１２および波形成形器Ｂ１３は、マイクＡ１およびマイクＢ２が生成した各音声信号の波形を、設定された成形パラメータに従って成形し、各音声信号のレベルおよび位相を成形パラメータに従って変更する。各波形成形器の成形パラメータは可変であり、ＥＣＵ１１によって変更される。音声信号のレベルや位相の変更に関しては、例えばＡ／Ｄ変換器によって音声信号をディジタル信号に変換し、ｚ変換、ｗａｖｅｌｅｔ変換等により行うこととしても良い。
【００４４】
波形検出器１４は、例えば信号計測器であり、信号減算器４が生成した送話信号の波形を検出する。送話信号の波形の検出に関しては、例えばＡ／Ｄ変換器によってディジタル信号に変換して検出することとしてもよい。
【００４５】
本実施形態のリモコン９は、ＥＣＵ１１に対して波形成形器Ａ１２と波形成形器Ｂ１３の成形パラメータの調整を行うよう指示する。
【００４６】
本実施形態のＥＣＵ１１は、リモコン９から波形成形器Ａ１２と波形成形器Ｂ１３の成形パラメータの調整を行うよう指示されると、スイッチング回路１０に対してスピーカ３の接続先をＥＣＵ１１側へ切り換えるよう指示する。そして、テスト信号をスピーカ３へ出力し、波形検出器１４が検出した送話信号の波形と、テスト信号の波形との偏差が所定値よりも小さくなるように、波形成形器Ａ１２と波形成形器Ｂ１３の成形パラメータを変更する。
【００４７】
その他の構成・動作については、前述の第２実施形態と同様なので、説明を省略する。
【００４８】
図５は、第３実施形態の音声入出力装置が、波形成形器Ａ１２と波形成形器Ｂ１３の成形パラメータを調整する処理のフローチャートである。図５に示すように、本実施形態のフローチャートでは、前述の第２実施形態における図３のフローチャートにおいて、生成された送話信号のレベルを検出するステップに代わり、生成された送話信号の波形を検出するステップを設け、送話信号のレベルの絶対値が所定値Ｋ以下であるか否かを判定するステップに代わり、送話信号の波形とテスト信号の波形との偏差が所定値Ｌよりも小さくなるか否かを判定するステップを設ける。また、増幅器Ａ６もしくは増幅器Ｂ７の増幅率を変更するステップに代わり、波形成形器Ａ１２もしくは波形成形器Ｂ１３の成形パラメータを変更するステップを設ける。換言すれば、ステップ５０４、５０５、５０６以外の全ての処理は、前述の第２実施形態における図３のフローチャートの処理と同様であり、その説明は省略する。なお、本フローチャートの処理は、話者がリモコン９から各波形成形器の成形パラメータを調整するよう指示するたびに実行される。
【００４９】
ステップ５０４では、生成された送話信号の波形を検出する。ステップ５０５では、ステップ５０４で検出した送話信号の波形と、ステップ５０２で出力したテスト信号の波形との偏差を算出し、これが所定値Ｌ以下であるか否かを判定する。送話信号の波形とテスト信号の波形との偏差が所定値Ｌ以下でない場合は、ステップ５０６へ進み、波形成形器Ａ１２および波形成形器Ｂ１３の成形パラメータを、送話信号の波形とテスト信号の波形との偏差が小さくなるように変更する。その後、ステップ５０３へ戻り、送話信号の波形とテスト信号の波形との偏差が所定値Ｌ以下となるまで、上述の手順を繰り返す。
【００５０】
このように、生成された送話信号を波形検出器１４によって検出し、検出結果に基づいてＥＣＵ１１が波形成形器Ａ１２および波形成形器Ｂ１３の成形パラメータを変更することで、各マイクの配置や、各マイクのレベル特性および位相特性のばらつき、および車室内の音響特性による音声信号のレベルおよび位相の変動を、より高精度に補正することが可能となるのである。
【００５１】
なお、本フローチャートの処理は、本音声入出力装置が使用されていない際に、一定時間毎に自動的に行うこととしても良い。また、本音声入出力装置を搭載した車両のディーラーや工場等で定期的に行うこととしてもよい。
【００５２】
次に、第３実施形態の音声入出力装置が、送話信号を生成する処理を順に説明する。
【００５３】
話者が発した音声は、マイクＡ１およびマイクＢ２から入力され、それぞれ音声信号が生成される。次に、波形成形器Ａ１２および波形成形器Ｂ１３は、マイクＡ１およびマイクＢ２によって生成された音声信号の波形を、図５のフローチャートの処理を実行することによって設定された成形パラメータに従って、成形する。信号減算器４は、波形成形器Ａ１２によって成形された音声信号のレベルから、波形成形器Ｂ１３によって成形された音声信号のレベルを減算して、２つの音声信号のレベルの偏差を算出し、スピーカ３から出力された音声の音声信号成分を除去した送話信号を生成する。
【００５４】
このように、本実施形態の音声入出力装置では、各マイクが生成した音声信号の波形を波形成形器Ａ１２および波形成形器Ｂ１３によってそれぞれ成形し、信号減算器４は成形された各音声信号のレベルの偏差を算出して送話信号を生成する。これにより、各マイクの配置や、各マイクのレベル特性のばらつき、および車室内の音響特性によらず、スピーカ３から出力された音声を、より高い精度で送話信号から除去することが可能となるのである。また、波形成形器は１つだけ設けることとし、どちから一方の音声信号の波形を成形することとしても良い。
【００５５】
なお、上述した各実施形態において、信号減算器４の後段にエコーキャンセラやハウリング防止回路等を設けてもよい。これにより、本音声入出力装置の構成は複雑化するが、より高品質な通話を行うことが可能となる。また、マイクは３つ以上あっても良いし、スピーカも複数あってもよい。
【００５６】
さらに、上述した各実施形態は、それぞれ単独で用いても良いし、複合して用いても良いことは言うまでもない。
【図面の簡単な説明】
【図１】本発明の第１実施形態である音声入出力装置の全体構成を示すブロック図である。
【図２】本発明の第２実施形態である音声入出力装置の全体構成を示すブロック図である。
【図３】第２実施形態の音声入出力装置が、増幅器Ａと増幅器Ｂの増幅率を調整する処理のフローチャートである。
【図４】本発明の第３実施形態である音声入出力装置の全体構成を示すブロック図である。
【図５】第３実施形態の音声入出力装置が、波形成形器Ａと波形成形器Ｂの成形パラメータを調整する処理のフローチャートである
【符号の説明】
１…マイクＡ、２…マイクＢ、３…スピーカ、４…信号減算器、５…通信装置
[0001]
[Technical Field of the Invention]
The present invention relates to a voice input/output device.
[0002]
[Prior art]
With the recent development of communication technology, devices that support calls in a variety of call modes have become known.
[0003]
For example, Patent Document 1 discloses a portable terminal that allows three speakers to talk simultaneously. This portable terminal can communicate over two communication systems, PDC and PHS, and an echo canceller is provided for each system. When three people talk at the same time, speaker A connects to speaker B and to speaker C over different communication systems. The conversation between speaker B and speaker C passes through speaker A's portable terminal, but because the two lines use different communication systems, each with its own echo canceller, the voices of speaker B and speaker C are not blocked and the call can proceed.
[0004]
[Patent Document 1]
JP 2000-196736 A
[0005]
[Problems to be solved by the invention]
Meanwhile, as call modes have diversified, the configuration of call devices has become more complex. The voice input/output device, which provides one function of a call device, is a representative example: the addition of howling prevention, echo cancellation, and similar functions has made it markedly more complex. However, depending on the constraints imposed on the use of the call device, a voice input/output device with a complex configuration may not be usable. Conversely, if the device is simplified by omitting some of its functions, call quality deteriorates and the call itself becomes difficult.
[0006]
In view of the above problems, an object of the present invention is to provide a voice input/output device that enables high-quality calls with a simple configuration.
[0007]
[Means for Solving the Problems]
To achieve the above object, the voice input/output device according to claim 1 comprises: a plurality of voice input means, each of which inputs a speaker's voice and generates a voice signal, and each of which is arranged at an equal distance from a voice output means that outputs the voice of the other party; processing means that calculates the deviation between the levels of the voice signals generated by the plurality of voice input means and generates a transmission signal based on that deviation, thereby removing from the transmission signal the voice that was output from the voice output means and entered the voice input means together with the speaker's voice; and communication means that transmits the transmission signal generated by the processing means to the other party, receives a reception signal transmitted from the other party, and outputs it to the voice output means. The device is characterized in that it operates while arranged on the bisector that divides the width of the vehicle compartment into two equal parts in parallel with the traveling direction of the vehicle.
[0008]
Thus, in the voice input/output device of claim 1, the speaker's voice is input through a plurality of voice input means arranged at an equal distance from the voice output means that outputs the voice of the other party. Because the voice input means are equidistant from the voice output means, the voice from the voice output means produces voice signals of equal level at each voice input means. The processing means therefore removes the voice signal component based on the voice from the voice output means by calculating the deviation between the levels of the voice signals generated by the voice input means, and a transmission signal based on the speaker's voice is generated. Because the transmission signal generated in this way does not contain the signal component output by the voice output means, which is what causes echo and howling, the other party hears the speaker's voice clearly when the communication means transmits the transmission signal.
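The cancellation principle described above can be sketched numerically. The following is a minimal illustration under simplifying assumptions, not the patented circuit: the far-end sound from the voice output means and the speaker's voice are modeled as pure tones, and the gains `far_gain`, `near_gain_a`, and `near_gain_b` are hypothetical values chosen only so that the far-end level is equal at both voice input means while the speaker's level is not.

```python
import math

def mic_signals(n, fs=8000.0, far_gain=0.5, near_gain_a=0.8, near_gain_b=0.3):
    """Simulate n samples at two microphones (all gains are illustrative assumptions)."""
    mic_a, mic_b, talker = [], [], []
    for i in range(n):
        t = i / fs
        far = math.sin(2 * math.pi * 440.0 * t)   # sound from the voice output means
        near = math.sin(2 * math.pi * 200.0 * t)  # the speaker's own voice
        # Equal distance to the voice output means: same far-end level at both mics.
        mic_a.append(far_gain * far + near_gain_a * near)
        # Unequal distance to the speaker: different near-end level at each mic.
        mic_b.append(far_gain * far + near_gain_b * near)
        talker.append(near)
    return mic_a, mic_b, talker

def transmission_signal(mic_a, mic_b):
    """Level deviation between the two microphone signals (A minus B)."""
    return [a - b for a, b in zip(mic_a, mic_b)]

mic_a, mic_b, talker = mic_signals(800)
tx = transmission_signal(mic_a, mic_b)

# The far-end tone cancels; what remains is the speaker's voice scaled by
# (near_gain_a - near_gain_b) = 0.5.
residual = max(abs(x - 0.5 * s) for x, s in zip(tx, talker))
```

In this sketch the transmission signal is a scaled copy of the speaker's voice, which also illustrates why the description later recommends adding an amplifier around the subtractor.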
[0009]
As described above, the voice input/output device of claim 1 prevents the occurrence of echo and howling and enables high-quality calls with a simple configuration.
If a speaker utters a voice from a location that is equidistant from each voice input means, the voice signals that the voice input means generate from the speaker's voice all have the same level, and the voice signal component based on the speaker's voice is removed when the processing means calculates the deviation between the levels of the voice signals. In claim 1, however, the device is arranged on the bisector that divides the width of the vehicle compartment into two equal parts in parallel with the traveling direction of the vehicle. Consequently, whichever seat in the vehicle the speaker sits in while speaking, the distances from the speaker to the voice input means are not equal, and a call can be made without the voice signal component based on the speaker's voice being removed.
[0010]
As described in claim 2, it is desirable that two voice input means be provided and that the voice input means and the voice output means be arranged on a straight line. This makes it possible to reduce the size of the voice input/output device while preventing echo and howling, which facilitates installation.
[0011]
The voice input/output device according to claim 3 comprises: a plurality of voice input means that input a speaker's voice and generate voice signals; voice output means that outputs the voice of the other party; adjustment means that processes at least one of the voice signals generated by the plurality of voice input means so that the levels of the voice signals produced by the voice input from the voice output means to the plurality of voice input means become equal; processing means that, after the processing by the adjustment means, calculates the deviation between the levels of the voice signals and generates a transmission signal based on that deviation, thereby removing from the transmission signal the voice that was output from the voice output means and entered the voice input means together with the speaker's voice; and communication means that transmits the transmission signal generated by the processing means to the other party, receives a reception signal transmitted from the other party, and outputs it to the voice output means. The device is characterized in that it operates while arranged on the bisector that divides the width of the vehicle compartment into two equal parts in parallel with the traveling direction of the vehicle.
[0012]
In the voice input/output device of claim 3, the speaker's voice is input through a plurality of voice input means. If the voice input means are not arranged at an equal distance from the voice output means, they cannot generate voice signals of equal level when the voice from the voice output means is input. In other words, the processing means cannot remove the voice signal component based on the voice from the voice output means simply by using the voice signals generated by the voice input means as they are. Even if the voice input means are arranged at an equal distance from the voice output means, when the level characteristics or phase characteristics of the voice input means vary widely, they cannot generate voice signals of equal level when the voice from the voice output means is input, and the processing means cannot sufficiently remove the voice signal component based on the voice from the voice output means. Furthermore, the acoustic characteristics of the passenger compartment, for example, may also prevent the processing means from removing that component.
[0013]
Therefore, in the voice input/output device of claim 3, the adjustment means processes at least one of the voice signals generated by the voice input means so that the levels of the voice signals produced by the voice input from the voice output means to each voice input means become equal, and the transmission signal is generated by calculating the deviation between the levels of the processed voice signals. This makes it possible to remove the voice signal component based on the voice from the voice output means and to generate a transmission signal based on the speaker's voice, regardless of the arrangement of the voice input means, variations in their level and phase characteristics, and the acoustic characteristics of the passenger compartment.
If a speaker utters a voice from a location that is equidistant from each voice input means, the voice signals that the voice input means generate from the speaker's voice all have the same level, and the voice signal component based on the speaker's voice is removed when the processing means calculates the deviation between the levels of the voice signals. In claim 3, however, the device is arranged on the bisector that divides the width of the vehicle compartment into two equal parts in parallel with the traveling direction of the vehicle. Consequently, whichever seat in the vehicle the speaker sits in while speaking, the distances from the speaker to the voice input means are not equal, and a call can be made without the voice signal component based on the speaker's voice being removed.
[0014]
As described in claim 4, it is desirable that the adjustment means have a function of adjusting the level of the voice signal. By amplifying and thereby adjusting the level of the voice signal generated by each voice input means, variations in voice signal level caused by the arrangement of the voice input means, by differences in their level characteristics, and by the acoustic characteristics of the passenger compartment can be corrected easily.
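As a rough illustration of level adjustment, the sketch below estimates a single gain that equalizes the test-tone level picked up by two microphones at unequal distances from the voice output means. The 1/distance attenuation model, the distances `D1` and `D2`, and the helper names `rms` and `matching_gain` are assumptions made for this example; the source does not specify how the gain is computed.

```python
import math

def rms(x):
    """Root-mean-square level of a sample list."""
    return math.sqrt(sum(v * v for v in x) / len(x))

def matching_gain(ref_a, ref_b):
    """Gain to apply to signal B so that its level matches signal A."""
    return rms(ref_a) / rms(ref_b)

# Hypothetical distances (metres) from the voice output means to each microphone.
D1, D2 = 0.05, 0.08
tone = [math.sin(2 * math.pi * 1000.0 * i / 8000.0) for i in range(800)]

# Assumed free-field model: picked-up level falls off as 1/distance.
pickup_a = [v / D1 for v in tone]
pickup_b = [v / D2 for v in tone]

gain_b = matching_gain(pickup_a, pickup_b)

# After level matching, the subtraction leaves almost nothing of the test tone.
residual = [a - gain_b * b for a, b in zip(pickup_a, pickup_b)]
```

With these assumed distances the gain comes out at about D2/D1 = 1.6, and the residual test tone is negligible.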
[0015]
As described in claim 5, it is desirable that the adjustment means have a function of adjusting the phase of the voice signal. This enables high-accuracy correction that also takes into account variations in voice signal phase caused by the arrangement of the voice input means, by differences in their phase characteristics, and by the acoustic characteristics of the passenger compartment.
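The benefit of adjusting phase as well as level can be seen in a small example. This sketch assumes, purely for illustration, that the path difference between the microphones amounts to a whole-sample delay (3 samples) and a level ratio of 0.5; the helper names are hypothetical and a real device would need finer phase control.

```python
import math

def delayed(x, k):
    """Delay x by k samples, padding the start with zeros."""
    return [0.0] * k + x[:len(x) - k]

def scaled(x, g):
    return [g * v for v in x]

def subtract(a, b):
    return [p - q for p, q in zip(a, b)]

def energy(x):
    return sum(v * v for v in x)

FS = 8000.0
test_tone = [math.sin(2 * math.pi * 1000.0 * i / FS) for i in range(400)]

# Assumed pickup: microphone B hears the tone directly; microphone A hears it
# 3 samples later and at half the level.
mic_b = test_tone
mic_a = scaled(delayed(test_tone, 3), 0.5)

# Level adjustment only: levels match, but the phase offset leaves a residual.
level_only = subtract(mic_a, scaled(mic_b, 0.5))

# Level and phase adjustment: signal B is also delayed to line up with signal A.
level_and_phase = subtract(mic_a, scaled(delayed(mic_b, 3), 0.5))
```

Comparing `energy(level_only)` with `energy(level_and_phase)` shows that matching level alone leaves a substantial residual of the test tone, while matching both removes it in this idealized case.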
[0016]
As described in claim 6, it is desirable to provide detection means that detects the transmission signal generated by the processing means, and control means that controls the operation of the adjustment means based on the detection result of the detection means. The detection means detects the level of the generated transmission signal, and the control means controls the operation of the adjustment means based on that result, so that the voice signals can be corrected with higher accuracy.
[0019]
As described in claim 7, it is desirable that the voice input/output device be arranged in a device attached to the ceiling of the vehicle compartment, namely a room light, a rearview mirror, a map lamp, or an overhead module. This allows a call to be made without the voice signal component based on the speaker's voice being removed, and also makes effective use of the space in the vehicle compartment.
[0020]
DETAILED DESCRIPTION OF THE INVENTION
(First embodiment)
FIG. 1 is a block diagram showing the overall configuration of a voice input / output device according to the first embodiment of the present invention.
[0021]
The microphone A1 and the microphone B2, which serve as the voice input means, are small microphones, for example; they input the speaker's voice and generate voice signals. The speaker 3, which serves as the voice output means, is a small loudspeaker, for example, and outputs the voice of the other party. In this embodiment, the microphone A1 and the microphone B2 are each arranged at a distance d from the speaker 3, and the microphone A1 and the microphone B2 are arranged in a straight line with the speaker 3 between them. This arrangement makes it possible to reduce the size of the voice input/output device while preventing echo and howling, which facilitates installation.
[0022]
The voice input/output device is installed above the room light of the vehicle. As a result, whichever seat the speaker sits in, the distances between the speaker and the two microphones are not equal, and a call can be made without the voice signal component based on the speaker's voice being removed (details are described later). The space in the passenger compartment can also be used effectively. The device may be installed anywhere, provided that the distances between the speaker and the microphones are not equal, that is, provided that it is installed on the bisector that divides the width of the passenger compartment into two equal parts in parallel with the traveling direction of the vehicle.
[0023]
The signal subtractor 4, which serves as the processing means, is composed of, for example, an operational amplifier. It algebraically subtracts the level of the voice signal generated by the microphone B2 from the level of the voice signal generated by the microphone A1, and generates the transmission signal by calculating the deviation between the levels of the two voice signals. Alternatively, the transmission signal may be generated by subtracting the level of the voice signal generated by the microphone A1 from the level of the voice signal generated by the microphone B2. The subtraction may also be performed digitally: for example, the voice signals generated by the microphone A1 and the microphone B2 are first digitized by an A/D converter and then subtracted by a digital subtraction circuit. Because the signal subtractor 4 also subtracts the speaker's voice signal that is to be transmitted, the level of the resulting deviation becomes small; it is therefore preferable to connect an amplifier before or after the signal subtractor 4.
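The digital variant mentioned above (A/D conversion, digital subtraction, and an amplifier after the subtractor) might look like the following sketch. The 16-bit word length, the make-up gain of 2.0, and the function names `quantize` and `subtract_and_amplify` are assumptions for illustration; the source does not specify them.

```python
def quantize(samples, bits=16):
    """Model an A/D converter: map floats in [-1.0, 1.0] to signed integers."""
    full_scale = 2 ** (bits - 1) - 1
    return [max(-full_scale - 1, min(full_scale, round(v * full_scale)))
            for v in samples]

def subtract_and_amplify(mic_a, mic_b, gain=2.0, bits=16):
    """Digital subtraction (A minus B) followed by a make-up amplifier.

    The result is clipped to the signed range of the assumed word length so
    that the amplifier stage cannot overflow.
    """
    full_scale = 2 ** (bits - 1) - 1
    out = []
    for a, b in zip(mic_a, mic_b):
        amplified = (a - b) * gain
        out.append(int(max(-full_scale - 1, min(full_scale, round(amplified)))))
    return out

# Two-sample example: the second sample is identical at both microphones,
# so it cancels completely.
digital_a = quantize([0.5, -0.25])
digital_b = quantize([0.25, -0.25])
tx = subtract_and_amplify(digital_a, digital_b)
```

Placing the gain after the subtraction, as here, amplifies only the difference signal; placing it before the subtractor would require the same gain on both inputs to preserve the cancellation.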
[0024]
The communication device 5 that is a communication means is, for example, an in-vehicle wireless device, and transmits the transmission signal generated by the signal subtracter 4 to the other party of the call. In addition, it receives a reception signal transmitted from the other party and outputs it to the speaker 3. As for communication with the other party, for example, a communication function installed in the navigation device may be used, or a communication function of a mobile phone may be used.
[0025]
Next, the process by which the voice input/output device of the first embodiment generates a transmission signal is described in order.
[0026]
The voice uttered by the speaker is input through the microphone A1 and the microphone B2, each of which generates a voice signal. Next, the signal subtractor 4 algebraically subtracts the level of the voice signal generated by the microphone B2 from the level of the voice signal generated by the microphone A1, calculates the deviation between the levels of the two voice signals, and outputs this deviation as the transmission signal. Because the voice input/output device is installed above the room light of the vehicle, the distances between the speaker and the two microphones are not equal, whichever seat in the passenger compartment the speaker sits in. The levels of the components based on the speaker's voice in the voice signals generated by the two microphones are therefore not equal either, so the component based on the speaker's voice is not removed even when the signal subtractor 4 algebraically subtracts the levels of the two voice signals. In contrast, because the speaker 3 is equidistant from each microphone, the levels of the components based on the sound output from the speaker 3 are approximately equal in the two voice signals, and the algebraic subtraction by the signal subtractor 4 removes the component based on the sound output from the speaker 3.
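The geometric argument above can be checked with a small calculation. All coordinates below are hypothetical: the device is placed at the origin on the cabin centerline, the two microphones are assumed to sit 5 cm to either side of the speaker 3 along the lateral axis, seat positions are rough guesses, and level is assumed to fall off as 1/distance. The source does not give these dimensions or the orientation of the microphone axis.

```python
import math

# Coordinates in metres: x lateral (0 = cabin centerline), y longitudinal, z vertical.
D = 0.05
MIC_A = (-D, 0.0, 0.0)
MIC_B = (D, 0.0, 0.0)
SPEAKER_3 = (0.0, 0.0, 0.0)

# Hypothetical head positions for four seats, all off the centerline.
SEATS = {
    "front_left": (-0.4, 0.6, -0.5),
    "front_right": (0.4, 0.6, -0.5),
    "rear_left": (-0.4, -0.6, -0.5),
    "rear_right": (0.4, -0.6, -0.5),
}

def level_difference(source):
    """Difference in received level between mic A and mic B (1/distance model)."""
    return 1.0 / math.dist(source, MIC_A) - 1.0 / math.dist(source, MIC_B)

seat_differences = {name: level_difference(pos) for name, pos in SEATS.items()}
speaker_difference = level_difference(SPEAKER_3)

# A source on the centerline is equidistant from both microphones in this
# layout, which is the degenerate case in which the voice would cancel.
centerline_difference = level_difference((0.0, 0.6, -0.5))
```

Under these assumptions every seat yields a non-zero level difference, so the speaker's voice survives the subtraction, while the sound from the speaker 3 yields a zero difference and cancels.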
[0027]
As described above, in the audio input / output device of the present embodiment, the levels of the audio signals generated by the microphones A1 and B2 arranged at the same distance from the speaker 3 are algebraically subtracted to obtain the levels of the two audio signals. By generating the deviation as the transmission signal, the sound output from the speaker 3 mixed in the microphone A1 and the microphone B2 can be removed from the transmission signal. As a result, the occurrence of echoes and howling can be prevented, and high-quality calls can be made with a simple configuration.
(Second Embodiment)
FIG. 2 is a block diagram showing the overall configuration of a voice input/output device according to the second embodiment of the present invention. As shown in FIG. 2, this embodiment differs from the first embodiment in that an amplifier A6, an amplifier B7, a level detector 8, a remote controller 9, a switching circuit 10, and an ECU 11 are newly added to the voice input/output device of the first embodiment.
[0028]
The amplifier A6 and the amplifier B7, which serve as the adjustment means, are variable amplifiers whose amplification factors can be changed; they amplify the levels of the voice signals generated by the microphone A1 and the microphone B2, respectively, according to the set amplification factors. The amplification factor of each amplifier is changed by the ECU 11. The voice signals may instead be converted to digital signals by an A/D converter, for example, and then amplified.
[0029]
The level detector 8 which is a detecting means is a signal measuring device, for example, and detects the level of the transmission signal generated by the signal subtractor 4 and outputs it to the ECU 11. Regarding the detection of the level of the transmission signal, for example, it may be detected by converting it into a digital signal by an A / D converter.
[0030]
The remote controller 9 is a multifunction remote controller having, for example, a numeric keypad and a function key, and instructs the ECU 11 to adjust the amplification factors of the amplifier A6 and the amplifier B7. The instruction for adjusting the amplification factor may be instructed from a computer via a communication network. In addition, an adjustment switch may be provided for each amplifier, and the user or a service person may perform adjustment manually.
[0031]
The switching circuit 10 is composed of, for example, a flip-flop, and switches the connection destination of the speaker 3 to the communication device 5 side or the ECU 11 side in accordance with an instruction from the ECU 11. The switching of the connection destination of the speaker 3 may be performed using, for example, a contactless relay.
[0032]
When the remote controller 9 instructs the ECU 11 to adjust the amplification factors of the amplifier A6 and the amplifier B7, the ECU 11 instructs the switching circuit 10 to switch the connection destination of the speaker 3 to the ECU 11 side. The ECU 11 then outputs a test signal to the speaker 3 and changes the amplification factors of the amplifier A6 and the amplifier B7 so that the level of the transmission signal detected by the level detector 8 becomes equal to or less than a predetermined value. These operations may instead be performed by the CPU of a car navigation device, a communication device, or the like mounted on the vehicle.
[0033]
Note that the microphone A1 of this embodiment is arranged away from the speaker 3 by a distance d1, and the microphone B2 is arranged away from the speaker 3 by a distance d2.
[0034]
Other configurations and operations are the same as those in the first embodiment described above, and a description thereof will be omitted.
[0035]
FIG. 3 is a flowchart of a process in which the audio input / output device of the second embodiment adjusts the amplification factors of the amplifier A6 and the amplifier B7. The processing of this flowchart is executed every time the speaker uses the remote controller 9 to instruct adjustment of the amplification factor of each amplifier.
[0036]
In step 301, the switch of the switching circuit 10 is switched to the ECU 11 side. In step 302, output of a test signal is started. The test signal may be a TSP signal, sine wave, white noise, or impulse, and is output from the speaker 3.
[0037]
In step 303, the process waits for a predetermined time until the transmission signal is generated from the test signal. In step 304, the level of the transmission signal generated in response to the test signal output in step 302 is detected. In step 305, it is determined whether or not the absolute value of the level of the transmission signal detected in step 304 is equal to or less than a predetermined value K. If the absolute value of the level of the transmission signal is equal to or less than the predetermined value K, the process proceeds to step 307 and the output of the test signal is terminated. Thereafter, in step 308, the switch of the switching circuit 10 is switched to the communication device 5 side, and the process is terminated. On the other hand, if it is determined in step 305 that the absolute value of the level of the transmission signal is not equal to or less than the predetermined value K, the process proceeds to step 306, and the amplification factor of the amplifier A6 or the amplifier B7 is changed so that the level of the transmission signal approaches zero. Thereafter, the process returns to step 303, and the above procedure is repeated until the absolute value of the level of the transmission signal becomes equal to or less than the predetermined value K.
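The loop of steps 303 to 306 can be sketched as follows. This is only a minimal illustration under stated assumptions: the speaker-to-microphone paths are reduced to fixed attenuation factors (0.8 and 0.5, chosen arbitrarily), only the amplification factor of the amplifier B7 is changed, and the proportional update in step 306 is a hypothetical rule, since the description only requires that the level approach zero. The names `rms`, `adjust_gain_b`, `step`, and `max_iter` do not appear in the description.

```python
import math

def rms(samples):
    """Root-mean-square level of a sequence of samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))

def adjust_gain_b(level_a, level_b, gain_a=1.0, gain_b=1.0,
                  k=1e-3, step=0.5, max_iter=200):
    """Return a gain for amplifier B that brings the residual level to <= k.

    level_a / level_b: levels of the test signal as picked up by
    microphones A and B (hypothetical stand-ins for the real measurement).
    """
    for _ in range(max_iter):
        # Step 304: signed level of the transmission signal (A minus B).
        residual = gain_a * level_a - gain_b * level_b
        # Step 305: stop when the absolute level is at or below K.
        if abs(residual) <= k:
            return gain_b  # steps 307-308 would follow here
        # Step 306 (assumed rule): move gain_b so the residual shrinks.
        gain_b += step * residual / level_b
    raise RuntimeError("gain adjustment did not converge")

# Test signal: five cycles of a sine wave, as one of the options in step 302.
test_signal = [math.sin(2 * math.pi * 5 * n / 100) for n in range(100)]
level_a = rms([0.8 * s for s in test_signal])  # assumed path to microphone A
level_b = rms([0.5 * s for s in test_signal])  # assumed path to microphone B
gain_b = adjust_gain_b(level_a, level_b)
```

With these assumed attenuation factors the loop settles near a gain of 1.6 for the amplifier B7, which is the ratio 0.8 / 0.5 of the two paths.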
[0038]
When the microphone A1 and the microphone B2 are not arranged at the same distance from the speaker 3, the audio signal components based on the sound from the speaker 3 in the audio signals generated by the respective microphones do not have the same level, so the signal subtractor 4 cannot remove the audio signal component based on the sound from the speaker 3 from the transmission signal. In addition, even if the microphone A1 and the microphone B2 are arranged at the same distance from the speaker 3, the audio signal components based on the sound from the speaker 3 do not have the same level when the variation in level characteristics between the microphones is large, so the signal subtractor 4 cannot sufficiently remove the audio signal component based on the sound from the speaker 3 from the transmission signal. Furthermore, the signal subtractor 4 may be unable to remove the audio signal component based on the sound from the speaker 3 from the transmission signal because of, for example, the acoustic characteristics in the passenger compartment. Therefore, the amplification factor of each amplifier is adjusted so that the levels of the audio signal components based on the sound from the speaker 3 become equal, and the transmission signal is generated based on the deviation of the levels of the amplified audio signals. In this way, the audio signal component based on the sound from the speaker 3 can be removed from the transmission signal regardless of the arrangement of the microphones, the variation in level characteristics of the microphones, and the acoustic characteristics in the passenger compartment. Further, the generated transmission signal is detected by the level detector 8, and the ECU 11 changes the amplification factors of the amplifier A6 and the amplifier B7 based on the detection result, so that variations in the level characteristics of the microphones and fluctuations in the level of the audio signal due to the acoustic characteristics in the vehicle interior can be corrected with high accuracy.
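The reasoning above can be written out with a simple level-only model. The symbols below are assumptions introduced for illustration and do not appear in the description: x is the sound from the speaker 3, v is the voice of the person speaking, a_A and a_B are the levels at which x reaches the microphones A1 and B2, b_A and b_B are the corresponding levels for v, and G_A and G_B are the amplification factors of the amplifiers A6 and B7.

```latex
\begin{align}
  s_A &= a_A\,x + b_A\,v, \qquad s_B = a_B\,x + b_B\,v \\
  y   &= G_A\,s_A - G_B\,s_B
       = (G_A a_A - G_B a_B)\,x + (G_A b_A - G_B b_B)\,v \\
  G_A a_A = G_B a_B \;&\Longrightarrow\; y = (G_A b_A - G_B b_B)\,v
\end{align}
```

Under this model, choosing the amplification factors so that G_A a_A = G_B a_B removes the speaker component from the transmission signal y, and the voice component survives as long as b_A / b_B differs from a_A / a_B, that is, as long as the person speaking is not placed relative to the microphones in the same way as the speaker 3.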
[0039]
Note that the processing of this flowchart may be automatically performed at regular intervals when the voice input / output device is not used. Further, it may be performed periodically at a dealer or factory of a vehicle equipped with the voice input / output device.
[0040]
Next, processing in which the voice input / output device of the second embodiment generates a transmission signal will be described in order.
[0041]
The voice uttered by the speaker is input to the microphone A1 and the microphone B2, each of which generates an audio signal. Next, the amplifier A6 and the amplifier B7 amplify the audio signals generated by the microphone A1 and the microphone B2 according to the amplification factors set by executing the processing of the flowchart of FIG. 3. The signal subtractor 4 algebraically subtracts the level of the audio signal amplified by the amplifier B7 from the level of the audio signal amplified by the amplifier A6 to calculate the deviation between the levels of the two audio signals, thereby generating a transmission signal from which the audio signal component of the sound output from the speaker 3 has been removed.
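As a small numerical illustration of this amplify-then-subtract step, the sketch below mixes a speaker sound and a voice into two microphone signals and applies the subtraction. The coupling factors (0.8, 0.5, 1.0, 0.25), the sample values, and the function name `transmission_signal` are all hypothetical, and the amplification factor 1.6 is assumed to have been obtained beforehand by the adjustment of FIG. 3.

```python
def transmission_signal(mic_a, mic_b, gain_a, gain_b):
    """Amplify each microphone signal, then subtract B from A sample by sample."""
    return [gain_a * a - gain_b * b for a, b in zip(mic_a, mic_b)]

# Hypothetical sample values for the sound from speaker 3 and the voice.
speaker_sound = [0.0, 1.0, -1.0, 0.5]
voice = [0.2, -0.4, 0.6, 0.0]

# Assumed coupling: the speaker sound reaches A at 0.8 and B at 0.5,
# while the voice reaches A at 1.0 and B at 0.25.
mic_a = [0.8 * s + 1.0 * v for s, v in zip(speaker_sound, voice)]
mic_b = [0.5 * s + 0.25 * v for s, v in zip(speaker_sound, voice)]

# Gains assumed to come from the FIG. 3 adjustment (0.8 * 1.0 == 0.5 * 1.6).
tx = transmission_signal(mic_a, mic_b, gain_a=1.0, gain_b=1.6)
# The speaker terms cancel and each sample of tx is 0.6 times the voice sample.
```

The voice is attenuated (here to 0.6 of its level at the microphone A1) but not removed, because it reaches the two microphones in a different ratio than the speaker sound does.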
[0042]
As described above, in the audio input / output device according to the present embodiment, the levels of the audio signals generated by the microphone A1 and the microphone B2 are amplified by the amplifier A6 and the amplifier B7, respectively, and the signal subtractor 4 calculates the deviation of the levels of the amplified audio signals to generate a transmission signal. As a result, the sound output from the speaker 3 can be removed from the transmission signal regardless of the arrangement of the microphones, the variation in the level characteristics of the microphones, and the acoustic characteristics in the passenger compartment. The distances between the respective microphones and the speaker 3 may also be equal. Further, only one amplifier may be provided so that only one of the two audio signals is amplified.
(Third embodiment)
FIG. 4 is a block diagram showing the overall configuration of a voice input / output device according to the third embodiment of the present invention. As shown in FIG. 4, this embodiment differs from the audio input / output device of the second embodiment described above in that a waveform shaper A12 and a waveform shaper B13 are provided in place of the amplifier A6 and the amplifier B7, respectively, and a waveform detector 14 is provided in place of the level detector 8.
[0043]
The waveform shaper A12 and the waveform shaper B13 shape the waveform of each audio signal generated by the microphone A1 and the microphone B2 according to the set shaping parameters, changing the level and phase of each audio signal. The shaping parameters of each waveform shaper are variable and are changed by the ECU 11. The level and phase of an audio signal may instead be changed by, for example, converting the audio signal into a digital signal with an A / D converter and applying a z-transform, a wavelet transform, or the like.
[0044]
The waveform detector 14 is a signal measuring device, for example, and detects the waveform of the transmission signal generated by the signal subtracter 4. Regarding the detection of the waveform of the transmission signal, it may be detected by converting it into a digital signal by an A / D converter, for example.
[0045]
The remote controller 9 of this embodiment instructs the ECU 11 to adjust the shaping parameters of the waveform shaper A12 and the waveform shaper B13.
[0046]
When the remote controller 9 instructs the ECU 11 to adjust the shaping parameters of the waveform shaper A12 and the waveform shaper B13, the ECU 11 instructs the switching circuit 10 to switch the connection destination of the speaker 3 to the ECU 11 side. The ECU 11 then outputs a test signal to the speaker 3 and changes the shaping parameters of the waveform shaper A12 and the waveform shaper B13 so that the deviation between the waveform of the transmission signal detected by the waveform detector 14 and the waveform of the test signal becomes smaller than a predetermined value.
[0047]
Other configurations and operations are the same as those in the second embodiment described above, and a description thereof will be omitted.
[0048]
FIG. 5 is a flowchart of a process in which the voice input / output device of the third embodiment adjusts the shaping parameters of the waveform shaper A12 and the waveform shaper B13. As shown in FIG. 5, the flowchart of the present embodiment differs from the flowchart of FIG. 3 in the second embodiment in three steps. The step of detecting the level of the generated transmission signal is replaced with a step of detecting the waveform of the generated transmission signal. The step of determining whether or not the absolute value of the level of the transmission signal is equal to or less than the predetermined value K is replaced with a step of determining whether or not the deviation between the waveform of the transmission signal and the waveform of the test signal is equal to or less than a predetermined value L. The step of changing the amplification factor of the amplifier A6 or the amplifier B7 is replaced with a step of changing the shaping parameters of the waveform shaper A12 or the waveform shaper B13. In other words, all the processes other than steps 504, 505, and 506 are the same as the processes of the flowchart of FIG. 3 in the second embodiment described above, and a description thereof will be omitted. The process of this flowchart is executed every time the speaker uses the remote controller 9 to instruct adjustment of the shaping parameters of each waveform shaper.
[0049]
In step 504, the waveform of the generated transmission signal is detected. In step 505, the deviation between the waveform of the transmission signal detected in step 504 and the waveform of the test signal output in step 502 is calculated, and it is determined whether or not this deviation is equal to or less than a predetermined value L. If the deviation between the waveform of the transmission signal and the waveform of the test signal is not equal to or less than the predetermined value L, the process proceeds to step 506, and the shaping parameters of the waveform shaper A12 and the waveform shaper B13 are changed so that the deviation from the waveform of the test signal becomes smaller. Thereafter, the process returns to step 503, and the above-described procedure is repeated until the deviation between the waveform of the transmission signal and the waveform of the test signal becomes equal to or less than the predetermined value L.
[0050]
In this way, the generated transmission signal is detected by the waveform detector 14, and the ECU 11 changes the shaping parameters of the waveform shaper A12 and the waveform shaper B13 based on the detection result, so that variations in the level characteristics and phase characteristics of the microphones and fluctuations in the level and phase of the audio signal due to the acoustic characteristics in the passenger compartment can be corrected with higher accuracy.
[0051]
Note that the processing of this flowchart may be automatically performed at regular intervals when the voice input / output device is not used. Further, it may be performed periodically at a dealer or factory of a vehicle equipped with the voice input / output device.
[0052]
Next, processing in which the voice input / output device of the third embodiment generates a transmission signal will be described in order.
[0053]
The voice uttered by the speaker is input to the microphone A1 and the microphone B2, each of which generates an audio signal. Next, the waveform shaper A12 and the waveform shaper B13 shape the waveforms of the audio signals generated by the microphone A1 and the microphone B2 in accordance with the shaping parameters set by executing the processing of the flowchart of FIG. 5. The signal subtractor 4 subtracts the level of the audio signal shaped by the waveform shaper B13 from the level of the audio signal shaped by the waveform shaper A12 to calculate the deviation between the levels of the two audio signals, thereby generating a transmission signal from which the audio signal component of the sound output from the speaker 3 has been removed.
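The benefit of shaping phase as well as level can be illustrated with a deliberately simplified sketch. It assumes that a waveform shaper can be modeled as a gain plus a delay of a whole number of samples, and that the microphone B2 hears the speaker sound one sample later and at half the level of the microphone A1. The description does not specify how the shapers work internally, so the functions `shape` and `subtract` and all numeric values are hypothetical.

```python
def shape(signal, gain, delay):
    """Scale by `gain` and delay by `delay` whole samples (zero-padded)."""
    delayed = [0.0] * delay + list(signal)
    return [gain * s for s in delayed[:len(signal)]]

def subtract(sig_a, sig_b):
    """Sample-by-sample difference, standing in for signal subtractor 4."""
    return [a - b for a, b in zip(sig_a, sig_b)]

# Hypothetical sound emitted by speaker 3 (no voice present).
speaker_sound = [0.0, 1.0, -0.5, 0.25, 0.0, 0.0, 0.0, 0.0]

# Assumed acoustic paths: B is farther away, so later and quieter.
mic_a = shape(speaker_sound, gain=1.0, delay=0)
mic_b = shape(speaker_sound, gain=0.5, delay=1)

# Shaping parameters assumed to result from the FIG. 5 adjustment:
# shaper A matches A's signal to B's level and timing, shaper B passes through.
shaped_a = shape(mic_a, gain=0.5, delay=1)
shaped_b = shape(mic_b, gain=1.0, delay=0)
residual = subtract(shaped_a, shaped_b)

# For comparison: correcting the level only leaves the timing mismatch.
level_only = subtract(shape(mic_a, gain=0.5, delay=0), mic_b)
```

In this toy case `residual` is zero at every sample, whereas `level_only` still contains a sizeable speaker component, which is the kind of error that level adjustment alone leaves behind when the two paths differ in phase.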
[0054]
As described above, in the audio input / output device according to the present embodiment, the waveforms of the audio signals generated by the respective microphones are shaped by the waveform shaper A12 and the waveform shaper B13, respectively, and the signal subtractor 4 calculates the deviation of the levels of the shaped audio signals to generate a transmission signal. As a result, the sound output from the speaker 3 can be removed from the transmission signal with higher accuracy regardless of the arrangement of the microphones, the variation in the level characteristics of the microphones, and the acoustic characteristics in the passenger compartment. Further, only one waveform shaper may be provided so that the waveform of only one of the two audio signals is shaped.
[0055]
In each of the embodiments described above, an echo canceller, a howling prevention circuit, or the like may be provided after the signal subtractor 4. This complicates the configuration of the voice input / output device, but enables a higher quality call. Further, there may be three or more microphones and a plurality of speakers.
[0056]
Furthermore, it goes without saying that each of the above-described embodiments may be used alone or in combination.
[Brief description of the drawings]
FIG. 1 is a block diagram showing an overall configuration of a voice input / output device according to a first embodiment of the present invention.
FIG. 2 is a block diagram showing an overall configuration of a voice input / output device according to a second embodiment of the present invention.
FIG. 3 is a flowchart of a process in which the audio input / output device of the second embodiment adjusts the amplification factors of amplifier A and amplifier B.
FIG. 4 is a block diagram showing an overall configuration of a voice input / output device according to a third embodiment of the present invention.
FIG. 5 is a flowchart of a process in which the voice input / output device of the third embodiment adjusts the shaping parameters of the waveform shaper A and the waveform shaper B.
[Explanation of symbols]
1 ... Microphone A, 2 ... Microphone B, 3 ... Speaker, 4 ... Signal subtractor, 5 ... Communication apparatus

Claims

A voice input / output device comprising:
a plurality of voice input means, each of which inputs a speaker's voice and generates a voice signal, and each of which is arranged at an equal distance from voice output means for outputting the voice of the other party;
processing means for calculating a deviation of the levels of the voice signals generated by the plurality of voice input means and generating a transmission signal based on the deviation, thereby removing from the transmission signal the voice output from the voice output means that has been mixed into the voice input means together with the voice of the speaker; and
communication means for transmitting the transmission signal generated by the processing means to the other party, receiving a reception signal transmitted from the other party, and outputting the reception signal to the voice output means,
wherein the voice input / output device operates while arranged on a bisector that bisects the width of the vehicle compartment in parallel with the traveling direction of the vehicle.

2. The voice input / output apparatus according to claim 1, wherein two voice input means are provided, and each of the voice input means and the voice output means are arranged in a straight line.

A voice input / output device comprising:
a plurality of voice input means for inputting a speaker's voice and generating a voice signal;
voice output means for outputting the voice of the other party;
processing means for processing at least one of the voice signals generated by the plurality of voice input means so that the levels of the voice signals produced by the voice input from the voice output means to the plurality of voice input means are equal;
processing means for calculating a deviation of the levels of the voice signals after processing by the processing means and generating a transmission signal based on the deviation, thereby removing from the transmission signal the voice output from the voice output means that has been mixed into the voice input means together with the voice of the speaker; and
communication means for transmitting the transmission signal generated by the processing means to the other party, receiving a reception signal transmitted from the other party, and outputting the reception signal to the voice output means,
wherein the voice input / output device operates while arranged on a bisector that bisects the width of the vehicle compartment in parallel with the traveling direction of the vehicle.

4. The voice input / output apparatus according to claim 3, wherein the processing means has a function of adjusting a level of the voice signal.

The voice input / output device according to claim 3 or 4, wherein the processing means has a function of adjusting a phase of the voice signal.

The voice input / output device according to claim 3, further comprising: detecting means for detecting the transmission signal generated by the processing means; and
control means for controlling an operation of the processing means based on a detection result of the detecting means.

The voice input / output device, wherein the voice input / output device is disposed in any one of a room light, a room mirror, a map lamp, or an overhead module that is attached to the ceiling of the vehicle interior.