JP7400593B2

JP7400593B2 - Information processing device and program

Info

Publication number: JP7400593B2
Application number: JP2020062762A
Authority: JP
Inventors: 直毅楠本
Original assignee: Brother Industries Ltd
Current assignee: Brother Industries Ltd
Priority date: 2020-03-31
Filing date: 2020-03-31
Publication date: 2023-12-19
Anticipated expiration: 2040-03-31
Also published as: JP2021163069A

Description

本願は、画像形成装置を音声により制御する技術に関するものである。 The present application relates to a technology for controlling an image forming apparatus by voice.

特許文献１には、パネル部でのタッチ操作による指示と、音声による指示とに基づいた操作処理を行う電子機器が記載されている。 Patent Document 1 describes an electronic device that performs operation processing based on instructions by touch operation on a panel section and instructions by voice.

特開２０１９－１３９６７４号公報Japanese Patent Application Publication No. 2019-139674

しかし、特許文献１に記載の電子機器では、音声による画像形成装置の設定パラメータの値の変更指示は、その設定パラメータの値そのものを指定して指示しているので、ユーザがその設定パラメータの値と画像形成結果の関係をよく理解していないような場合、指示後の設定パラメータの値で画像形成装置に画像形成を行わせた場合であっても、ユーザの意図通りの画像形成がなされない虞が生ずる。 However, in the electronic device described in Patent Document 1, an instruction to change the value of a setting parameter of an image forming apparatus by voice is given by specifying the value of the setting parameter itself. If the relationship between the image forming result and the image forming result is not well understood, the image may not be formed as intended by the user even if the image forming apparatus forms the image using the setting parameter values specified by the user. There is a risk.

本願は、ユーザの意図通りの画像形成を行うことが可能となる、音声による設定パラメータの変更技術を提供することを目的とする。 An object of the present application is to provide a technology for changing setting parameters by voice, which enables image formation as intended by the user.

上記目的を達成するため、本願の情報処理装置は、通信インタフェースと、制御装置と、を備え、制御装置は、通信インタフェースを介して接続された、音声を入力及び出力するスマートスピーカから、画像形成装置のユーザが発音することにより入力された音声の内容を認識し、認識された音声の内容が、画像形成装置が形成した画像に対するユーザの不満あるいは要求を示す内容である場合、認識された音声の内容に基づいて、画像形成装置が変更すべき設定パラメータの項目及び値を決定するための処理を実行し、決定された設定パラメータの項目及び値を含むコマンドを生成し、生成されたコマンドを画像形成装置に送信する。 In order to achieve the above object, the information processing device of the present application includes a communication interface and a control device, and the control device controls image formation from a smart speaker connected via the communication interface that inputs and outputs audio. The content of the voice input by the user of the device is recognized by pronunciation, and if the content of the recognized voice indicates the user's dissatisfaction with or request for the image formed by the image forming device, the recognized voice Based on the contents of the image forming apparatus, the image forming apparatus executes processing to determine the setting parameter items and values to be changed, generates a command including the determined setting parameter items and values, and executes the generated command. Send to image forming device.

なお、本明細書に開示される技術は、種々の形態で実現することが可能であり、例えば、情報処理方法、画像形成装置の制御方法、装置の機能または上記方法を実現するためのコンピュータプログラム、そのコンピュータプログラムを記録した記録媒体、等の形態で実現することができる。 Note that the technology disclosed in this specification can be realized in various forms, such as an information processing method, a method for controlling an image forming apparatus, a function of the apparatus, or a computer program for realizing the above method. , a recording medium on which the computer program is recorded, etc.

本願によれば、ユーザの意図通りの画像形成を行うことが可能となる。 According to the present application, it is possible to perform image formation as intended by the user.

本願の一実施形態に係る画像形成システムの構成を示すブロック図である。1 is a block diagram showing the configuration of an image forming system according to an embodiment of the present application. 図１の画像形成システムによって実行される印刷制御処理のシーケンス図である。2 is a sequence diagram of print control processing executed by the image forming system of FIG. 1. FIG. 図２のシーケンス図の続きのシーケンス図である。3 is a sequence diagram that is a continuation of the sequence diagram in FIG. 2. FIG. ログ情報として記憶されるデータの一例を示す図である。FIG. 3 is a diagram showing an example of data stored as log information.

以下、本願の実施の形態を図面に基づいて詳細に説明する。 Hereinafter, embodiments of the present application will be described in detail based on the drawings.

図１は、本願の一実施形態に係る画像形成システム１０００の構成を示している。画像形成システム１０００は、中継器１００と、プリンタ２００と、スマートスピーカ３００と、アプリケーションサーバ４００とにより主として構成されている。なお、本実施形態の画像形成システム１０００では、中継器１００とプリンタ２００とスマートスピーカ３００は、同じユーザによって利用される。 FIG. 1 shows the configuration of an image forming system 1000 according to an embodiment of the present application. The image forming system 1000 mainly includes a repeater 100, a printer 200, a smart speaker 300, and an application server 400. Note that in the image forming system 1000 of this embodiment, the repeater 100, printer 200, and smart speaker 300 are used by the same user.

画像形成システム１０００で用いられるアクセスポイント５０は、例えば、ＩＥＥＥ８０２．１１ａ／ｂ／ｇ／ｎの規格に従った通信方式を用いて無線ＬＡＮのアクセスポイントとしての機能を実現する。アクセスポイント５０は、ＬＡＮ７０に接続されている。ＬＡＮ７０は、例えば、イーサネット（登録商標）規格に準拠して構築された有線ネットワークである。ＬＡＮ７０は、インターネット８０に接続されている。アプリケーションサーバ４００は、インターネット８０に接続されている。 The access point 50 used in the image forming system 1000 realizes a function as a wireless LAN access point using a communication method according to the IEEE802.11a/b/g/n standard, for example. Access point 50 is connected to LAN 70. The LAN 70 is, for example, a wired network constructed in accordance with the Ethernet (registered trademark) standard. LAN 70 is connected to the Internet 80. Application server 400 is connected to the Internet 80.

中継器１００は、アプリケーションサーバ４００とプリンタ２００との間で送受信されるデータを中継する装置である。中継器１００は、中継器１００全体を制御するためのＣＰＵ１１０と、ＲＯＭやＲＡＭ、ＮＶＭなどのメモリ１２０と、プリンタ２００とＰ２Ｐ通信するためのブルートゥース（登録商標）ＩＦ１６０と、アプリケーションサーバ４００と通信するための無線ＬＡＮＩＦ１８０と、を主に備えている。中継器１００は、例えば、スマートフォン等のモバイル端末によって構成されている。なお、ＮＶＭは、non-volatile memoryの略語であり、Ｐ２Ｐは、Peer to Peerの略語であり、ＩＦは、Interfaceの略語である。 The relay device 100 is a device that relays data sent and received between the application server 400 and the printer 200. The repeater 100 communicates with a CPU 110 for controlling the entire repeater 100, a memory 120 such as ROM, RAM, and NVM, a Bluetooth (registered trademark) IF 160 for P2P communication with the printer 200, and an application server 400. It mainly includes a wireless LAN IF 180. The repeater 100 is configured by, for example, a mobile terminal such as a smartphone. Note that NVM is an abbreviation for non-volatile memory, P2P is an abbreviation for Peer to Peer, and IF is an abbreviation for Interface.

メモリ１２０は、データ記憶領域１２２及び制御プログラム領域１２４を含んでいる。データ記憶領域１２２は、ＣＰＵ１１０が制御プログラムなどを実行する際に必要なデータなどを記憶する領域である。また、制御プログラム領域１２４は、ＯＳや制御プログラム、ファームウェアなどを記憶する領域である。 Memory 120 includes a data storage area 122 and a control program area 124. The data storage area 122 is an area for storing data necessary for the CPU 110 to execute a control program and the like. Further, the control program area 124 is an area for storing an OS, control programs, firmware, and the like.

ブルートゥースＩＦ１６０は、アンテナを含み、ブルートゥース方式に準拠した近距離無線通信を行うためのインタフェースである。ブルートゥースＩＦ１６０を介して、中継器１００は、プリンタ２００と通信可能に接続される。無線ＬＡＮＩＦ１８０は、アンテナを含み、例えば、ＩＥＥＥ８０２．１１ａ／ｂ／ｇ／ｎの規格に従った通信方式を用いて無線通信を行う。これにより、中継器１００は、アクセスポイント５０を介してＬＡＮ７０及びインターネット８０に接続され、アプリケーションサーバ４００と通信可能に接続される。なお、プリンタ２００が無線ＬＡＮＩＦを備える場合には、中継器１００は無線ＬＡＮＩＦ１８０を介して、プリンタ２００と通信可能に接続するように構成してもよい。 The Bluetooth IF 160 includes an antenna and is an interface for performing short-range wireless communication based on the Bluetooth method. The repeater 100 is communicably connected to the printer 200 via the Bluetooth IF 160. The wireless LAN IF 180 includes an antenna and performs wireless communication using a communication method according to, for example, the IEEE802.11a/b/g/n standard. Thereby, the repeater 100 is connected to the LAN 70 and the Internet 80 via the access point 50, and is communicably connected to the application server 400. Note that if the printer 200 includes a wireless LAN IF, the repeater 100 may be configured to be communicably connected to the printer 200 via the wireless LAN IF 180.

プリンタ２００は、例えば、ＣＰＵとメモリを含む制御部２１０と、制御部２１０の制御に従って印刷を行う印刷機構２５０と、ブルートゥースＩＦ２６０と、を備えている。印刷機構２５０は、シートに画像を印刷する機構であり、電子写真方式、インクジェット方式、サーマル方式等の印刷機構である。ブルートゥースＩＦ２６０は、アンテナを含み、ブルートゥース方式に準拠した近距離無線通信を行うためのインタフェースであり、上述のように中継器１００との通信のために用いられる。なお、プリンタ２００は、ブルートゥースＩＦ２６０に代えて無線ＬＡＮＩＦを備え、中継器１００と通信可能に接続するよう構成してもよい。 The printer 200 includes, for example, a control unit 210 including a CPU and a memory, a printing mechanism 250 that performs printing under the control of the control unit 210, and a Bluetooth IF 260. The printing mechanism 250 is a mechanism that prints an image on a sheet, and is a printing mechanism using an electrophotographic method, an inkjet method, a thermal method, or the like. The Bluetooth IF 260 includes an antenna and is an interface for performing short-range wireless communication based on the Bluetooth method, and is used for communication with the repeater 100 as described above. Note that the printer 200 may be configured to include a wireless LAN IF instead of the Bluetooth IF 260 and to be communicably connected to the repeater 100.

プリンタ２００は、無線ＬＡＮＩＦを備えていない場合、中継器１００とは通信可能に接続されるが、アプリケーションサーバ４００とは直接、通信することができない。 If the printer 200 is not equipped with a wireless LAN IF, it is communicably connected to the repeater 100, but cannot communicate directly with the application server 400.

スマートスピーカ３００は、ユーザが発話した音声に応じて特定の処理を実行する装置である。特定の処理は、例えば、音声データを生成して、アプリケーションサーバ４００に送信する処理を含む。スマートスピーカ３００は、ＣＰＵとメモリとを含む制御部３１０と、音声入出力部３５０と、ブルートゥースＩＦ３６０と、無線ＬＡＮＩＦ３８０と、を備えている。 The smart speaker 300 is a device that performs specific processing in response to audio uttered by a user. The specific process includes, for example, a process of generating audio data and transmitting it to the application server 400. The smart speaker 300 includes a control section 310 including a CPU and memory, an audio input/output section 350, a Bluetooth IF 360, and a wireless LAN IF 380.

音声入出力部３５０は、スピーカとマイクとを含み、音声の入力と音声の出力に関する処理を実行する。例えば、音声入出力部３５０は、制御部３１０の制御に従って、ユーザが発話した音声を検出し、その音声を示す音声データを生成する。また、音声入出力部３５０は、受信した音声データに応じた音声をスピーカから発生する。 The audio input/output unit 350 includes a speaker and a microphone, and executes processing related to audio input and audio output. For example, the voice input/output unit 350 detects the voice uttered by the user under the control of the control unit 310 and generates voice data representing the voice. Furthermore, the audio input/output unit 350 generates audio from a speaker according to the received audio data.

無線ＬＡＮＩＦ３８０は、アンテナを含み、例えば、ＩＥＥＥ８０２．１１ａ／ｂ／ｇ／ｎの規格に従った通信方式を用いて無線通信を行う。これにより、スマートスピーカ３００は、アクセスポイント５０を介してＬＡＮ７０及びインターネット８０に接続され、アプリケーションサーバ４００と通信可能に接続される。 The wireless LAN IF 380 includes an antenna and performs wireless communication using a communication method according to, for example, the IEEE802.11a/b/g/n standard. Thereby, the smart speaker 300 is connected to the LAN 70 and the Internet 80 via the access point 50, and is communicably connected to the application server 400.

アプリケーションサーバ４００は、例えば、いわゆるクラウドサービスを提供する事業者が運営するサーバである。アプリケーションサーバ４００は、アプリケーションサーバ４００全体を制御するＣＰＵ４１０と、ＲＯＭ、ＲＡＭ、ＨＤＤ、ＳＳＤ及び光ディスクドライブなどを含む記憶部４２０と、を備えている。アプリケーションサーバ４００は、さらに、インターネット８０と接続するためのネットワークＩＦ４８０を備えている。なお、図１では、アプリケーションサーバ４００は、概念的に１個のサーバとして図示されているが、実際は、互いに通信可能に接続された複数個のサーバを含む、いわゆるクラウドサーバである。 The application server 400 is, for example, a server operated by a company that provides a so-called cloud service. The application server 400 includes a CPU 410 that controls the entire application server 400, and a storage unit 420 that includes ROM, RAM, HDD, SSD, optical disk drive, and the like. The application server 400 further includes a network IF 480 for connecting to the Internet 80. Note that although the application server 400 is conceptually illustrated as one server in FIG. 1, it is actually a so-called cloud server that includes a plurality of servers that are communicably connected to each other.

記憶部４２０は、データ記憶領域４２２及びプログラム記憶領域４２４を含んでいる。データ記憶領域４２２は、ＣＰＵ４１０が処理を行う際に必要なデータなどを記憶する記憶領域として、また、ＣＰＵ４１０が処理を行う際に生成される種々の中間データを一時的に格納するバッファ領域として機能する。プログラム記憶領域４２４は、ＯＳ、情報処理プログラム、その他各種のアプリやファームウェアなどを記憶する領域である。情報処理プログラムには、音声解析プログラム４２４ａ及び印刷関連プログラム４２４ｂが含まれる。音声解析プログラム４２４ａは、例えば、アプリケーションサーバ４００の運営者によって、アプリケーションサーバ４００にアップロードされることによって提供される。印刷関連プログラム４２４ｂは、例えば、アプリケーションサーバ４００のリソースを利用して印刷サービスを提供する事業者、例えば、プリンタ２００を製造する事業者によって、アプリケーションサーバ４００にアップロードされることによって提供される。なお、音声解析プログラム４２４ａの全部または一部が、プリンタ２００を製造する事業者によって提供されてもよい。あるいは、印刷関連プログラム４２４ｂの全部または一部がアプリケーションサーバ４００を運営する事業者によって提供されてもよい。 The storage unit 420 includes a data storage area 422 and a program storage area 424. The data storage area 422 functions as a storage area for storing data required when the CPU 410 performs processing, and as a buffer area for temporarily storing various intermediate data generated when the CPU 410 performs processing. do. The program storage area 424 is an area that stores the OS, information processing programs, various other applications, firmware, and the like. The information processing program includes a voice analysis program 424a and a printing related program 424b. The audio analysis program 424a is provided by, for example, being uploaded to the application server 400 by the operator of the application server 400. The printing-related program 424b is provided by being uploaded to the application server 400, for example, by a business that provides printing services using the resources of the application server 400, such as a business that manufactures the printer 200. Note that all or part of the voice analysis program 424a may be provided by a business that manufactures the printer 200. Alternatively, all or part of the print-related program 424b may be provided by a business operator that operates the application server 400.

アプリケーションサーバ４００、特にＣＰＵ４１０は、音声解析プログラム４２４ａを実行することによって、音声解析処理部４２４ａ′（図２参照）として機能する。音声解析処理部４２４ａ′は、音声認識処理や形態素解析処理を実行する。音声認識処理は、音声データを解析して、音声データによって示される発話の内容を示すテキストデータを生成する処理である。形態素解析処理は、そのテキストデータを解析して、発話の内容に含まれる単語などの構成単位（形態素と呼ばれる）の抽出や、抽出された形態素の種別（例えば、品詞の種別）の特定を行う処理である。 The application server 400, particularly the CPU 410, functions as a speech analysis processing section 424a' (see FIG. 2) by executing the speech analysis program 424a. The speech analysis processing unit 424a' executes speech recognition processing and morphological analysis processing. Speech recognition processing is processing that analyzes audio data and generates text data indicating the content of the utterance indicated by the audio data. The morphological analysis process analyzes the text data to extract constituent units such as words (called morphemes) included in the content of the utterance, and to identify the type of the extracted morpheme (for example, the type of part of speech). It is processing.

また、アプリケーションサーバ４００、特にＣＰＵ４１０は、印刷関連プログラム４２４ｂを実行することによって、印刷関連処理部４２４ｂ′（図２参照）として機能する。印刷関連処理部４２４ｂ′は、音声データを解析して得られるテキストデータを用いて、プリンタ２００に動作指示を行うコマンドを生成する処理などを実行する。 Furthermore, the application server 400, particularly the CPU 410, functions as a print-related processing unit 424b' (see FIG. 2) by executing a print-related program 424b. The print-related processing unit 424b' uses text data obtained by analyzing audio data to perform processing such as generating a command for instructing the printer 200 to operate.

図２及び図３は、画像形成システム１０００によって実行される印刷制御処理のシーケンスを示している。印刷制御処理は、スマートスピーカ３００とアプリケーションサーバ４００とが協働して、プリンタ２００に印刷を実行させる処理である。 2 and 3 show a sequence of print control processing executed by the image forming system 1000. The print control process is a process in which the smart speaker 300 and the application server 400 cooperate to cause the printer 200 to execute printing.

図２及び図３の印刷制御処理は、所定の事前処理が実行された以降の処理を示している。その事前処理とは、まず、ユーザが、画像形成システム１０００に含まれない端末装置、例えばスマートフォンやＰＣ等を用いてスマートスピーカ３００にアクセスし、上記制御部３１０に含まれるメモリにユーザＩＤと声紋を登録する。次に、ユーザは、アプリケーションサーバ４００とプリンタ２００との間でデータが正しく送受信されるように、中継器１００、プリンタ２００及びスマートスピーカ３００を設定する。さらに、ユーザは、上記端末装置にインストールされた印刷アプリを用いてプリンタ２００に印刷を指示する。このような事前処理の実行により、ユーザは、プリンタ２００から印刷結果を受け取り、その印刷結果に不満あるいは要求を抱いている状態となっている。 The print control processing in FIGS. 2 and 3 shows the processing after the predetermined pre-processing is executed. The pre-processing means that the user first accesses the smart speaker 300 using a terminal device not included in the image forming system 1000, such as a smartphone or a PC, and stores the user ID and voiceprint in the memory included in the control unit 310. Register. Next, the user configures the repeater 100, the printer 200, and the smart speaker 300 so that data is correctly transmitted and received between the application server 400 and the printer 200. Furthermore, the user instructs the printer 200 to print using the print application installed on the terminal device. By performing such pre-processing, the user receives the print result from the printer 200 and is dissatisfied with or has a request for the print result.

図２において、まずＳ２で、ユーザが発話する。ユーザは、印刷結果が自身の意図した濃度より少し薄く感じたので、スマートスピーカ３００に対して、例えば「もう少し濃くなる？」と問いかけている。印刷制御処理は、スマートスピーカ３００がその発話された音声を検出した場合に、開始する。 In FIG. 2, first in S2, the user speaks. The user feels that the print result is a little lighter in density than he or she intended, so he asks the smart speaker 300, for example, "Would you like it to be a little darker?" The print control process starts when the smart speaker 300 detects the spoken voice.

Ｓ４では、スマートスピーカ３００は、ユーザによって発話された音声を示す音声データを生成する。つまり、「もう少し濃くなる？」との音声がスマートスピーカ３００に入力されると、スマートスピーカ３００は、その音声を示す音声データを生成する。 In S4, smart speaker 300 generates audio data representing the audio uttered by the user. In other words, when the voice "Would you like it to be a little darker?" is input to the smart speaker 300, the smart speaker 300 generates audio data representing the voice.

次に、Ｓ６では、スマートスピーカ３００は、その音声データと登録済みのユーザＩＤとをアプリケーションサーバ４００の音声解析処理部４２４ａ′に送信する。音声データの送信には、公知のプロトコル、例えば、ＨＴＴＰが用いられる。 Next, in S6, the smart speaker 300 transmits the audio data and the registered user ID to the audio analysis processing unit 424a' of the application server 400. A known protocol such as HTTP is used to transmit the audio data.

アプリケーションサーバ４００がその音声データとユーザＩＤとを受信すると、Ｓ８にて、アプリケーションサーバ４００の音声解析処理部４２４ａ′は、受信された音声データを解析する。具体的には、音声解析処理部４２４ａ′は、音声データに対して音声認識処理を実行し、音声データによって示される音声を示すテキストデータを生成する。例えば、「もう少し濃くなる？」との音声を示す音声データを受信した場合には、音声解析処理部４２４ａ′は、その音声の内容を示すテキストデータを生成する。音声解析処理部４２４ａ′は、さらに、そのテキストデータに対して形態素解析処理を実行する。これにより、生成されたテキストデータから、例えば、「もう少し」、「濃く」、「なる？」などの単語が抽出されるとともに、これらの単語の品詞種別（例えば、名詞、動詞）が特定される。音声解析処理部４２４ａ′は、形態素解析結果として、抽出された単語に品詞種別を対応付けたリストを生成する。 When the application server 400 receives the voice data and user ID, the voice analysis processing unit 424a' of the application server 400 analyzes the received voice data in S8. Specifically, the voice analysis processing unit 424a' performs voice recognition processing on the voice data and generates text data representing the voice represented by the voice data. For example, when receiving voice data indicating the voice ``Would it be a little darker?'', the voice analysis processing unit 424a' generates text data indicating the content of the voice. The speech analysis processing unit 424a' further performs morphological analysis processing on the text data. As a result, words such as "a little more," "darker," and "naru?" are extracted from the generated text data, and the part of speech of these words (e.g., noun, verb) is identified. . The speech analysis processing unit 424a' generates a list in which extracted words are associated with part-of-speech types as a result of morphological analysis.

次に、Ｓ１０では、音声解析処理部４２４ａ′は、生成されたテキストデータと、形態素解析結果と、スマートスピーカ３００から受信されたユーザＩＤと、を、印刷関連処理部４２４ｂ′に渡す。具体的には、音声解析処理部４２４ａ′は、例えば、データ記憶領域４２２内の所定領域にテキストデータと形態素解析結果とユーザＩＤとを格納して、印刷関連プログラム４２４ｂをコールする。 Next, in S10, the speech analysis processing section 424a' passes the generated text data, the morphological analysis result, and the user ID received from the smart speaker 300 to the printing-related processing section 424b'. Specifically, the speech analysis processing unit 424a' stores the text data, the morphological analysis result, and the user ID in a predetermined area within the data storage area 422, and calls the print-related program 424b.

音声解析処理部４２４ａ′からテキストデータと形態素解析結果とユーザＩＤとを受け取ると、Ｓ１２にて、印刷関連処理部４２４ｂ′は、テキストデータと形態素解析結果とを用いて、次に行うべき内容を決定する処理を実行する。具体的には、印刷関連処理部４２４ｂ′は、「もう少し」から何かの変更が小程度であり、「濃く」から何かは設定パラメータの項目が濃度パラメータであって、さらに変更する方向が増加方向であり、「なる？」から可能かどうかを問い合わせていることであると認識する。そして、この認識結果から、印刷関連処理部４２４ｂ′は、ユーザの発話内容が「濃度パラメータの値を現在の値より少し濃い方向に増加させること」と認識し、そのためにまず、プリンタ２００に対して、濃度パラメータの現在の設定値を問い合わせることであると決定する。そして、印刷関連処理部４２４ｂ′は、濃度パラメータの現在の設定値を問い合わせるためのコマンド（以下「現設定要求コマンド」という）を生成する。 Upon receiving the text data, morphological analysis results, and user ID from the speech analysis processing section 424a', in S12, the printing-related processing section 424b' uses the text data and the morphological analysis results to determine what to do next. Execute the process to determine. Specifically, the print-related processing unit 424b' determines that the setting parameter item for "a little more" is a small change, and the setting parameter item for "darkening" is a density parameter, and that there is no further change in the direction. It is recognized that the number is increasing, and that the question "Will it become?" is asking whether or not it is possible. Based on this recognition result, the print-related processing unit 424b' recognizes that the content of the user's utterance is to "increase the value of the density parameter in a direction slightly darker than the current value," and for this purpose, it first sends the message to the printer 200. Then, it is decided that the current setting value of the concentration parameter is to be queried. The print-related processing unit 424b' then generates a command (hereinafter referred to as "current setting request command") for inquiring about the current setting value of the density parameter.

次に、Ｓ１４では、印刷関連処理部４２４ｂ′は、現設定要求コマンドを中継器１００に送信する。中継器１００は、Ｓ１６にて、受信した現設定要求コマンドをそのまま、プリンタ２００に送信する。 Next, in S14, the print-related processing unit 424b' transmits a current setting request command to the repeater 100. In S16, the repeater 100 transmits the received current setting request command to the printer 200 as is.

プリンタ２００は、現設定要求コマンドを受信すると、Ｓ１８にて、濃度パラメータの現在の設定値及びプリンタ２００のモデル名を含む応答（以下「現設定応答」という）を中継器１００に送信する。 When printer 200 receives the current setting request command, it transmits a response (hereinafter referred to as "current setting response") including the current setting value of the density parameter and the model name of printer 200 to repeater 100 in S18.

中継器１００は、Ｓ２０にて、受信した現設定応答をそのまま、アプリケーションサーバ４００に送信する。アプリケーションサーバ４００の印刷関連処理部４２４ｂ′は、Ｓ２２にて、受信した現設定応答に基づいた処理を実行する。具体的には、例えば、現設定応答に、濃度パラメータの現在の設定値として“５”が含まれているとする。アプリケーションサーバ４００は、例えば、データ記憶領域４２２内にプリンタ２００を含む各種画像形成装置それぞれについて全設定パラメータの情報を記憶している。印刷関連処理部４２４ｂ′は、現設定応答に含まれるモデル名と一致する画像形成装置、つまりプリンタ２００の全設定パラメータの情報から、濃度パラメータとして設定可能な最大値を取得する。その最大値が、例えば“１０”であるとすると、印刷関連処理部４２４ｂ′は、濃度パラメータの値を現在値“５”からさらに“５”段階増加できることを知る。そこで、印刷関連処理部４２４ｂ′は、そのことを知らせるための音声データの基になるテキストデータを生成する。例えば、印刷関連処理部４２４ｂ′は、「５」、「濃くできる」と、小程度に対して「２」を選び、「あと５段階濃くできます。２段階濃くして再印刷しましょうか」のテキストデータを生成する。そして、印刷関連処理部４２４ｂ′は、Ｓ２４にて、生成したテキストデータを音声解析処理部４２４ａ′に渡す。 In S20, the repeater 100 transmits the received current setting response to the application server 400 as is. The print-related processing unit 424b' of the application server 400 executes processing based on the received current setting response in S22. Specifically, for example, assume that the current setting response includes "5" as the current setting value of the density parameter. For example, the application server 400 stores information on all setting parameters for each of various image forming apparatuses including the printer 200 in a data storage area 422. The print-related processing unit 424b' obtains the maximum value that can be set as the density parameter from information on all setting parameters of the image forming apparatus, that is, the printer 200, that matches the model name included in the current setting response. If the maximum value is, for example, "10", the print-related processing unit 424b' knows that the value of the density parameter can be further increased by "5" steps from the current value "5". Therefore, the print-related processing unit 424b' generates text data that is the basis of audio data to notify the user of this fact. For example, the print-related processing unit 424b' selects "5", "can be darkened" and "2" for the small degree, and then asks "you can make it darker by 5 more levels. Shall we print again by making it darker by 2 levels?" generate text data. Then, the print-related processing section 424b' passes the generated text data to the speech analysis processing section 424a' in S24.

次に、Ｓ２６では、音声解析処理部４２４ａ′は、そのテキストデータに基づいて音声データを作成する。そして、音声解析処理部４２４ａ′は、図３のＳ２８にて、作成した音声データをスマートスピーカ３００に送信する。 Next, in S26, the voice analysis processing section 424a' creates voice data based on the text data. Then, the audio analysis processing unit 424a' transmits the created audio data to the smart speaker 300 in S28 of FIG.

次に、Ｓ３０では、スマートスピーカ３００は、受信した音声データを音声として発生させる。この音声を聞いたユーザは、Ｓ３２にて、例えば「ＯＫ」と発話する。また、例えば「１段階だけ濃くして」と発話してもよい。これに応じて、スマートスピーカ３００及び音声解析処理部４２４ａ′は、Ｓ３２にて、上記Ｓ４～Ｓ１０の処理と同様の処理を実行し、テキストデータを生成して、印刷関連処理部４２４ｂ′に渡す。 Next, in S30, the smart speaker 300 generates the received audio data as audio. The user who has heard this voice utters, for example, "OK" in S32. Alternatively, for example, you may say, "Darken it by one level." In response to this, the smart speaker 300 and the audio analysis processing unit 424a' execute the same processing as the processing in S4 to S10 above in S32, generate text data, and pass it to the print-related processing unit 424b'. .

Ｓ３６では、印刷関連処理部４２４ｂ′は、そのテキストデータに基づいてコマンドを作成する。具体的には、印刷関連処理部４２４ｂ′は、２段階であれば、濃度パラメータの値を現在の設定値＋２、すなわち７にして、再印刷することを指示するコマンドを生成する。そして、アプリケーションサーバ４００は、Ｓ３８にて、生成したコマンドを中継器１００に送信する。中継器１００は、Ｓ４０にて、受信した再印刷のコマンドをそのまま、プリンタ２００に送信する。Ｓ４２では、プリンタ２００は、受信した再印刷のコマンドに従って、濃度パラメータの設定値にて印刷機構２５０を制御し、制御部２１０に記憶している前回印刷に用いた印刷データに基づく再印刷を実行する。 In S36, the print-related processing unit 424b' creates a command based on the text data. Specifically, in the case of 2-stage printing, the print-related processing unit 424b' generates a command instructing to change the density parameter value to the current setting value + 2, that is, 7, and to reprint. The application server 400 then transmits the generated command to the repeater 100 in S38. In S40, the repeater 100 transmits the received reprint command to the printer 200 as is. In S<b>42 , the printer 200 controls the printing mechanism 250 using the density parameter settings according to the received reprint command, and executes reprinting based on the print data used for the previous printing stored in the control unit 210 . do.

印刷関連処理部４２４ｂ′は、上記Ｓ３８で、印刷コマンドを中継器１００に送信するときに、プリンタ２００に再印刷を指示したことを知らせるための音声データの基になるテキストデータを生成し、音声解析処理部４２４ａ′に渡すようにしてもよい。これにより、スマートスピーカ３００から、プリンタ２００に再印刷を指示したことを知らせる音声が発生するので、ユーザは、プリンタ２００から再印刷結果が出力されることを事前に知ることができる。 When the print-related processing unit 424b' sends the print command to the repeater 100 in S38, it generates text data that is the basis of the voice data for notifying the printer 200 that reprinting has been instructed, and outputs the voice data. It may also be passed to the analysis processing section 424a'. As a result, the smart speaker 300 generates a sound informing that the printer 200 has been instructed to reprint, so the user can know in advance that the printer 200 will output the reprint results.

また、再印刷結果に不満な場合には、ユーザは再度、不満な内容をスマートスピーカ３００に発話する。これに応じて、上記Ｓ４～Ｓ４２の処理が繰り返され、プリンタ２００から再々印刷結果が出力される。 Furthermore, if the user is dissatisfied with the reprint result, the user speaks again to the smart speaker 300 about the content of the dissatisfaction. In response, the processes of S4 to S42 are repeated, and the printer 200 outputs the print results again.

なお、プリンタ２００は、濃度パラメータの設定値を変更できるように構成されているが、他のプリンタでは、濃度パラメータの設定値を変更できないモデルもある。その場合には、印刷関連処理部４２４ｂ′は、設定変更できないことを知らせるための音声データの基になるテキストデータを生成し、音声解析処理部４２４ａ′に渡すようにすればよい。 Note that although the printer 200 is configured so that the set value of the density parameter can be changed, there are models of other printers in which the set value of the density parameter cannot be changed. In that case, the print-related processing section 424b' may generate text data that is the basis of the audio data for informing that the settings cannot be changed, and pass it to the audio analysis processing section 424a'.

また、本実施形態では、上記Ｓ２で、ユーザが「もう少し濃くなる？」と問いかけているので、上記Ｓ３０で、「あと５段階濃くできます。２段階濃くして再印刷しましょうか」とユーザに返答している。これに代えて、上記Ｓ２２～Ｓ３４の処理を省略して、ユーザに返答せずに、処理をＳ２０からＳ３６に進ませ、「濃く」から決定した濃度パラメータの現在値に、「もう少し」から少し増加方向に決定した値だけ加算して、再印刷することを指示するコマンドを生成するようにしてもよい。 In addition, in this embodiment, in S2 above, the user asks, "Would you like it to be a little darker?", so in S30, the user asks, "You can make it 5 more shades darker. Shall we make it 2 shades darker and reprint?" is responding to. Alternatively, the processes of S22 to S34 described above are omitted, the process proceeds from S20 to S36 without replying to the user, and the current value of the density parameter determined from "Darker" is changed from "A little more" to a little more. A command may be generated to instruct reprinting by adding the determined value in the increasing direction.

なお、本実施形態において、ユーザが発話した内容やスマートスピーカ３００から発生された音声の内容は、一例に過ぎない。また、決定した設定パラメータの項目も、濃度パラメータに限らず、ユーザの発話内容に応じて様々な設定パラメータの項目が決定される。さらに、パラメータ値も、増加方向の変更に限らず、減少方向の変更もあり得る。また、変更しないこともあり得る。 Note that in this embodiment, the content uttered by the user and the content of the audio generated from the smart speaker 300 are merely examples. Furthermore, the determined setting parameter items are not limited to the density parameter, and various setting parameter items are determined depending on the content of the user's utterance. Furthermore, the parameter values are not limited to changes in the increasing direction, but may also be changed in the decreasing direction. It is also possible that it will not be changed.

このように、本実施形態の画像形成システム１０００では、プリンタ２００からの印刷結果に不満あるいは要求を抱いたユーザは、抱いた不満あるいは要求の内容をスマートスピーカ３００に発話するだけで、その不満を解消する印刷結果、あるいはその要求に沿った印刷結果になるように、自動的に設定パラメータの項目とその値が決定されて、プリンタ２００に対する印刷制御がなされるので、ユーザの意図通りの画像形成を行うことが可能となる。特に、変更すべき設定パラメータの値と印刷結果の関係をよく理解していないようなユーザにとって、ユーザが直接、設定パラメータの項目とパラメータ値を指示しなくてもよいので、有用である。 As described above, in the image forming system 1000 of the present embodiment, a user who is dissatisfied with or has a request for the print result from the printer 200 can express his or her dissatisfaction by simply speaking the contents of the dissatisfaction or request to the smart speaker 300. Since the setting parameter items and their values are automatically determined and the printing control for the printer 200 is performed so as to produce the print result that resolves the problem or the print result that meets the request, the image can be formed as intended by the user. It becomes possible to do this. This is particularly useful for users who do not fully understand the relationship between setting parameter values to be changed and print results, since the user does not have to directly specify setting parameter items and parameter values.

また、ユーザは印刷結果に対する直感的な印象を発話するだけで、印刷設定が変更されて意図通りの印刷結果が得られるので、ユーザは直感的に印刷設定の変更ができるようになる。この結果、ユーザは、印刷アプリからユーザの意図通りの印刷結果が得られるような印刷設定を探す必要がなくなる。また、印刷アプリに予め登録されている印刷設定にはない、新規な印刷設定に変更できることもあり得る。 Further, the user can change the print settings intuitively because the user can change the print settings and obtain the intended print result just by uttering his or her intuitive impression of the print result. As a result, the user does not need to search for print settings from the print application that will allow the print result as intended by the user to be obtained. Furthermore, it may be possible to change to new print settings that are not included in the print settings registered in advance in the print application.

本実施形態の画像形成システム１０００では、アプリケーションサーバ４００が、ユーザの発話内容の解析からプリンタ２００へ送信するコマンドの生成まで行うので、スマートスピーカ３００として種々のものを採用してもよい。つまり、機種依存性がない。また、中継器１００も、アプリケーションサーバ４００とプリンタ２００との間で送受信されるデータをそのまま中継するだけであるので、データの中継ができるものであればよく、機種依存性がない。 In the image forming system 1000 of this embodiment, the application server 400 performs everything from analyzing the content of the user's utterances to generating commands to be sent to the printer 200, so various smart speakers 300 may be used. In other words, there is no model dependence. Further, since the relay device 100 simply relays the data sent and received between the application server 400 and the printer 200, it is sufficient that it can relay data, and there is no model dependence.

図４は、アプリケーションサーバ４００の印刷関連処理部４２４ｂ′が記憶部４２０に記憶するログ情報のフォーマットの一例を示している。印刷関連処理部４２４ｂ′は、ユーザがスマートスピーカ３００に発話すると、発話の度にログ情報を取得し、記憶部４２０に記憶するようにしている。ログ情報は、図４に示すように、１．発話データと２．プリンタ情報とを対応付けて記憶する。 FIG. 4 shows an example of the format of log information stored in the storage unit 420 by the print-related processing unit 424b' of the application server 400. When the user speaks into the smart speaker 300, the print-related processing unit 424b' acquires log information every time the user speaks and stores it in the storage unit 420. As shown in FIG. 4, the log information includes 1. Utterance data and 2. It is stored in association with the printer information.

１．発話データとしては、１－１：日時、１－２：地域、１－３：生データ及び１－４：抽出データを取得する。「日時」は、ユーザが発話した日時である。「地域」は、ユーザが発話したときに居る地域である。地域は、国家単位、県単位、市町村単位、さらに細かい単位など、いずれの単位でもよい。例えば、スマートスピーカ３００にＧＰＳを搭載し、ＧＰＳから得られた位置情報に基づいて、「地域」を決定すればよい。「生データ」は、スマートスピーカ３００から入力された音声データそのものである。「抽出データ」とは、「生データ」から、上述のようにして生成されたテキストデータと、このテキストデータに形態素解析処理を施して生成された形態素解析結果のことである。 1. As speech data, 1-1: date and time, 1-2: region, 1-3: raw data, and 1-4: extracted data are acquired. “Date and time” is the date and time when the user spoke. The "region" is the region where the user is when he/she speaks. A region may be any unit, such as a national unit, a prefecture, a municipality, or a more detailed unit. For example, the smart speaker 300 may be equipped with a GPS, and the "region" may be determined based on position information obtained from the GPS. “Raw data” is the audio data itself input from the smart speaker 300. "Extracted data" refers to the text data generated from the "raw data" as described above, and the morphological analysis results generated by performing morphological analysis processing on this text data.

２．プリンタ情報としては、２－１：印刷アプリ情報、２－２：印刷設定及び２－３：モデル情報を取得する。「印刷アプリ情報」は、上記図２及び図３の印刷制御処理の事前処理で用いた印刷アプリの情報のことである。印刷アプリの情報は、その印刷アプリがインストールされている端末装置に問い合わせるか、あるいは、プリンタ２００が印刷に使用された印刷アプリの情報を記憶している場合には、上記Ｓ１４で、現設定要求コマンドを送信するときに、印刷アプリの情報も問い合わせるようにすればよい。「印刷設定」は、プリンタ２００の発話時に設定中の印刷設定である。Ｓ１４では、上記Ｓ２で発話された発話内容に関連する設定パラメータの情報のみ要求するようにしたが、全ての設定パラメータの設定情報を要求するようにすればよい。「モデル情報」は、プリンタ２００のモデル名のことである。上記Ｓ１８では、プリンタ２００は、モデル名も含んだ現設定応答を生成するので、このモデル名を用いればよい。なお、「印刷設定」は、プリンタ２００に設定中の印刷設定の全てであっても、そのうちの一部であってもよい。一部とは、１つであってもよい。この場合、発話内容に関係の深い１つの印刷設定を選択するようにしてもよい。 2. As printer information, 2-1: print application information, 2-2: print settings, and 2-3: model information are acquired. "Printing application information" is information about the printing application used in the pre-processing of the printing control processing shown in FIGS. 2 and 3 above. Information about the printing application can be obtained by inquiring the terminal device on which the printing application is installed, or, if the printer 200 stores information about the printing application used for printing, by requesting the current settings in S14 above. When sending a command, you can also inquire about the print application information. “Print settings” are print settings that are being set when the printer 200 speaks. In S14, only information on setting parameters related to the content of the utterance uttered in S2 is requested, but setting information on all setting parameters may be requested. “Model information” is the model name of the printer 200. In S18 above, the printer 200 generates a current setting response that also includes the model name, so this model name may be used. Note that the "print settings" may be all of the print settings being set in the printer 200, or may be some of them. A part may be one. In this case, one print setting closely related to the content of the utterance may be selected.

さらにログ情報を、発話データ内にプリンタ２００の設定に関する用語が含まれているログ情報と、プリンタ２００の設定に関する用語が含まれていないログ情報とに分類し、各類型毎に記憶しておく。このように記憶しておけば、類型毎にログ情報を使用でき、便利である。なお、プリンタ２００の設定に関する用語が含まれているか否かは、１－４：抽出データに基づいて直ちに判断できる。 Furthermore, the log information is classified into log information whose utterance data includes terms related to the settings of the printer 200 and log information which does not include terms related to the settings of the printer 200, and is stored for each type. . If stored in this way, log information can be used for each type, which is convenient. Note that whether or not a term related to the settings of the printer 200 is included can be immediately determined based on 1-4: Extracted Data.

このようにして取得したログ情報は、プリンタ２００に対するユーザの幅広い好みや要望を知り、今後の製品開発に生かすために利用する。従来から、印刷アプリのＵＩ画面に対するユーザ操作は操作ログとして記憶されていたが、ユーザがしたいこと（Ｗａｎｔ）を把握できる範囲は限られていた。例えば、濃度パラメータの設定値が最大に設定されている場合、ユーザはその設定値でよいのか、もっと濃くしたいのか、操作ログからでは判断が困難である。一方、ユーザの発話が「もっと濃くしたい」や「まだ薄いな」であり、これがログ情報として収集されていた場合、ユーザのＷａｎｔは、装置の設定可能な範囲を超えたＷａｎｔであることが分かる。このように、本実施形態のログ情報からは、ユーザの幅広い好みや要望を知ることができる。 The log information acquired in this manner is used to learn about a wide range of user preferences and requests regarding the printer 200, and to utilize this information in future product development. Conventionally, user operations on the UI screen of a printing application have been stored as an operation log, but the range in which what the user wants to do has been limited. For example, when the setting value of the density parameter is set to the maximum, it is difficult for the user to determine from the operation log whether the setting value is acceptable or whether the user wants to make it darker. On the other hand, if the user's utterances are ``I want it to be darker'' or ``It's still lighter'' and this is collected as log information, it can be seen that the user's Want exceeds the range that can be set by the device. . In this way, from the log information of this embodiment, it is possible to know a wide range of user preferences and requests.

なお、ログ情報として、音声データをテキスト化しただけの情報を記憶することも考えられるが、それでは、何を基準にユーザがその発話に及んだか、後から分かり難い。そこで、本実施形態では、プリンタの設定等の情報と発話の情報とを対応付けて、ログ情報として記憶するようにしている。これにより、ユーザのＷａｎｔの把握の正確性が大きく向上し、ログ情報としての有用性も大きく向上する。 Note that it is conceivable to store information simply obtained by converting voice data into text as the log information, but in that case, it would be difficult to later determine on what basis the user made the utterance. Therefore, in this embodiment, information such as printer settings and utterance information are associated with each other and stored as log information. As a result, the accuracy of understanding what the user wants is greatly improved, and the usefulness as log information is also greatly improved.

また、ユーザの好みや要望を知る方法として、例えば、プリンタのサポートサイトなどにユーザの要望欄を設け、自由に書き込んでもらう方法がある。この方法で得られたユーザの好みや要望には、偏りがある場合がある。つまり、積極的に書き込む意思のあるユーザからの好みや要望しか得られないからである。これに対して、本実施形態のようにして得られるログ情報からは、偏りのない広い範囲のユーザからの好みや要望を得ることができる。このため、幅広いユーザのＷａｎｔに対応したサービスの提供等の事業拡大や、次の製品開発等に生かすことができる。 Further, as a method of knowing the user's preferences and requests, for example, there is a method of setting up a request column for users on a printer support site, etc., and asking users to freely write in their requests. User preferences and requests obtained using this method may be biased. In other words, only the preferences and requests of users who are willing to write are obtained. On the other hand, from the log information obtained as in this embodiment, it is possible to obtain unbiased preferences and requests from a wide range of users. Therefore, it can be utilized for business expansion, such as providing services that meet the wants of a wide range of users, and for the development of the next product.

以上説明したように、本実施形態のアプリケーションサーバ４００は、ネットワークＩＦ４８０と、ＣＰＵ４１０と、を備えている。そして、ＣＰＵ４１０は、ネットワークＩＦ４８０を介して接続された、音声を入力及び出力するスマートスピーカ３００から、プリンタ２００のユーザが発音することにより入力された音声の内容を認識し（Ｓ８）、認識された音声の内容が、プリンタ２００が形成した画像に対するユーザの不満あるいは要求を示す内容である場合、認識された音声の内容に基づいて、プリンタ２００が変更すべき設定パラメータの項目及び値を決定し（Ｓ１２）、決定された設定パラメータの項目及び値を含むコマンドを生成し（Ｓ３６）、生成されたコマンドをプリンタ２００に送信する（Ｓ３８）。 As explained above, the application server 400 of this embodiment includes the network IF 480 and the CPU 410. Then, the CPU 410 recognizes the contents of the voice input by the user of the printer 200 pronouncing it from the smart speaker 300 connected via the network IF 480 and inputting and outputting voice (S8). If the content of the voice indicates the user's dissatisfaction with or request for the image formed by the printer 200, the printer 200 determines the setting parameter items and values to be changed based on the recognized voice content ( S12), a command including the determined setting parameter items and values is generated (S36), and the generated command is sent to the printer 200 (S38).

このように、本実施形態のアプリケーションサーバ４００では、ユーザは直接、設定パラメータの項目とパラメータ値を指示しなくても、ユーザの意図通りの画像形成を行うことが可能となる。 In this manner, the application server 400 of this embodiment allows the user to form an image as intended by the user without directly instructing the setting parameter items and parameter values.

ちなみに、本実施形態において、アプリケーションサーバ４００は、「情報処理装置」の一例である。ネットワークＩＦ４８０は、「通信インタフェース」の一例である。ＣＰＵ４１０は、「制御装置」の一例である。プリンタ２００は、「画像形成装置」の一例である。 Incidentally, in this embodiment, the application server 400 is an example of an "information processing device." Network IF 480 is an example of a "communications interface." CPU 410 is an example of a "control device." Printer 200 is an example of an "image forming apparatus."

また、ＣＰＵ４１０は、決定された設定パラメータ項目のプリンタ２００における現在の設定値をプリンタ２００から取得し（Ｓ２０）、取得された現在の設定値及び認識された音声の内容に基づいて、決定された設定パラメータ項目の値を決定する（Ｓ２２）。 Further, the CPU 410 obtains the current setting value of the determined setting parameter item in the printer 200 from the printer 200 (S20), and based on the obtained current setting value and the content of the recognized voice, the CPU 410 The value of the setting parameter item is determined (S22).

これにより、設定パラメータの値は自動的に決定されるので、便利である。 This is convenient because the values of the setting parameters are automatically determined.

また、ＣＰＵ４１０は、決定された設定パラメータ項目の値を決定する前に、決定された設定パラメータ項目において設定可能な最大値又は最小値と、取得された現在の設定値とを比較し、現在の設定値に加算又は減算できる値の範囲を発音する音声データを、ネットワークＩＦ４８０を介してスマートスピーカ３００に送信し（Ｓ２８）、これに応じて、ユーザが現在の設定値に加算又は減算する値を発音した場合、現在の設定値に加算又は減算する値を認識し、決定された設定パラメータ項目の値を、認識された値分現在の設定値を加算又は減算した値に決定する（Ｓ３６）。 Furthermore, before determining the value of the determined setting parameter item, the CPU 410 compares the maximum or minimum value that can be set in the determined setting parameter item with the obtained current setting value, and Audio data that pronounces the range of values that can be added to or subtracted from the set value is transmitted to the smart speaker 300 via the network IF 480 (S28), and in response, the user selects the value to be added to or subtracted from the current set value. If the sound is generated, the value to be added or subtracted from the current setting value is recognized, and the value of the determined setting parameter item is determined to be the value obtained by adding or subtracting the current setting value by the recognized value (S36).

これにより、ユーザは、現在の設定値に加算又は減算する値を発音するだけで、設定パラメータの値を指示できるので、便利である。 This is convenient because the user can specify the value of the setting parameter simply by pronouncing the value to be added to or subtracted from the current setting value.

また、ＣＰＵ４１０は、プリンタ２００から現在の設定値を取得できなかった場合、取得できなかったことを発音する音声データを、ネットワークＩＦ４８０を介してスマートスピーカ３００に送信する。 Furthermore, when the CPU 410 is unable to obtain the current setting value from the printer 200, it transmits audio data that sounds that the current setting value could not be obtained to the smart speaker 300 via the network IF 480.

これにより、ユーザは、プリンタ２００が設定値を変更できるかどうか音声により知ることができるので、便利である。 This is convenient because the user can hear via voice whether or not the printer 200 can change the setting value.

また、ＣＰＵ４１０は、認識された音声の内容が、プリンタ２００に対して再度の画像形成を指示する内容である場合、画像形成の実行リクエストを含むコマンドを生成し（Ｓ３６）、生成されたコマンドをプリンタ２００に送信する（Ｓ３８）。 Further, if the content of the recognized voice is content that instructs the printer 200 to form an image again, the CPU 410 generates a command including a request to execute image formation (S36), and executes the generated command. The information is sent to the printer 200 (S38).

これにより、ユーザは、声で指示するだけで、プリンタ２００に再度の画像形成を指示できるので、便利である。 This is convenient because the user can instruct the printer 200 to form an image again simply by giving an instruction by voice.

また、ＣＰＵ４１０は、コマンドを送信した後、プリンタ２００に対して再度の画像形成を指示したことを発音する音声データを、ネットワークＩＦ４８０を介してスマートスピーカ３００に送信する。 Further, after transmitting the command, the CPU 410 transmits to the smart speaker 300 via the network IF 480 audio data that indicates that the printer 200 has been instructed to form an image again.

これにより、ユーザは、指示した内容をプリンタ２００に指示されたことを音声で知ることができるので、便利である。 This is convenient because the user can hear by voice that the printer 200 has been instructed to perform the instruction.

また、ＣＰＵ４１０は、ネットワークＩＦ４８０を介して接続された、プリンタ２００とＰ２Ｐ通信を行う中継器１００を介して、プリンタ２００とコマンドを含むデータの送受信を行う。 Further, the CPU 410 transmits and receives data including commands to and from the printer 200 via the relay 100 that is connected via the network IF 480 and performs P2P communication with the printer 200 .

これにより、プリンタ２００がネットワークＩＦを備えていなくても、プリンタ２００は、中継器１００を介してアプリケーションサーバ４００とデータの送受信を行うことができる。 As a result, even if the printer 200 is not equipped with a network IF, the printer 200 can send and receive data to and from the application server 400 via the repeater 100.

なお、本発明は上記実施形態に限定されるものでなく、その趣旨を逸脱しない範囲で様々な変更が可能である。 Note that the present invention is not limited to the above embodiments, and various changes can be made without departing from the spirit thereof.

（１）上記実施形態では、音声データを解析する処理は、アプリケーションサーバ４００の音声解析処理部４２４ａ′が実行している。これに代えて、音声データを解析する処理の一部または全部は、スマートスピーカ３００が実行してもよい。また、音声データを解析する処理の一部または全部は、印刷関連処理部４２４ｂ′が実行してもよい。例えば、音声解析処理部４２４ａ′は、音声認識処理を行ってテキストデータを生成する処理だけを行い、単語を抽出する形態素解析処理は、印刷関連処理部４２４ｂ′が実行してもよい。 (1) In the above embodiment, the audio analysis processing unit 424a' of the application server 400 executes the process of analyzing audio data. Alternatively, part or all of the process of analyzing audio data may be executed by the smart speaker 300. Furthermore, part or all of the processing for analyzing audio data may be executed by the print-related processing unit 424b'. For example, the speech analysis processing section 424a' may perform only the processing of performing speech recognition processing to generate text data, and the printing-related processing section 424b' may perform the morphological analysis processing of extracting words.

（２）上記実施形態では、画像形成装置として、プリンタ２００を採用したが、これに限らず、印刷機能にスキャン機能やファックス機能を加えた複合機を採用してもよい。この場合には、例えば、スマートスピーカ３００に入力される音声に応じて、その複合機に印刷を行わせることができる。 (2) In the above embodiment, the printer 200 is used as the image forming apparatus, but the present invention is not limited to this, and a multifunction device that has a scanning function or a facsimile function in addition to a printing function may be used. In this case, for example, the multifunction device can be caused to print in response to audio input to the smart speaker 300.

（３）アプリケーションサーバ４００は、クラウドサーバであるが、ＬＡＮ７０に接続され、インターネット８０に接続されないローカルサーバであってもよい。この場合には、スマートスピーカ３００からアプリケーションサーバ４００にユーザＩＤなどの識別情報を送信せず、音声データだけを送信してもよい。 (3) Although the application server 400 is a cloud server, it may be a local server connected to the LAN 70 and not connected to the Internet 80. In this case, only the audio data may be transmitted from the smart speaker 300 to the application server 400 without transmitting identification information such as a user ID.

（４）中継器１００とプリンタ２００とを接続するインタフェースは、ブルートゥースＩＦ１６０に限らず、例えば、ＵＳＢなどの有線インタフェースであってもよいし、ＮＦＣ（Near field communicationの略）などの他の無線インタフェースであってもよい。 (4) The interface for connecting the repeater 100 and the printer 200 is not limited to the Bluetooth IF 160, but may also be a wired interface such as a USB, or another wireless interface such as NFC (abbreviation for near field communication). It may be.

（５）上記実施形態では、中継器１００が、アプリケーションサーバ４００とプリンタ２００との間で送受信されるデータを中継する構成としたが、さらにアプリケーションサーバ４００との間にスマートスピーカ３００を介して中継してもよい。この場合、スマートスピーカ３００のブルートゥースＩＦ１６０あるいは無線ＬＡＮＩＦ１８０を用いた通信が中継器１００との間で行われる。また、プリンタ２００が無線ＬＡＮＩＦを備えているような場合、アプリケーションサーバ４００が中継器１００を介さずに直接プリンタ２００と通信するようにしてもよいし、あるいはアプリケーションサーバ４００がスマートスピーカ３００を介してプリンタ２００と通信するようにしてもよい。 (5) In the above embodiment, the relay device 100 is configured to relay data sent and received between the application server 400 and the printer 200, but it is also configured to relay data between the application server 400 and the smart speaker 300. You may. In this case, communication using the Bluetooth IF 160 or wireless LAN IF 180 of the smart speaker 300 is performed with the repeater 100. Furthermore, if the printer 200 is equipped with a wireless LAN IF, the application server 400 may communicate directly with the printer 200 without going through the repeater 100, or the application server 400 may communicate with the printer 200 directly through the smart speaker 300. It may also be configured to communicate with the printer 200.

（６）上記実施形態において、ハードウェアによって実現されていた構成の一部をソフトウェアに置き換えるようにしてもよく、逆に、ソフトウェアによって実現されていた構成の一部をハードウェアに置き換えるようにしてもよい。 (6) In the above embodiment, a part of the configuration realized by hardware may be replaced with software, or conversely, a part of the configuration realized by software may be replaced by hardware. Good too.

５０…アクセスポイント、７０…ＬＡＮ、８０…インターネット、１００…中継器、１１０…ＣＰＵ、１２０…メモリ、１６０，２６０，３６０…ブルートゥースＩＦ、１８０，３８０…無線ＬＡＮＩＦ、２００…プリンタ、２１０…制御部、２５０…印刷機構、３００…スマートスピーカ、３１０…制御部、３５０…音声入出力部、４００…アプリケーションサーバ、４１０…ＣＰＵ、４２０…記憶部、４２４ａ…音声解析プログラム、４２４ｂ…印刷関連プログラム、４２４ｂ′…印刷関連処理部、４２４ａ′…音声解析処理部、４８０…ネットワークＩＦ、１０００…画像形成システム。 50... Access point, 70... LAN, 80... Internet, 100... Repeater, 110... CPU, 120... Memory, 160, 260, 360... Bluetooth IF, 180, 380... Wireless LAN IF, 200... Printer, 210... Control unit , 250...Printing mechanism, 300...Smart speaker, 310...Control unit, 350...Audio input/output unit, 400...Application server, 410...CPU, 420...Storage unit, 424a...Audio analysis program, 424b...Printing related program, 424b '...Print-related processing unit, 424a'...Audio analysis processing unit, 480...Network IF, 1000...Image forming system.

Claims

a communication interface;
a control device;
Equipped with
The control device includes:
Recognizing the content of the input voice when the user of the image forming apparatus pronounces it from a smart speaker connected via the communication interface that inputs and outputs voice,
When the content of the recognized voice is content indicating the user's dissatisfaction with or request for the image formed by the image forming apparatus,
executing processing for determining setting parameter items to be changed by the image forming apparatus based on the content of the recognized voice;
obtaining the current setting value of the determined setting parameter item in the image forming apparatus from the image forming apparatus;
A maximum value that can be set in the determined configuration parameter item before determining the value of the determined configuration parameter item based on the acquired current configuration value and the content of the recognized voice. or comparing the minimum value with the acquired current setting value, and transmitting audio data to the smart speaker via the communication interface to pronounce a range of values that can be added to or subtracted from the current setting value. ,
When the user pronounces a value to be added to or subtracted from the current setting value in response to the voice data received by the smart speaker being generated as voice;
Recognizing a value to be added to or subtracted from the current set value;
determining a value obtained by adding or subtracting the recognized value to the obtained current value of the determined configuration parameter item;
Generate a command including items and values of the determined configuration parameters,
transmitting the generated command to the image forming apparatus;
Information processing device.

The control device includes:
If the current setting value cannot be acquired from the image forming apparatus, transmitting audio data that sounds that the current setting value could not be acquired to the smart speaker via the communication interface;
The information processing device according to claim 1 .

The control device includes:
If the content of the recognized voice is content that instructs the image forming apparatus to form an image again,
generating a command including a request to execute the image formation;
transmitting the generated command to the image forming apparatus;
The information processing device according to claim 1 or 2 .

The control device includes:
After transmitting the command, transmitting audio data to the smart speaker via the communication interface that indicates that the image forming apparatus is instructed to form an image again;
The information processing device according to claim 3 .

The control device includes:
transmitting and receiving data including the command to and from the image forming apparatus via a repeater connected via the communication interface and performing P2P communication with the image forming apparatus;
The information processing device according to any one of claims 1 to 4 .

If the text data of the input voice is " Would it be a little darker? ",
The control device includes:
Extracting “a little more”, “deeper” and “ naru? ” from the text data,
Recognize the degree from the extracted “a little more”,
Recognizing the concentration from the extracted “darkness”,
Recognizing that it is inquiring whether it is possible from the extracted " Naru? ",
Based on the content of the recognized voice, an image density item is determined as the item of the setting parameter, and a value that makes the density of the image darker is determined as the value of the setting parameter.
The information processing device according to any one of claims 1 to 5 .

A program executable by a computer of an information processing device equipped with a communication interface,
to the computer;
a recognition process that recognizes the content of the voice input by a user of the image forming apparatus pronouncing it from a smart speaker connected via the communication interface that inputs and outputs voice;
When the content of the voice recognized by the recognition process is content indicating the user's dissatisfaction with or request for the image formed by the image forming apparatus,
a process for determining setting parameter items to be changed by the image forming apparatus based on the content of the recognized voice;
a process of acquiring the current setting value of the determined setting parameter item in the image forming apparatus from the image forming apparatus;
A maximum value that can be set in the determined configuration parameter item before determining the value of the determined configuration parameter item based on the acquired current configuration value and the content of the recognized voice. or comparing the minimum value with the acquired current setting value, and transmitting audio data to the smart speaker via the communication interface to pronounce a range of values that can be added to or subtracted from the current setting value. processing and
When the user pronounces a value to be added to or subtracted from the current setting value in response to the voice data received by the smart speaker being generated as voice;
a process of recognizing a value to be added to or subtracted from the current setting value;
a process of determining a value obtained by adding or subtracting the recognized value to the obtained current value of the determined setting parameter item;
a generation process that generates a command including the determined setting parameter items and values;
a transmission process of transmitting the command generated by the generation process to the image forming apparatus;
A program to run.