WO2006005129A1 - Video compression apparatus and method - Google Patents
Video compression apparatus and method Download PDFInfo
- Publication number
- WO2006005129A1 WO2006005129A1 PCT/AU2005/001021 AU2005001021W WO2006005129A1 WO 2006005129 A1 WO2006005129 A1 WO 2006005129A1 AU 2005001021 W AU2005001021 W AU 2005001021W WO 2006005129 A1 WO2006005129 A1 WO 2006005129A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- video
- camera
- motion data
- motion
- detecting
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/527—Global motion vector estimation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N23/00—Cameras or camera modules comprising electronic image sensors; Control thereof
- H04N23/60—Control of cameras or camera modules
- H04N23/66—Remote control of cameras or camera parts, e.g. by remote control devices
- H04N23/661—Transmitting camera control signals through networks, e.g. control via the Internet
Definitions
- the present invention relates to the field of video capture and transmission. More particularly the present invention relates to apparatus and methods for capturing, transmitting and compressing video signals for the purpose of transmission over communication networks.
- Video conferencing is another example of video transmission over a communications network.
- the large bandwidth required for video conferencing means that a dedicated network must be used. Therefore, the attendant cost of the specialist video conferencing equipment, including the dedicated network, makes video conferencing economically feasible for only large enterprises.
- video conferencing like webcams, also requires the conference to occur in a video conferencing room to take advantage of the specialist hardware.
- a video transmission apparatus including: attachment means for attaching the apparatus to a video camera; sensor means for detecting motion of the camera and generating motion data therefrom; means for receiving video from the camera; means for associating the motion data with the video; and means for wirelessly transmitting the video and associated motion data to a receiver.
- a video compression assistance device attachable to or incorporable within a video capture device, the video-compression assistance device including: sensor means for detecting motion of the video capture device as video is captured and generating motion data therefrom; and . means for associating the motion data with the captured video, such that the motion data may be utilised in compressing die captured video.
- the present invention takes the approach of detecting camera motion whilst capturing video and utilising the detected motion to assist in the compression of that captured video. Essentially, the invention utilises hardware to reduce the computational burden on and increase the efficiency of video compression algorithms.
- the apparatus may be attached to a conventional video camera, to allow video captured by the camera to be transmitted, over a communications network.
- the motion data associated with the video can be used to efficiently determine the optimum offset of adjacent video frames to minimise inter-frame differences introduced by camera motion, thereby improving the compression rate of the video and reducing the computational burden on a compression algorithm.
- the attachment means attaches the apparatus to the video camera through the accessory shoe of the camera.
- the sensor means detects rotation of the camera about at least one axis.
- the sensor means detects rotation of the camera about a yaw axis (thereby detecting camera pan) and a pitch axis (thereby detecting camera tilt) .
- Camera pan and tilt may be detected by any suitable sensor means.
- the sensor means may include:
- the sensor means may generate motion data by any suitable means.
- the sensor means includes transducer means for transducing camera motion to variations in an electrical signal, whereupon the motion data can be extracted from the electrical signal.
- the motion data includes a sequence of values, each representing an instantaneous angular velocity of the camera about an axis, such as a pitch axis and/or a yaw axis.
- the means for associating the motion data with the video includes means (such as an on-screen display device (OSD)) for inserting indicia representative of the motion data into each frame of the video.
- the indicia may, for example, be alphanumeric characters overlayed in a small area of the frame.
- Other methods for associating motion data with a video frame that allows the data to be easily extracted from the frame could also be used.
- each frame has associated motion data that is indicative of the instantaneous angular velocity of the camera about at least one axis at the time of the frame.
- Timing information of the frame may be extracted from the frame itself, for example by the use of a video sync separator.
- a video compression method said video comprising a plurality of frames, the method including the steps of: detecting motion of the camera as the video is captured and generating motion data therefrom; ⁇ associating the motion data with the captured video; utilising the motion data to determine an offset between adjacent frames to thereby minimise inter-frame differences between said frames; and utilising the offset in compressing the video.
- the step of detecting motion of the camera may include the step of detecting rotation of the camera about at least one axis.
- the camera motion may be detected by attaching an apparatus having motion sensing means to the camera.
- the motion sensing means may include a first gyroscope configured to detect motion of the camera about a yaw axis, and a second gyroscope configured to detect rotation of the camera about a pitch axis.
- the sensor means includes transducer means for translating camera motion to variations in an electrical signal, whereupon the motion data can be extracted from the electrical signal.
- the motion data includes a sequence of values, each representing an instantaneous angular velocity of the camera about an axis, such as a pitch axis and/or a yaw axis.
- the step of associating motion data with a frame typically includes the step of inserting indicia representative of the motion data into the frame.
- the indicia may be alphanumeric characters, overlayed in a small area of the frame. It is to be understood however that any indicia that can be extracted from the frame could be used.
- the motion data may be digitally encoded and linked to the frames of digitally stored video.
- the step of extracting the motion data from the frame may include performing optical character recognition on the frame.
- the step of utilising the motion data to determine an offset of an adjacent frame includes forming a global motion vector from the motion data, said vector being used as a starting point for determining the offset of the adjacent frame.
- the method may be performed using suitable computer software and/or hardware.
- the present invention also provides a video camera having attached thereto a video transmission apparatus as described herein.
- Figure 1 is a perspective view of a video camera with an apparatus according to an embodiment of the invention attached.
- Figure 2 is a schematic illustration of the components of an apparatus according to an embodiment of the invention.
- Figure 3 is a schematic illustration of the video and motion encoder of the invention.
- Figure 4 is a schematic illustration of the video transport components of an embodiment of the invention.
- Figure 5 is a schematic illustration of a receiver suitable for receiving motion encoded video.
- Figure 6 is a schematic illustration of the components of the invention for associating motion data with a video frame.
- Figure 7 is a schematic illustration of a video-camera-enabled mobile phone handset having a video compression assistance device according the second aspect of the present invention incorporated therein.
- Figure 8 is an illustration of the use of motion encoded video for video compression.
- Figure 9 is an illustration of the user of a motion vector derived from motion data in global motion compensation between adjacent frames.
- Figure 10 is an illustration of the frame difference of adjacent frames with and without global motion compensation.
- FIG. 1 there is illustrated a video transmission apparatus 10 that is attached to a video camera 12 via the video camera's accessory shoe 18.
- a transmitter 19 is mounted on the top of the apparatus 10 for transmitting motion encoded video to a receiver (not shown) .
- the camera lens 14 points in a forward direction 16, with the camera 12 and the attached apparatus 10 being rotatable about both a pitch axis (camera tilt) and/or a yaw axis (camera pan) .
- the video camera can be used in conventional manner to capture video images in a digital or analog format, that are transmitted by the apparatus 10 to a receiver for compression and transmission over a communications network.
- the apparatus includes means for detecting motion of the camera and generating motion data therefrom.
- the motion detection means includes a first gyroscope (not shown) positioned along a yaw axis, to capture camera pan motion and a second gyroscope (not shown) positioned along a pitch axis to capture camera tilt motion.
- the operation and structure of gyroscopes is known to those skilled in the art and will not be described in further detail here.
- motion sensors 20 including the abovementioned gyroscopes are positioned within the apparatus 10 to sense the occurrence of camera tilt and pan.
- Motion data 26 representing the sensed camera motion is generated from the output of the, motion sensors 20.
- video data 23 is also output from the video camera 12 and is forwarded with the motion data 26 to the motion encoder 22.
- the motion data 26 is associated with the video data to form motion encoded video 25.
- a transmitter 19 receives the motion encoded video 25 and transmits it to a receiver (not shown) for compression and transmission over a communications network.
- the camera 12 outputs video in a conventional manner with the motion sensors 20 simultaneously outputting motion data 26 to the encoder 22.
- the motion sensors 20 include first and second gyroscopes orientated within the apparatus so that their axes are in horizontal and vertical planes both substantially normal (perpendicular) to the direction of the camera 12.
- the gyroscopes operate as transducers and convert sensed physical rotation of the camera about each axis to variations in an electrical signal, as would be understood by those skilled in the art.
- the gyroscopes are calibrated with a reference voltage to represent zero angular velocity about an axis.
- a sensor scale factor is used to represent the proportional change in the reference voltage relative to a corresponding change in angular velocity about an axis. To represent rotation in both directions an output of less than or greater than the reference voltage can be used.
- the scale factor is dependent on the characteristics of the sensor but will typically have units of millivolt per degree per second mV/°/s. Consequently the maximum angular velocity is the highest rate of change of degrees per second.
- the motion data 26 may be extracted from the electrical signal produced by the transducer by a number of means.
- the gyroscope output may be connected to an analogue to digital converter (not shown) which converts the analogue output to a sequence of digital values in a manner known to those skilled in the art.
- the analogue signal may be conditioned for input to the analogue to digital converter in a conventional manner.
- a 10 bit analogue to digital converter will convert the analogue signal from the gyroscope to a series of digital values 25 having one of 1,024 possibilities.
- the motion encoded video 28 is forwarded to the transmitter 19 which in turn forwards the motion encoded video to a receiver 30 for compression and transmission over a communications network, such as the Internet.
- a communications network such as the Internet.
- Figure 5 which illustrates operation of the receiver, the motion encoded video 28 is digitised by a video digitiser 32 and forwarded to a decoder 34 where the digitised video 36 and motion data 26 are separated from the motion encoded video 28.
- the motion data 26 extracted from the motion encoded video 28 is utilised to compensate for global motion 38 in the compression of the video 40.
- the compressed video is better suited for transmission over a communications network (not shown) .
- the process of associating motion data 26 with the video 24 is now described by reference to Figure 6.
- analogue motion data 26 is passed to an analogue to digital converter 46 for processing by a microprocessor 50 also resident in the apparatus 10.
- the composite video signal 42 is delivered from the camera 12 and passed to a video sync separator 44 which extracts timing information, such as the composite and vertical sync, from the composite video 42 .
- timing information such as the composite and vertical sync
- the video sync output of the sync separator goes high as understood by those skilled in the art. Extraction of timing information in this way allows the most current digitised motion data from the analogue to the digital converter to be associated with the current video frame of the composite video.
- This synchronisation is achieved by connecting the vertical sync of the video sync separator 44 to an input pin on the microprocessor 50.
- another input pin of the microprocessor 50 is connected to the output of the analogue to digital converter 46.
- the microprocessor 50 continuously polls the state of the input pin and triggers the sampling period for the conversion of the analogue motion data 26 to a digital format.
- the microprocessor initiates the conversion during the vertical blanking interval of the video signal 42.
- a number of samples are taken with the sample average being output by the microprocessor 50 to a video overlay device 48 which is also coupled as an output from the composite video 42 and the video sync separator 44.
- the output of the microprocessor 50 is converted to indicia such as alpha numeric text.
- An onscreen display device forming part of the video overlay 48 displays the text at a particular location on the correct frame of composite video 42 by utilising the timing data from the video sync separator 44.
- motion encoded video comprising the video image with overlayed motion data 28 is produced.
- the device 11 in this embodiment takes the form a microprocessor that is incorporated into the circuit board (not shown) of the handset 9, along with other hardware components, such as a video compression chip 27.
- the video camera 13 incorporated into the handset captures video in a digital, rather than analogue form, as does the camcorder described above.
- a digital file 15 stores the aggregate of the video frames captured by the video camera 13 in a suitable format.
- Motion sensors 20, incorporating gyroscopes (not shown) as described above as well as an accelerometer (not shown) for detecting translational motion of the camera 13, detect camera motion as the video is captured in the manner described above.
- the motion data is encoded in a digital form and is linked to the digital video before being stored in a file 21.
- a motion associating video component 23 receives and motion data from video 15 and motion data files 21 and associates the motion data with the video frames in the manner described above.
- the motion compensated video is passed to the video compressor 27 for compression in the manner described below, and then to a transmitter 27 for transmission over the mobile telephone network either to another handset, or to a receiver for transmission over the Internet.
- receivers motion encoded video 28 from the camcorder attachment described above receivers motion encoded video 28 from the camcorder attachment described above.
- the video compression process occurring at the receiver is essentially the same as the on-board compression occurring on the mobile telephone handset embodiment just described.
- an analogue motion encoded video is transmitted, it is received and demodulated at the receiver.
- a video digitiser 32 converts the analogue video 28 into a plurality of digital video frames in a manner understood by those skilled in the art.
- a computer program running on the receiver performs optical character recognition (OCR) 52 on the received frames to extract the motion data overlayed on the frame.
- OCR optical character recognition
- the frame is then cropped 54 to remove the section where the motion data appeared.
- Simultaneously the extracted motion data is passed with the cropped image to a global motion compensation process 38 which utilises the motion data for the purpose of compression.
- a video may be compressed where the difference between a series of frames is due mainly to global motion of the camera between the frames, rather than local motion of objects within the frame.
- This can be exploited by instead of storing each individual frame, storing a reference image and information about the global motion between the frames. The original frames can then be reconstructed after transmission by applying the information to the reference frame.
- additional processing is required to determine the direction and magnitude of the optimum offset between frames to take advantage of global motion compensation.
- the current and previous frames must iterate over a large number of combinations of displacement and differencing in a two dimensional area to locate this optimum offset.
- the motion data supplied to the compression algorithm along with the video describes the combined pan and tilt motion of the camera when originally capturing the frame, it can be formed into a vector (direction and magnitude) of the camera movement between the frames and can be used as a starting point in minimising the search for the optimum offset. It will thus be realised that supplying the motion data transforms the search area from a two dimensional to a linear space. It will also be realised that the computational burden of compression can be reduced by the supply of motion data on each video frame. This may allow for real time compression of video signals received from the handheld camera 16 thus allowing for live broadcast of video over a communications network such as the Internet.
- the previous frame 58 is illustrated comprising a matrix of pixels making up an image.
- a motion vector is used to find the optimum offset between the previous frame 58 and the current frame 60.
- the large area 62 of common image data between the previous and current frame can be used in compressing the video for transmission over the communications network. This process is also illustrated by reference to Figure 9 with the common area 62 between the frames being shown.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Studio Devices (AREA)
Abstract
Description
Claims
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AU2005262282A AU2005262282A1 (en) | 2004-07-12 | 2005-07-12 | Video compression apparatus and method |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AU2004903823A AU2004903823A0 (en) | 2004-07-12 | Video transmission apparatus and method | |
| AU2004903823 | 2004-07-12 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2006005129A1 true WO2006005129A1 (en) | 2006-01-19 |
Family
ID=35783451
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/AU2005/001021 Ceased WO2006005129A1 (en) | 2004-07-12 | 2005-07-12 | Video compression apparatus and method |
Country Status (1)
| Country | Link |
|---|---|
| WO (1) | WO2006005129A1 (en) |
Cited By (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2008020339A1 (en) * | 2006-08-18 | 2008-02-21 | Sony Ericsson Mobile Communications Ab | Angle correction for camera |
| US9838533B2 (en) | 2015-07-14 | 2017-12-05 | Ujet, Inc. | Customer communication system including scheduling |
| EP3843374A1 (en) * | 2019-12-25 | 2021-06-30 | Canon Kabushiki Kaisha | Image capturing apparatus |
| US11228906B2 (en) | 2015-07-14 | 2022-01-18 | Ujet, Inc. | Customer communication system |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4689673A (en) * | 1985-06-10 | 1987-08-25 | Nec Corporation | Movement compensation predictive encoder for a moving picture signal with a reduced amount of information |
| WO1998051083A1 (en) * | 1997-05-09 | 1998-11-12 | Commotion, Inc. | Camera motion sensing system |
| JP2001318586A (en) * | 2000-05-08 | 2001-11-16 | Hitachi Ltd | Motion-based motion generator |
| US20030058347A1 (en) * | 2001-09-26 | 2003-03-27 | Chulhee Lee | Methods and systems for efficient video compression by recording various state signals of video cameras |
-
2005
- 2005-07-12 WO PCT/AU2005/001021 patent/WO2006005129A1/en not_active Ceased
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4689673A (en) * | 1985-06-10 | 1987-08-25 | Nec Corporation | Movement compensation predictive encoder for a moving picture signal with a reduced amount of information |
| WO1998051083A1 (en) * | 1997-05-09 | 1998-11-12 | Commotion, Inc. | Camera motion sensing system |
| JP2001318586A (en) * | 2000-05-08 | 2001-11-16 | Hitachi Ltd | Motion-based motion generator |
| US20030058347A1 (en) * | 2001-09-26 | 2003-03-27 | Chulhee Lee | Methods and systems for efficient video compression by recording various state signals of video cameras |
Cited By (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2008020339A1 (en) * | 2006-08-18 | 2008-02-21 | Sony Ericsson Mobile Communications Ab | Angle correction for camera |
| US7593627B2 (en) | 2006-08-18 | 2009-09-22 | Sony Ericsson Mobile Communications Ab | Angle correction for camera |
| US9838533B2 (en) | 2015-07-14 | 2017-12-05 | Ujet, Inc. | Customer communication system including scheduling |
| US11228906B2 (en) | 2015-07-14 | 2022-01-18 | Ujet, Inc. | Customer communication system |
| EP3843374A1 (en) * | 2019-12-25 | 2021-06-30 | Canon Kabushiki Kaisha | Image capturing apparatus |
| US11368621B2 (en) | 2019-12-25 | 2022-06-21 | Canon Kabushiki Kaisha | Image capturing apparatus |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| KR101231518B1 (en) | Camera control apparatus, camera system, electronic conference system, and camera control method | |
| US8315481B2 (en) | Image transmitting apparatus, image receiving apparatus, image transmitting and receiving system, recording medium recording image transmitting program, and recording medium recording image receiving program | |
| EP0905977A3 (en) | Image synthesis apparatus | |
| TWI433057B (en) | Improved pre-alarm buffer | |
| WO2004086748A3 (en) | Systems and methods for multi-resolution image processing | |
| JP2008225600A (en) | Image display system, image transmission apparatus, image transmission method, image display apparatus, image display method, and program | |
| JP2011087253A (en) | Imaging apparatus, information processing device and information processing method | |
| CN115269718A (en) | Millisecond data synchronization device and method for photoelectric pod | |
| JP2000083239A (en) | Monitor system | |
| CN113711580A (en) | Video processing device, video processing method, and video processing program | |
| WO2006005129A1 (en) | Video compression apparatus and method | |
| US20060093224A1 (en) | Image capturing apparatus and image distributing system | |
| AU2005262282A1 (en) | Video compression apparatus and method | |
| JP4999559B2 (en) | Image processing apparatus, image processing method, program, and recording medium | |
| KR101104699B1 (en) | Apparatus and method for acquiring information of video content using a mobile terminal | |
| JP2019009615A (en) | Surveillance camera device, surveillance video distribution method, and surveillance system | |
| JPH0690444A (en) | Portrait transmission system | |
| KR20180135971A (en) | A terminal and method for setting a data protocol for a photographed image | |
| KR20120078980A (en) | Device and method for extracting direction information of image in wireless terminal | |
| JP2000244796A (en) | Camera system | |
| JP2007096412A (en) | Moving picture photographing apparatus and moving picture transmitting/receiving system | |
| US11012619B2 (en) | Transmission device, transmission method, reception device, reception method, and imaging device | |
| JP2007251532A (en) | Imaging apparatus and face area extraction method | |
| JP2006165666A (en) | Surveillance camera, surveillance image monitor device, surveillance image storage device, and surveillance system | |
| JPH0937230A (en) | Image monitor |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NG NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
|
| AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU LV MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
| WWE | Wipo information: entry into national phase |
Ref document number: 2005262282 Country of ref document: AU |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| WWW | Wipo information: withdrawn in national office |
Country of ref document: DE |
|
| ENP | Entry into the national phase |
Ref document number: 2005262282 Country of ref document: AU Date of ref document: 20050712 Kind code of ref document: A |
|
| WWP | Wipo information: published in national office |
Ref document number: 2005262282 Country of ref document: AU |
|
| 122 | Ep: pct application non-entry in european phase |