AU769655B2 - Automatic person meta-data labeller - Google Patents
Automatic person meta-data labeller Download PDFInfo
- Publication number
- AU769655B2 AU769655B2 AU57684/01A AU5768401A AU769655B2 AU 769655 B2 AU769655 B2 AU 769655B2 AU 57684/01 A AU57684/01 A AU 57684/01A AU 5768401 A AU5768401 A AU 5768401A AU 769655 B2 AU769655 B2 AU 769655B2
- Authority
- AU
- Australia
- Prior art keywords
- images
- colour
- image
- code
- selecting
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
- 238000000034 method Methods 0.000 claims description 89
- 238000004590 computer program Methods 0.000 claims description 18
- 230000001419 dependent effect Effects 0.000 claims description 13
- 230000015654 memory Effects 0.000 claims description 11
- 230000008569 process Effects 0.000 description 35
- 238000002372 labelling Methods 0.000 description 11
- 239000003086 colorant Substances 0.000 description 7
- 238000012545 processing Methods 0.000 description 6
- 230000011218 segmentation Effects 0.000 description 5
- 238000001514 detection method Methods 0.000 description 4
- 230000009471 action Effects 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 238000013459 approach Methods 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000001552 evening effect Effects 0.000 description 2
- 238000012805 post-processing Methods 0.000 description 2
- 239000004065 semiconductor Substances 0.000 description 2
- 238000003860 storage Methods 0.000 description 2
- 102100035353 Cyclin-dependent kinase 2-associated protein 1 Human genes 0.000 description 1
- 230000003190 augmentative effect Effects 0.000 description 1
- 238000007621 cluster analysis Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000012432 intermediate storage Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 235000021178 picnic Nutrition 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
Landscapes
- Image Analysis (AREA)
Description
S&FRef: 560356
AUSTRALIA
PATENTS ACT 1990 COMPLETE SPECIFICATION FOR A STANDARD PATENT
ORIGINAL
Name and Address of Applicant: Actual Inventor(s): Address for Service: Canon Kabushiki Kaisha 30-2, Shimomaruko 3-chome, Ohta-ku Tokyo 146 Japan Daniel John Lloyd-Jones Spruson Ferguson St Martins Tower,Level 31 Market Street Sydney NSW 2000 (CCN 3710000177) Automatic Person Meta-Data Labeller 0 0.
00% 0: *0 .00.
0 0 0 0 Invention Title: ASSOCIATED PROVISIONAL APPLICATION DETAILS [33] Country [31] Applic. No(s) AU PQ9216 [32] Application Date 04 Aug 2000 The following statement is a full description of this invention, including the best method of performing it known to me/us:- 5815c 1 -ii nl*i lll u*l .l irri r:i AUTOMATIC PERSON META-DATA LABELLER Technical Field of the Invention The present invention relates generally to the field of image post-processing and, in particular, to aspects of collation and sorting of images. The present invention relates to a method and apparatus for identifying a designated person in a plurality of images.
The invention also relates to a computer program product including a computer readable medium having recorded thereon a computer program for identifying a designated person in a plurality of images.
Background Art The advent of digital photography encourages users of digital cameras to take large numbers of photographs, this deriving, at least in part, from the fact that such photographs do not require traditional development, with its cost and inconvenience.
Digital images are, in contrast, available for use almost immediately, once the film diskette is suitably loaded into a personal computer or suitable display device.
It is, however, time consuming for the photographer to sort and label these S voluminous images, and consequently, the sorting and labelling process is often neglected. As a result, the photographer runs a risk of accumulating a growing number of images, many of which are not readily accessed because of the absence of a convenient method of labelling.
i0 0 20 Present methods of automatic identification using image processing are still unsatisfactory, and often cannot detect the presence of any person in an image, let alone a specific person. It is possible, for example, to detect the presence of people in a photograph using detection of skin colour via colour histograms, however this technique is non-specific, and relatively inaccurate.
0 S000 25 Disclosure of the Invention 0 It is an object of the present invention to substantially overcome, or at least ameliorate, one or more disadvantages of existing arrangements.
.oo$ 200701; 11:47 560356.doc i i~lI il~i Lr -2- According to a first aspect of the invention, there is provided a method of identifying, from a set of images, images in which a specified object is present, the method comprising the steps of: choosing, from the set of images, an indicative image in which the specified object is present; selecting a part of said object; searching the set of images for the part dependent upon the colour of the part; and tagging the images in the set in which the part is found.
According to another aspect of the invention, there is provided an apparatus for identifying, from a set of images, images in which a specified object is present, the apparatus comprising: means for selecting, from the set of images, an indicative image in which the specified object is present; means for selecting a part of said object; means for searching the set of images for the part dependent upon the colour of the part; and o: means for tagging the images in the set in which the part is found.
According to another aspect of the invention, there is provided an apparatus for 20 identifying, from a set of images, images in which a specified object is present, the apparatus comprising: a memory for storing a program; and a processor for executing the program, said program comprising: code for selecting, from the set of images, an indicative image in which the 25 specified object is present; code for selecting a part of said object; code for searching the set of images for the part dependent upon the colour of the part; and code for tagging the images in the set in which the part is found.
code for tagging the images in the set in which the part is found.
200701; 1 1:47 560356.doc 2a According to another aspect of the invention, there is provided a computer program which is configured to make a computer execute a procedure for identifying, from a set of images, images in which a specified object is present, the program comprising: code for selecting, from the set of images, an indicative image in which the specified object is present; code for selecting a part of said object; code for searching the set of images for the part dependent upon the colour of the part; and code for tagging the images in the set in which the part is found.
According to another aspect of the invention, there is provided a computer program product including a computer readable medium having recorded thereon a computer program which is configured to make a computer execute a procedure for identifying, from a set of images, images in which a specified object is present, the program comprising: code for selecting, from the set of images, an indicative image in which the specified object is present; code for selecting a part of said object; code for searching the set of images for the part dependent upon the colour of the 20 part; and :code for tagging the images in the set in which the part is found.
a 200701; 11:47 560356.doc I r m~r l r.ii x.~ri ur *ul ul4li -3- Brief Description of the Drawings A number of preferred embodiments of the present invention will now be described with reference to the drawings, in which: Fig. 1 depicts an illustrative image containing a number of people and other objects; Fig. 2 shows a flow process diagram for a preferred embodiment of the invention; Fig. 3 depicts one method of fashion accessory selection in accordance with the preferred embodiment; Fig. 4 provides more detail for the initial manual sub-process of Fig. 2; Fig. 5 shows the time-stamp grouping process of Fig. 4 in more detail; Fig. 6 depicts a seed/region growing technique in accordance with the preferred embodiment; Fig. 7 shows the automatic identification step discussed in relation to Fig. 2 in more detail; and Fig. 8 is a schematic block diagram of a general purpose computer upon which the preferred embodiment of the present invention can be practiced.
Detailed Description including Best Mode oooo :.,..Where reference is made in any one or more of the accompanying drawings to steps and/or features, which have the same reference numerals, those steps and/or features have for the purposes of this description the same function(s) or operation(s), unless the contrary intention appears.
V.-In the context of this specification, the word "comprising" means "including principally but not necessarily solely" or "having" or "including" and not "consisting only 25 of'. Variations of the word comprising, such as "comprise" and "comprises" have corresponding meanings. Furthermore, the terms "image", and "photograph" are used interchangeably, unless otherwise indicated, and the terms "clothing" and "fashion accessory" are also used interchangeably, unless a contrary intention is stated.
200701; 11:47 560356.doc *r I -i II -4- Fig. 1 shows a "stick-person" representation of a photograph to be processed in accordance with one disclosed arrangement. The photograph 100 depicts a number of people 102-108, and a tree 118. Each person 102 to 108 is wearing various articles of clothing and other fashion accessories, a distinctive fashion accessory or item of clothing for each being depicted as 110 to 116 respectively.
The disclosed arrangement is most effective if a fashion accessory 110 worn by a person 102 has a distinctive colour which is unique, ie. no one else in the set of photographs is wearing an accessory with the same colour. The disclosed arrangement is, however, still useful, even when the aforementioned condition is not strictly met.
Fig. 2 shows a flow chart representation of a labelling process 214 according to a disclosed arrangement. The labelling process 214 requires the user to designate, in a step 200, a desired person in an indicative image, whereafter the process 214 automatically identifies the presence of that person in the remaining images under consideration, or in a subset thereof. This approach is particularly applicable for automatic labelling of photographs from a particular event, such as a party. In Fig. 2, the labelling process 214 commences with an initial manual processing step 200, after which an automatic identification step 204 is performed. Thereafter, the labelling process 214 presents, to the user, a reduced sub-set of selected images in a step 208. Consequently the user, in a final :°oooo S. manual selection step 212, discards any remaining incorrectly identified photographs, thereby arriving at a set of photographs with the desired person 102 being present therein.
The aforementioned arrangement need not be 100% effective in order to be useful, since even a lesser degree of performance nonetheless reduces the amount of final manual processing which the user needs to perform. Incorrect image identification which may be due, for example, to incorrect colour matching, or multiple colour instances a 25 couch having the same colour as the selected clothing worn by the desired person), results in false matches, however the user is nonetheless able to consider a reduced sub-set of images in the step 212, and thereby more conveniently remove the incorrectly matched images.
200701; 11:47 560356.doc I r Fig. 3 shows an arrangement by means of which the initial manual selection step 200 described in relation to Fig. 2 is performed. In Fig. 3, the image 100 undergoing the labelling process 214 is shown on a display 302 of a PC 300, the displayed image being shown in more detail in an inset 304. The user makes use of a mouse 306 which is connected to the PC 300 by a connection 308, to point to a particular fashion accessory 110. The action of pointing, depicted by an arrow 310, involves positioning a cursor using the mouse 306, and clicking a mouse button 312. The fashion accessory 110 is discerned by an application running on the PC 300, in a manner described in more detail with reference to Fig. 6. By pointing to the particular fashion accessory 110, the user identifies the key element by which the disclosed arrangement is able to both search for the desired person 102 in the plurality of images being considered, and to label the images in which the desired person 102 is present.
Fig. 4 depicts the initial manual processing step 200 in more detail.
In a step 402, event meta-data, stored in a memory 806 (see Fig. 8) is utilised, using a processor 805 (see Fig. 8) in order to reduce an image search space to be considered by the automatic identification sub-process 204 (see Fig. The use of meta- .data ensures, for example, that only photographs taken during a defined period of time are considered by the labelling process 214, this being based on the hypothesis that it is 0 *unlikely that the desired person 102 wears the same fashion accessory 110 all the time. In other words, if the user wishes to identify the presence of the particular person 102 in a 0 large core set of images, the user needs to repeat the labelling process depicted in Fig. 2 for each "event" in the core set of images. For example, if the user has used the digital Vto. camera to take photographs of a wedding, a beach party, and a family picnic over the course of a weekend, then the photographer needs to conduct three labelling passes of the o 25 image data stored in the camera, in order to identify a particular person who may have been present at all three events. The photographer needs to identify a distinctive fashion accessory for each event. Digital cameras increasingly provide meta-data as part of their normal operation, this being useful in the aforementioned process 402. Although user 200701; 11:47 560356.doc i~w.ii~.niCYiiiil -6input, in addition to camera-provided meta-data, can be used to demarcate the boundaries of different events, automatic meta-data provision is preferable.
Whichever basis, ie. automatic or semi-automatic is used to provide the metadata in the step 402, a subsequent time-stamp grouping step 404 performs, using the processor 805, analysis of time-stamp meta-data for the photographs. This time-stamp information is used to collect sets of photographs, into groups of images on an event basis. Thereafter, in a decision step 406, a decision is made, by the processor 805, as to whether a manual selection of a particular fashion accessory by the user is required, or alternatively, whether an automatic search based on pre-loaded known accessories is to be used. If a manual selection is to be used, then the process 200 is directed in accordance with a "manual" arrow to a step 400, in which the user points to the distinctive fashion accessory which identifies the desired person as described in relation to Fig. 3. The action of pointing, depicted by the arrow 310, involves positioning a cursor using the mouse 306, and clicking the mouse button 312, thereby selecting the distinctive fashion accessory. If, on the other hand, a more automated search is to be performed, then the process 200 is directed in accordance with an "auto" arrow from the decision step 406 to a step 408 in which a menu of pre-loaded accessory colours, and possibly associated accessory shapes and sizes, is displayed to the user. In a following step 410, the user makes a selection from the menu, thereby defining one of the known accessories to be used to by the automatic identification step 204. Known accessories can include, for example, distinctive team jerseys such as are used in sports games, such jerseys being typically coloured in distinctive bright yellow, green or other colours. A particular "advantage provided by the automatic search mode is that once known accessory colours, and perhaps shapes and sizes, have been pre-loaded by the user, and stored in the memory 25 806 or 809, it is subsequently not necessary for the user to repeatedly provide, or select, sample characteristics upon which the subsequent search process is to be based.
Fig. 5 shows the time-stamp grouping process 404 in more detail, according one disclosed arrangement. Fig. 5 shows time-stamp grouping for a particular day, however the process can be extended to perform time-stamp grouping across an arbitrary time 200701; 11:47 560356.doc a 311 r iulr*r*ijii-li---ii 'i~ -7period. In a first step 502, a time-stamp of an image, stored in the memory 806 or 809, is read by the processor 805. In a following decision step 504, a check is made as to whether the time stamp falls within a period 10.00am to 6.00pm, which typically forms a normal working day during which time people tend to wear the same set of clothes. If the time-stamp does fall within this period, then the time-stamp process 404 is directed in accordance with a YES arrow to a grouping step 506, which groups all images falling within this time interval into a single "event". Thereafter, the time-stamp process 404 is directed back to the checking step 502 to consider a time-stamp for a next image. If, on the other hand, the present image time-stamp does not fall into the aforementioned time interval, the process 404 is directed in accordance with a NO arrow to a following decision step 508, where a time interval from 6.00pm to 8.00pm is considered, this being a "post work" to "pre-evening activity" time interval, during which people usually tend to change their clothes.
During this time interval, people typically go home from their places of employment, and may change clothes once or twice, before commencing evening activities. Accordingly, allowance is made for a number of different events during this time period, these events spanning, say, a period of one hour each. If an image timestamp falls within one of these periods, the time-stamp process 404 is directed to a grouping step 510, where time stamps, and associated images, occurring within a time interval of less than one hour duration are defined as belonging to the same event.
oooo Thereafter, the time-stamp process 404 is directed back to the checking step 502 for a next image. If, however, the time-stamp of the image being considered does not fall into the aforementioned time periods, the time-stamp process 404 is directed in accordance with the NO arrow to a decision block 512, which considers a late night time interval extending between 11.00pm and 6.00am. This time interval generally relates either to sleep, or to late night party activities. In both cases, people tend to wear the same set of clothing, or if they do change clothes say after a party, a three hour time interval is likely to capture most photographs, belonging to a single event. Accordingly, if a time-stamp of the photograph in question falls within this time period (ie 1 lpm to 6am), the time-stamp 200701; 11:47 560356.doc a a rr -8process 404 is directed to a grouping step 514, where images whose time-stamps fall within a period of less than three hours are grouped into the same event. Thereafter the time-stamp process 404 is directed back to the checking step 502. If an image time-stamp does not, however, fall into this time period, then the time-stamp process 404 is directed in accordance with a "NO" arrow to a final decision block 520, which considers all time periods not already accounted for in the previous decision blocks 504, 508 and 512. A time-stamp grouping period of I V 2 hours is considered, in a grouping step 518, to constitute an event in this case, whereafter the time-stamp process 404 is directed from the grouping step 518 back to the checking step 502. If the time-stamp of the photograph in question corresponds to a different day to the one being considered, the time-stamp process 404 is directed accordingly to a NO arrow 522 back to the checking step 502. It will be apparent, that the aforementioned time-stamp process can be augmented, and/or amended, in order to extend the use of meta-data for grouping images into events.
Furthermore, user interaction is also possible, without altering the fundamental feature of grouping images into events, in order to allow use of disclosed arrangement. It will be further apparent, that grouping of images into events can be performed using other methods. Thus, for example, meta-data can be grouped using cluster analysis applied to a see. frequency with which photographs are taken over a given time period.
Fig. 6 shows a seed/region growing technique (also called a colour segmentation technique) used in the automatic identification step 204 of Fig. 2. This technique is used to designate a colour area which forms the identifying attribute of the desired person being searched for, across the set of images by the processor 805. It is also used to reduce the computational requirements for comparison of images. It will be recalled from the description given in relation to Fig. 3, that in a "manual selection" arrangement, as described subsequently in relation to Fig. 4, the photographer pointed to a distinctive fashion accessory 110 in the indicative image 100. The aforementioned pointing is depicted in Fig. 6 by the arrow 3 10 pointing into the centre of the oval shaded area 600, which in the present example, forms a part of the fashion accessory 110 which has a particular distinctive colour 602. Taking a practical example, if the shaded area 600 is a 200701; 11:47 560356.doc -9shirt worn by Bob, and on the assumption that Bob is to be identified in the set of images under consideration, then the photographer points into the centre of the "shirt" 600, as depicted by the arrow 310. In the alternate "automatic selection" arrangement, as described in relation to Fig. 4, the distinctive colour 602 is provided as a result of the user selection from the menu of known accessory colours (see step 410 in Fig. 4) the preloaded attributes being stored in the memory 806 or 809. While the action of pointing and clicking using the mouse (see the description in relation to Fig. 3) or alternately, the process of user selection from the menu of pre-loaded accessories, may be sufficient to identify the fashion accessory 110, a seed/region growing technique is used in one arrangement in order to "fill" the oval shaded area 600. In the present example, when the photographer thus clicks on Bobs shirt, and assuming that the shirt is red, the photographer will notice on the display 302, that a red colour emanates from the clicked cursor and "fills" Bob's shirt. This provides a visual feedback to the photographer that the desired fashion accessory, and the appropriate colour, has in fact been selected.
Application of colour segmentation to the selected fashion accessory avoids a problem which might otherwise arise if, for example, the particular pixel selected by the mouse 6@9* point-and-click happens to be of a different colour, which is speckled in small quantities throughout the shirt. Seed/region growing techniques are available for identifying a 0@ 9 *o*o region having substantially a uniform colour, these techniques being based on selection of a seed pixel of the specified colour and subsequent comparison thereof to neighbouring pixels. Growing techniques utilize a principle whereby a coordinate, in colour space, 9~*t 9 oo. associated with a neighbouring pixel is tested against a corresponding coordinate of the "reference" pixel which has been pointed to, and a distance between the two points in colour space is used as a criterion for deciding whether the two pixels are substantially of 25 the same colour. The use of LAB colour space is recommended, in contrast for example to a Red, Green, Blue (RGB) colour space, since the LAB space provides a more linear relationship when calculating distances between pixel coordinates in colour space. In particular, a criterion used in relation to LAB colour space for determining a similarity between pixels is described mathematically as follows: 200701; 11:47 560356.doc
/(L
2
-L
2
+(A
2 2
+(B
2
-B)
2 1 [1] where subscripts indicate colour coordinates for the reference pixel, and "2" subscripts represent colour coordinates for a pixel being tested. Equation indicates that if a Euclidean distance between coordinates of the two pixels being considered is less than or equal to one, then the colour of the two pixels is considered to be the same.
Once a reference colour region 602 is defined for the selected accessory 110 of the indicative image 100 using the seed growing technique, the same technique is applied to each of the plurality of images of interest in the event, ie. 606, 608 610 where these images 606, 608, 610 constitute a set of images which have been grouped according to the time-stamp grouping process 404 (see Figs. 4 and It is noted, however, that whereas in the present arrangement the colour segmentation (ie. the seed/region growing techniques) is applied to the indicative image by the user pointing a mouse, the technique is then automatically applied to each of the plurality of images, without requiring user intervention. Automatic colour segmentation can be performed, for example, by placing seeds in different locations of an image, growing regions around the seeds, placing further seeds where required if the segmentation process "misses" certain regions, and continuing in this manner until the entire image is segmented. After the application of the seed/region growing technique, the image 606 comprises a number of different regions, eg. 612, and similarly, images 608 and 610 comprise a number of different regions.
Application of the seed/region growing technique transforms images which typically have i a very large number of different colours to images with a significantly reduced number of colour regions, typically 12 to 15. These "distinct colours" are used in a comparison or searching step 702 (see Fig. performed by the processor 805, when searching for the designated fashion accessory. Fig. 6 shows that the image 610 contains a region 614 which has the same colour as the region 600, this being depicted by use of the same crosshatching pattern for the regions 614 and 600.
200701; 11:47 560356.doc -11- Fig. 7 shows the automatic identification step 204, which is performed by the processor 805, and discussed in relation to Fig. 2 in more detail. In a step 700, and considering a single event, colour regions in each of the images of the selected set are subjected to the seed/region growing technique as previously described in relation to Fig.
6. The indicative image 100 selected by the user in the pointing step 400 (see Fig. 4) is also subjected to the seed/region growing process. In a following step 702, the selected region 600 in the designated image is compared, using the processor 805, with each of the regions eg. 612 in the other images in the set. Thereafter, in a step 704, an image 610 in the event set which has a region of the same colour, ie. 614, is categorised or tagged as containing the desired fashion accessory, and accordingly, as possibly containing the desired person. From a practical perspective, the image 610 is labelled as containing the desired fashion accessory, and accordingly, as containing the desired person whose presence is being sought. As noted, application of the seed/region growing technique to typical consumer photographs produces processed images having, typically, fifteen or so different colour regions. One advantage of using the seed/region growing technique as a precursor to performing colour comparisons, for example using the LAB Euclidean distance measure described in Equation is that the number of regions to be compared is reduced to a tractable level. This allows for rapid and convenient processing, by the processor 805, in contrast to the situation which would prevail if images were compared on a pixel by pixel basis, which would require a number of computations of a different, and greater, order than that proposed in the disclosed arrangement. In effect, the seed/region growing technique transforms images associated with the particular event into images containing only "distinct" colour regions. The original photographs are, of course, retained in the memory 806 or 809, and the "transformed" image only maintained temporarily for use in the automatic identification step 204 (see Fig. Accordingly, the "transformed" image is typically determined as part of the automatic identification process, and is not, in general, stored for later use. If, however, the user wishes to search the set of photographs for a different person, for example, then the transformed images 200701; 11:47 560356.doc -12may advantageously be stored, in the memory 806 or 809, for use in that search, rather than again needing to perform the transformation using the processor 805.
Although the previous description described colour comparisons using a Euclidean distance applied to LAB colour space, the comparison can be performed in other ways. For example, fuzzy logic can be used to compensate for subtle differences between images due to differences in lighting and colour conditions. Furthermore, other factors such as size and shape of colour regions can also be incorporated into the comparison step 702. In this latter case, a larger region of colour would have a higher confidence of being a correct match when compared to a reference colour region, than would a colour region of only a few pixels in size. Furthermore, selection of multiple colour reference regions can be used, making use of the same basic mechanism as previously described with reference to a single colour area. This can be arranged, for instance, by selecting a dominant colour target, in addition to one or more secondary colour targets, whose presence is not required for initial matching process, but which can S 15 be used as secondary indicators of a correct match.
The aforementioned automatic identification sub-process 204 can be refined by inclusion of skin colour detection as an additional consideration. The use of skin colour detection in conjunction with colour detection using the specified colour of the selected fashion accessory 110 will typically improve the accuracy of the embodiment, since this ensures that a photograph actually contains a person, as well as the identified colour, since the particular colour selected may also, in a given image, attach to a couch.
The aforementioned arrangements are most effective when applied in relation to images taken at a single event such as a party. In such a case, people are, generally, o• wearing the same clothes and fashion accessories for the duration of the event, and hence the entire set of images can be searched using the same set of defined fashion accessories.
Furthermore, the arrangements are most effective when people are wearing distinctive clothes, since this makes the automatic identification sub-process 204 more effective.
One approach to improving the effectiveness of the aforementioned arrangement is to provide people at an event with a small colour sticker, or label, having a distinct and 200701; 11:47 560356.doc 13unique colour for each person. This would improve the effectiveness of the described arrangement, and the sticker could be removed by automatic post-processing from the final images.
Turning to Fig. 8, the method of Figs. 2, 4 5, and 7 are preferably practiced using a conventional general-purpose computer system 800, such as that shown in Fig. 8 wherein the processes of Figs. 2, 4, 5 and 7 can be implemented as software, such as an application program executing within the computer system 800. In particular, the steps of identifying a person in one or more images are effected by instructions in the software that are carried out by the computer. The software can be divided into two separate parts, one part for carrying out the method of identifying a person in one or more images, and another part to manage the user interface between the former and the user. The software can be stored in a computer readable medium, including the storage devices described below, for example. The software is loaded into the computer from the computer readable medium, and then executed by the computer. A computer readable medium having such software or computer program recorded on it is a computer program product.
The use of the computer program product in the computer preferably effects an advantageous apparatus for identifying a person in one or more images in accordance with the disclosed arrangements.
The computer system 800 comprises a computer module 801, input devices such .oool as a keyboard 802 and mouse 803, output devices including a printer 815 and a display V device 814. A Modulator-Demodulator (Modem) transceiver device 816 is used by the computer module 801 for communicating to and from a communications network 820, for example connectable via a telephone line 821 or other functional medium. The modem 816 can be used to obtain access to the Internet, and other network systems, such as a Local Area Network (LAN) or a Wide Area Network (WAN).
The computer module 801 typically includes at least one processor unit 805, a memory unit 806, for example formed from semiconductor random access memory (RAM) and read only memory (ROM), input/output interfaces including a video interface807, and an I/O interface813 for the keyboard802 and mouse 803 and 200701; 11:47 560356.doc 3 e~ -14optionally a joystick (not illustrated), and an interface 808 for the modem 816. A storage device 809 is provided and typically includes a hard disk drive 810 and a floppy disk drive 811. A magnetic tape drive (not illustrated) can also be used. A CD-ROM drive 812 is typically provided as a non-volatile source of data. The components 805 to 813 of the computer module 801, typically communicate via an interconnected bus 804 and in a manner which results in a conventional mode of operation of the computer system 800 known to those in the relevant art. Examples of computers on which the embodiments can be practised include IBM-PC's and compatibles, Sun Sparcstations or alike computer systems evolved therefrom.
Typically, the application program of the disclosed arrangement is resident on the hard disk drive 810 and read and controlled in its execution by the processor 805.
Intermediate storage of the program and any data fetched from the network 820 can be accomplished using the semiconductor memory 806, possibly in concert with the hard :i disk drive 810. In some instances, the application program can be supplied to the user 15 encoded on a CD-ROM or floppy disk and read via the corresponding drive 812 or 811, oO.o or alternatively can be read by the user from the network 820 via the modem device 816.
Still further, the software can also be loaded into the computer system 800 from other computer readable medium including magnetic tape, a ROM or integrated circuit, a magneto-optical disk, a radio or infra-red transmission channel between the computer module 801 and another device, a computer readable card such as a PCMCIA card, and ooo# the Intemrnet and Intranets including email transmissions and information recorded on websites and the like. The foregoing is merely exemplary of relevant computer readable mediums. Other computer readable mediums can be practiced without departing from the scope and spirit of the invention.
The method of identifying a person in one or more images can alternatively be implemented in dedicated hardware such as one or more integrated circuits performing the functions or sub functions of identifying a person in one or more images. Such dedicated hardware can include graphic processors, digital signal processors, or one or more microprocessors and associated memories.
200701; 11:47 560356.doc I Ir a Industrial Applicability It is apparent from the above that the embodiment(s) of the invention are applicable to the image processing, consumer and professional photography industries.
The foregoing describes only some embodiments of the present invention, and modifications and/or changes can be made thereto without departing from the scope and spirit of the invention, the arrangements being illustrative and not restrictive. Thus, for example, although the description has been couched in terms of identifying a person in one or more images, the arrangements are equally applicable to identifying a particular object in the one or more images, where the particular object has component parts which have distinctive colours, shapes, sizes and so on.
200701; 11:47 560356.doc 1
Claims (4)
1. A method of identifying, from a set of images, images in which a specified object is present, the method comprising the steps of: selecting, from the set of images, an indicative image in which the specified object is present; selecting a part of said object; searching the set of images for the part dependent upon the colour of the part; and tagging the images in the set in which the part is found.
2. A method according to claim 1, wherein said object is a person, and said part of said object is a fashion accessory worn by the person.
3. A method according to claim 1, wherein said selecting step comprises manual provision of designation information, in relation to the indicative image, said designation information identifying said part of the object. 0%oo
4. A method according to claim 1, wherein prior to said selecting step the 20 method comprises defining said part of the object as an item in a menu, and wherein said !'"""selecting step comprises selecting said part from the menu. A method according to claim 1, wherein: the selecting step comprises an additional sub-step selecting another part of said object having another colour; searching step comprises an additional sub-step of searching said set of images for said other part of said object as well as for said part of said object; and the tagging step comprises tagging images if both said part and said other part are found therein. 200701; 11:47
560356.doc nn iClu~L~rr*T~iUi-1- -17- 6. A method according to claim 1, wherein: the searching step comprises an additional sub-step of searching said set of images for human skin colour as well as for said part of said image; and the tagging step comprises tagging the images if both said part and said skin colour are found therein. 7. A method according to claim 1, comprising, prior to the choosing step, additional steps of: deriving meta-data for a core set of images; grouping said core set into one or more event image sets dependent upon said meta-data; and designating the set of images from said one or more event image sets. 8. A method according to claim 7, wherein said meta-data comprises time stamps associated with the images in the core set, and said grouping step comprises, in relation to an image in said core set, a sub-step of: assigning said image to an event if an associated time stamp falls within a predetermined event time interval. 20 9. A method according to claim 7, wherein said meta-data comprises time stamps for the images in the core set, and said grouping step comprises, in relation to an image in said core set of images, a sub-step of: assigning said image to an event if an associated time stamp can be clustered with time stamps associated with other images in said event. A method according to claim 1, wherein said searching step comprises sub-steps of: colour segmenting images in the set of images into at least one colour region; e 00 determining for each image in the set whether said colour matches said at least o one colour region; and 200701; 11:47 560356.doc I -18- flagging a presence of said object if a match is found. 11. A method according to claim 10, wherein: said part, in addition to the colour, has at least one of a distinctive size attribute and a distinctive shape attribute; said determining step comprises at least a further sub-step of: determining whether a size of said region matches the distinctive size; and determining whether a shape of said region matches the distinctive shape;and said flagging step comprises flagging a presence of said object if at least one of a colour match, a size match, and a shape match is found. 12. An apparatus for identifying, from a set of images, images in which a specified object is present, the apparatus comprising: means for selecting, from the set of images, an indicative image in which the specified object is present; means for selecting a part of said object; means for searching the set of images for the part dependent upon the colour of S 20 the part; and means for tagging the images in the set in which the part is found. 13. An apparatus according to claim 12, further comprising: means for manual provision of designation information, in relation to the indicative image, to thereby identify said part of the object. !ii 14. An apparatus according to claim 12, further comprising: .means for defining said part of the object as an item in a menu; and 00.0 0 &means for selecting said part from the menu. oo 200701; 11:47 560356.doc ~nr I l-;iiil-iiU -19- An apparatus according to claim 12, further comprising: means for deriving meta-data for a core set of images; means for grouping said core set into one or more event image sets dependent upon said meta-data; and means for choosing the set of images from said one or more event image sets. 16. An apparatus according to claim 12, further comprising: means for colour segmenting images in the set of images into at least one colour region; means for determining whether said colour matches said at least one colour region; and means for flagging a presence of said object if a match is found. 17. An apparatus for identifying, from a set of images, images in which a specified object is present, the apparatus comprising: a memory for storing a program; and a processor for executing the program, said program comprising: code for selecting, from the set of images, an indicative image in which the specified object is present; 20 code for selecting a part of said object; oooo ""code for searching the set of images for the part dependent upon the colour of the part; and code for tagging the images in the set in which the part is found. 18. A computer program which is configured to make a computer execute a procedure for identifying, from a set of images, images in which a specified object is present, the program comprising: code for selecting, from the set of images, an indicative image in which the specified object is present; code for selecting a part of said object; 200701; 11:47 560356.doc code for searching the set of images for the part dependent upon the colour of the part; and code for tagging the images in the set in which the part is found. 19. A computer program according to claim 18, further comprising: code for manual provision of designation information, in relation to an indicative image, to thereby identify said part of the object. A computer program according to claim 18, further comprising: code for defining said part of the object as an item in a menu; and code for selecting said part from the menu. 21. A computer program according to claim 18, further comprising: code for deriving meta-data for a core set of images; code for grouping said core set into one or more event image sets dependent upon said meta-data; and code for choosing a desired image set, comprising an indicative image and at least one target image, from said one or more event image sets. o o oooo 20 22. A computer program according to claim 18, further comprising: ~code for colour segmenting images in the set of images into at least one colour region; code for determining whether said colour matches said at least one colour region; and 25 code for flagging a presence of said object if a match is found. 23. A computer program product including a computer readable medium having recorded thereon a computer program which is configured to make a computer execute a procedure for identifying, from a set of images, images in which a specified object is present, the program comprising: 200701; 11:47 560356.doc -21 code for selecting, from the set of images, an indicative image in which the specified object is present; code for selecting a part of said object; code for searching the set of images for the part dependent upon the colour of the part; and code for tagging the images in the set in which the part is found. 24. A method of identifying a person in an image substantially as described herein with reference to the accompanying drawings. An apparatus adapted to identify a person in an image substantially as described herein with reference to the accompanying drawings. 26. A computer program which is configured to make a computer execute a procedure to identify a person in an image substantially as described herein with reference to the accompanying drawings. 27. A computer program product including a computer readable medium OG** having recorded thereon a computer program which is configured to make a computer S, 20 execute a procedure to identify a person in an image substantially as described herein o 0* with reference to the accompanying drawings. DATED this seventeenth Day of November, 2003 Canon Kabushiki Kaisha Patent Attorneys for the Applicant SPRUSON FERGUSON O S 0 o• •o 200701; 11:47 560356.doc
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AU57684/01A AU769655B2 (en) | 2000-08-04 | 2001-07-27 | Automatic person meta-data labeller |
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AUPQ9216A AUPQ921600A0 (en) | 2000-08-04 | 2000-08-04 | Automatic person meta-data labeller |
| AUPQ9216 | 2000-08-04 | ||
| AU57684/01A AU769655B2 (en) | 2000-08-04 | 2001-07-27 | Automatic person meta-data labeller |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| AU5768401A AU5768401A (en) | 2002-02-07 |
| AU769655B2 true AU769655B2 (en) | 2004-01-29 |
Family
ID=25631713
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| AU57684/01A Ceased AU769655B2 (en) | 2000-08-04 | 2001-07-27 | Automatic person meta-data labeller |
Country Status (1)
| Country | Link |
|---|---|
| AU (1) | AU769655B2 (en) |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO1994028490A1 (en) * | 1993-05-21 | 1994-12-08 | Imageware Software, Inc. | Image editing system including masking capability |
| US5606821A (en) * | 1994-07-25 | 1997-03-04 | Loral Corporation | Smart weed recognition/classification system |
| JP2000341512A (en) * | 1999-05-27 | 2000-12-08 | Matsushita Electric Ind Co Ltd | Image reading device |
-
2001
- 2001-07-27 AU AU57684/01A patent/AU769655B2/en not_active Ceased
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO1994028490A1 (en) * | 1993-05-21 | 1994-12-08 | Imageware Software, Inc. | Image editing system including masking capability |
| US5606821A (en) * | 1994-07-25 | 1997-03-04 | Loral Corporation | Smart weed recognition/classification system |
| JP2000341512A (en) * | 1999-05-27 | 2000-12-08 | Matsushita Electric Ind Co Ltd | Image reading device |
Also Published As
| Publication number | Publication date |
|---|---|
| AU5768401A (en) | 2002-02-07 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN101510205B (en) | Method for implementing automatically clustering photographs, apparatus and system | |
| US8170343B2 (en) | Method and system for searching images with figures and recording medium storing metadata of image | |
| CN109189991A (en) | Repeat video frequency identifying method, device, terminal and computer readable storage medium | |
| CN107408212A (en) | System and method for identifying the unwanted photo being stored in equipment | |
| WO2017000716A2 (en) | Image management method and device, and terminal device | |
| CN108256443A (en) | A kind of personnel positioning method, system and terminal device | |
| US9798741B2 (en) | Interactive image selection method | |
| US20120195499A1 (en) | Color description analysis device, color description analysis method, and color description analysis program | |
| CN106250431B (en) | A kind of Color Feature Extraction Method and costume retrieval system based on classification clothes | |
| US7054481B2 (en) | Automatic person meta-data labeller | |
| CN102332034A (en) | Portrait picture retrieval method and device | |
| CN102150163A (en) | Interactive image selection method | |
| CN102804178A (en) | Detecting significant events in consumer image collections | |
| CN105320710B (en) | The vehicle retrieval method and device of resisting illumination variation | |
| WO2014168177A1 (en) | Image processing device | |
| CN108830820A (en) | Electronic device, image-pickup method and computer readable storage medium | |
| Suh et al. | Semi-automatic image annotation using event and torso identification | |
| Luo et al. | Psychophysical study of image orientation perception | |
| US20070070217A1 (en) | Image analysis apparatus and image analysis program storage medium | |
| AU769655B2 (en) | Automatic person meta-data labeller | |
| US11762906B2 (en) | Modality mapping for visual search | |
| CN113096772A (en) | Intelligent diet weight assessment system based on camera shooting technology | |
| CN110457998A (en) | Image data correlating method and equipment, data processing equipment and medium | |
| Shao et al. | Semi-automatic crowdsourcing tool for online food image collection and annotation | |
| Othman et al. | Quantization selection of colour histogram bins to categorize the colour appearance of landscape paintings for image retrieval |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| MK1 | Application lapsed section 142(2)(a) - no request for examination in relevant period | ||
| DA3 | Amendments made section 104 |
Free format text: THE NATURE OF THE AMENDMENT IS: SUBSTITUTE PATENT REQUEST REGARDING ASSOCIATED DETAILS |
|
| FGA | Letters patent sealed or granted (standard patent) |