US20070201750A1 - Image processing method, apparatus, and computer readable recording medium including program therefor - Google Patents

Image processing method, apparatus, and computer readable recording medium including program therefor

Info

Publication number
US20070201750A1
Authority
US
United States
Prior art keywords
image
face
model
representing
recovery
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
US11/709,708
Other versions
US7885477B2 (en)
Inventor
Wataru Ito
Hideki Yamagishi
Yuanzhong Li
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujifilm Corp
Original Assignee
Fujifilm Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujifilm Corp filed Critical Fujifilm Corp
Assigned to FUJIFILM CORPORATION. Assignment of assignors interest (see document for details). Assignors: LI, YUANZHONG; ITO, WATARU; YAMAGISHI, HIDEKI
Publication of US20070201750A1
Application granted
Publication of US7885477B2
Status: Active
Adjusted expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00 Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10 Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16 Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168 Feature extraction; Face representation
    • G06V40/169 Holistic features and representations, i.e. based on the facial image taken as a whole
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/74 Image or video pattern matching; Proximity measures in feature spaces
    • G06V10/75 Organisation of the matching processes, e.g. simultaneous or sequential comparisons of image or video features; Coarse-fine approaches, e.g. multi-scale approaches; using context analysis; Selection of dictionaries
    • G06V10/755 Deformable models or variational models, e.g. snakes or active contours
    • G06V10/7557 Deformable models or variational models, e.g. snakes or active contours based on appearance, e.g. active appearance models [AAM]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00 Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10 Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/14 Vascular patterns

Definitions

  • the present invention relates to an image processing method and apparatus for recovering a missing element of a predetermined structure, such as a face in an image or the like. It also relates to a computer readable recording medium including a program for causing a computer to perform the method.
  • a photograph may be obtained with the main subject partially shaded, depending on the imaging environment or the appearance of the subject. For example, if a person, the main subject, wearing a cap is imaged, the eyebrows or the eyes of the person may be shaded by the rim of the cap, and the eyebrows or the eyes of the person in the recorded image may be collapsed to dark. If a person, the main subject, is imaged over a net, the net pattern is superimposed on the face of the person in the recorded image.
  • an excessively high contrast between the subject and the background may cause halation in the high signal region, or collapse in the low signal region, of the image.
  • if the subject is dark or outside the autofocus area, the resultant image may be blurred, that is, an image missing high-frequency components may be obtained.
  • a method for correcting digital images obtained by digital cameras or those obtained by scanning photograph prints by scanners is proposed as described, for example, in Japanese Unexamined Patent Publication No. 8(1996)-065519.
  • the center of a defective area of an image is specified by the user, and a predetermined surrounding region including the defective area is specified as a removal region.
  • an interpolation region required for interpolating the removal region is specified in the surrounding area of the removal region. Then, the interpolation region is extended gradually and linearly toward the defective area to replace the pixels within the removal region with the pixels of the interpolation region, and thereby the defective area is removed and corrected.
  • the aforementioned method is an effective image correction method when a defective area (removal region) is readily replaced by the surrounding region (interpolation region), such as the region of the sky or the ocean. But when an image component with a unique shape, such as an eye, a nose, a mouth, or the like, is masked by a shield or has halation or collapse therein, it is impossible for the method to recover the masked component, since no identical component exists in the surrounding region.
  • the present invention has been developed in view of the circumstances described above, and it is an object of the present invention to provide an image processing method and apparatus capable of accurately recovering various types of defects in an image. It is a further object of the present invention to provide a computer readable recording medium including a program therefor.
  • An image processing method of the present invention comprises the step of applying a predetermined structure in an input image, representing that an element of the structure is missing, to a model other than the missing element to recover the missing element of the structure in the input image, the model representing the predetermined structure by a statistical characteristic amount obtained by performing predetermined statistical processing on a plurality of images representing the predetermined structure including the recovery target element.
  • An image processing apparatus of the present invention is an apparatus for performing the image processing described above. That is, the apparatus comprises:
  • a model representing a predetermined structure by a statistical characteristic amount obtained by performing predetermined statistical processing on a plurality of images representing the predetermined structure including a recovery target element; and
  • a recovery means for applying the predetermined structure in an input image, representing that the element of the predetermined structure is missing, to the model other than the missing element to recover the missing element of the structure in the input image.
  • a computer readable recording medium of the present invention is a recording medium including a program for causing a computer to perform the image processing method described above.
  • the features of the image processing method, apparatus, and program therefor are that a recovery target element of a predetermined structure does not exist in an input image but exists only in the model, and the recovery is performed by correlating the predetermined structure with the model other than the recovery target element and estimating the recovery target element based on the correlation.
  • an AAM (Active Appearance Models) scheme may be employed.
  • the AAM scheme is one of the approaches for trying to interpret image contents based on a model. For example, if a face is the interpretation target object, a mathematical model of a face is generated by performing a principal component analysis on the shapes of the face portions of a plurality of learning images, or on the luminance data after normalization of the shapes. Then, the face portion of a new input image is represented by each of the principal components and a weighting parameter thereof to reconstruct the face image.
  • the method is described in a non-patent literature “Active Appearance Models” by T. F. Cootes, et al., In Proc. 5th European Conference on Computer Vision, Springer, Germany, 1998, vol. 2, pp. 484-498 (reference document 1).
  • the “predetermined structure” is a structure appropriate for modeling, i.e., a structure whose variation in the shape or luminance in the image falls within a certain range.
  • a structure from which a statistical characteristic amount having a higher level of explanatory power for the shape or luminance thereof may be derived by performing statistical processing thereon.
  • One of the specific examples is a human face.
  • the recovery target element i.e., the missing element in an input image may include a portion of the region of a predetermined structure, a portion of color signals representing the predetermined structure, a portion of spatial frequency components representing the predetermined structure, the number of gradations representing the predetermined structure, or the like.
  • the loss of a portion of the region of the predetermined structure may include a portion of the face being collapsed to dark due to the shadow of the rim of the cap of the person, a portion of the face being masked by a shield standing between the face and the imaging device, or the like. Further, it may also include the case where a signal of at least one channel of a portion of the region of the predetermined structure is lost due to a compression or transmission error of the imaging device. Still further, a face without glasses may be deemed as the loss of the glasses.
  • the process for recovering a missing part of the region of the predetermined structure includes the addition (recovery) of appendages (ornaments) to the structure.
  • the loss of a portion of color signals representing the predetermined structure may include the loss of one of R, G, and B signals due to a compression or transmission error of the imaging device, or halation in high signal level component and collapse in low signal level component. Further, it may include the case where a monochrome image is deemed as the loss of color data and reconstructed as a color image.
  • the color space is not limited to RGB, and YCC or the like may also be used. Preferably, however, the process is performed on RGB signals, since the halation in a high signal level component is more likely to occur in R signal.
  • the loss of a portion of spatial frequency components representing the predetermined structure may include the loss of a high frequency component such as an edge. Further, a face without pimples, wrinkles, or the like may be deemed as the loss of spatial frequency components representing the pimples, wrinkles, or the like.
  • the loss of the number of gradations means the state of having a smaller number of gradations, and an image with a smaller number of gradations is reconstructed as an image with a greater number of gradations by the image processing of the present invention.
  • the recovery target element may include the aforementioned elements mixed together. Specific examples include the case where a non-makeup face is deemed as the loss of makeup and reconstructed as a made-up face, and the case where an image obtained by a low performance camera, such as a cell phone camera, is reconstructed like an image obtained by a high performance camera.
  • An image representing the predetermined structure including the recovery target element may be an image of the predetermined structure that includes the recovery target element obtained by actual imaging, i.e., an image obtained by imaging the structure without any missing element, or an image obtained by artificially performing correction on an image not including the recovery target element to recover the target element.
  • As the “predetermined statistical processing”, a dimension compression processing capable of representing the structure by compressing it to a statistical amount with a number of dimensions less than the number of pixels representing the structure is preferable.
  • a specific example may be a multivariate statistical technique, such as the principal component analysis or the like. If the principal component analysis is performed as the “predetermined statistical processing”, the “statistical characteristic amount” means a plurality of principal components obtained by the principal component analysis.
  • the level of the explanatory power described above means that a higher order principal component has a higher level of explanatory power, and a lower order principal component has a lower level of explanatory power, taking the case where the predetermined statistical processing is a principal component analysis as an example.
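  • As an illustration of this dimension compression (not a method prescribed by the patent), the following sketch uses scikit-learn's PCA on hypothetical flattened face images; all array sizes and the component count are assumptions chosen for the example.

```python
# Illustrative sketch only: principal component analysis as dimension compression.
import numpy as np
from sklearn.decomposition import PCA

# each row: one sample image of the structure, flattened to pixel luminances
faces = np.random.rand(200, 10_000)    # 200 hypothetical samples, 10,000 pixels each

pca = PCA(n_components=50)             # far fewer dimensions than pixels
weights = pca.fit_transform(faces)     # weighting factors = statistical characteristic amounts

# a face is now described by 50 weights instead of 10,000 pixels;
# pca.mean_ plays the role of the average face, pca.components_ the principal components
reconstructed = pca.inverse_transform(weights[:1])
```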
  • the “(predetermined) structure in an input image” may be detected either automatically or manually. Further, the present invention may further include a process for detecting the structure in an input image, or the region of the structure detected from an input image in advance may be used as the input for applying to the model.
  • the referent of “applying a predetermined structure in an input image to a model other than the missing element” as used herein means arithmetic operations or the like for representing the structure in the image by the model. More specifically, it means to obtain a weighting parameter value for weighting each of the principal components in the mathematical model to represent the structure in the input image by the mathematical model, taking the case where the AAM is used as an example.
  • an arrangement may be made in which a plurality of models of the present invention is provided, each for each attribute of the predetermined structure; information of the attribute of the structure in the input image is obtained; a process (means) for selecting the model according to the obtained attribute is additionally provided; and the predetermined structure in the input image is applied to the selected model.
  • the referent of “attribute” as used herein means, for example, the gender, age, or race, if the predetermined structure is a human face. Further, it may be information for identifying an individual. In this case, the model for each attribute means the model for each individual.
  • Specific methods for obtaining the “attribute” may include any known recognition process for images as described, for example, in Japanese Unexamined Patent Publication No. 11 (1999)-175724, or a method of estimating/obtaining it from the information appended to the image such as GPS information or the like.
  • the image processing method of the present invention may be used for reconstruction of a photograph for funeral, i.e., the photograph of a deceased person, reconstruction of a face image inputted for matching in face authentication, as well as reconstruction of general photograph images.
  • a missing element of a predetermined structure in an image may be recovered completely by applying the structure to a model representing the structure including the missing element as the recovery target element. Accordingly, various different types of defects in an image may be recovered accurately and easily without requiring any operational skills.
  • FIG. 1 schematically illustrates a hardware configuration of a digital photograph printer according to an embodiment of the present invention.
  • FIG. 2 is a block diagram of the digital photograph printer and digital camera according to the embodiment of the present invention, illustrating the functions and process flow thereof.
  • FIGS. 3A and 3B illustrate example screens displayed on the display of the digital photograph printer and digital camera according to the embodiment of the present invention.
  • FIG. 4 illustrates an example menu screen for selecting details of an image recovery/addition process according to an embodiment of the present invention.
  • FIG. 5 is a block diagram illustrating in detail of the image recovery/addition process according to an embodiment of the present invention.
  • FIG. 6 is a flowchart illustrating a process flow for generating a mathematical model of a face image according to the present invention.
  • FIG. 7 illustrates example characteristic points set on a face.
  • FIG. 8 schematically illustrates the variation in face shapes when the value of the weighting factor for a unique vector of a principal component obtained by a principal component analysis for a face shape is varied.
  • FIG. 9 illustrates face shapes in sample images transformed into the average face shape, indicating the luminance thereof under the average face shape.
  • FIG. 10 schematically illustrates the variation in face luminance when the value of the weighting factor on a unique vector of a principal component obtained by principal component analysis for face luminance is varied.
  • FIGS. 11A to 11D illustrate how an input image is varied in a stepwise manner by the image recovery/addition process of the present invention.
  • FIG. 12 is a block diagram illustrating an evolutionary embodiment of the image recovery/addition process according to an embodiment of the present invention.
  • FIG. 13 schematically illustrates the configuration of a digital camera according to another embodiment of the present invention.
  • FIG. 1 schematically illustrates a hardware configuration of a digital photograph printer according to an embodiment of the present invention.
  • the digital photograph printer includes: a film scanner 51 ; a flat head scanner 52 ; a media drive 53 ; a network adapter 54 ; a display 55 ; a keyboard 56 ; a mouse 57 ; a hard disk 58 ; and a photograph print output unit 59 , all of which are connected to an arithmetic/control unit 50 .
  • the arithmetic/control unit 50 controls flows of image input, correction, retouching, and output, in addition to performing arithmetic operations in image processing for correction and retouching in collaboration with a CPU, a main memory, various output interfaces within the unit by executing a program installed from a storage medium, such as a CD-ROM or the like.
  • the restoration process of the present invention is performed in the unit.
  • the film scanner 51 photoelectrically scans an APS negative film or a 135 negative film developed by a developing machine (not shown), and obtains digital image data P 0 representing a photograph image recorded on the negative film.
  • the flat head scanner 52 photoelectrically scans a photograph image on a hard copy, such as an L-size photograph print or the like, and obtains digital image data P 0 .
  • the media drive 53 obtains image data P 0 representing a photograph image recorded on a recording medium, such as a memory card, CD, DVD, or the like.
  • the media drive 53 may also record output image data P 2 on these recording media.
  • image data of an image obtained by, for example, a digital camera is recorded on the memory card.
  • image data obtained by the film scanner 51 in the previous print order are recorded on the CD, DVD, and the like.
  • the network adapter 54 obtains image data P 0 from an order acceptance machine (not shown) of a known network photograph service system.
  • the image data P 0 are image data based on a photograph print order placed by a user, and sent to the order acceptance machine from a personal computer of the user through the Internet.
  • the image data P 0 may also be those sent from the photograph print order acceptance machine installed in front of a lab shop.
  • the display 55 is a display for displaying operation screens for input, correction, retouching, and output of images in the digital photograph printer according to the present embodiment, and a menu for selecting an operation, a processing target image, or the like is displayed.
  • the keyboard 56 and the mouse 57 are used for selecting a specific process.
  • the hard disk 58 includes a control program of the digital photograph printer. In addition, it temporarily stores image data P 0 obtained by the film scanner 51 , flat head scanner 52 , media drive 53 , or network adapter 54 , corrected image data P 1 , and retouched image data P 2 (output image data).
  • the photograph print output unit 59 performs laser scan exposure of photographic paper based on the image data P 2 representing an output image, development, and drying. It also performs back printing, print-by-print paper cutting, and sorting for each order.
  • the printing method may be a laser exposure—thermal development—dye transfer process, or the like.
  • FIG. 2 is a block diagram of the digital photograph printer illustrating the functions and process flow thereof.
  • the digital photograph printer includes: an image input means 1 for inputting image data of a print target image; an image correction means 2 for performing image processing on the image data P 0 inputted thereto based on predetermined image processing conditions to automatically correct image quality of the image represented by the image data P 0 (hereinafter, image data and the image represented by the image data are denoted by the same reference symbol); an image retouching means 3 for performing image processing on the automatically corrected image data P 1 inputted thereto based on an instruction from the operator; and an image output means 4 for outputting a photograph print based on the retouched image data P 2 or outputting the image data P 2 to a recording medium.
  • the image correction means 2 performs white balance, contrast, and sharpness corrections, noise reduction/removal, and the like.
  • the image retouching means 3 performs image retouching, such as manual correction of the processed results of the image correction means 2 , trimming, enlargement/reduction, sepiatization, monochrome conversion, and combining with a decorated frame, as well as the image recovery according to the present invention.
  • image data P 0 are inputted through the image input means 1 .
  • when printing from a developed film, the operator sets the film on the film scanner 51 ; when outputting image data recorded on a recording medium, such as a memory card or the like, for printing or the like, the operator sets the recording medium on the media drive 53 .
  • a screen for selecting input source of the image data is displayed on the display 55 , and the operator selects the input source using the keyboard 56 or the mouse 57 .
  • the film set on the film scanner 51 is photoelectrically scanned by the film scanner 51 , and the scanning result is subjected to digital conversion to generate image data P 0 , which are then sent to the arithmetic/control unit 50 .
  • if the input source is a hard copy original, such as a photograph print or the like, the hard copy original set on the flat head scanner 52 is photoelectrically scanned by the flat head scanner 52 , and the scanning result is subjected to digital conversion to generate image data P 0 , which are then sent to the arithmetic/control unit 50 .
  • image data P 0 recorded on the recording medium, such as a memory card or the like, set on the media drive 53 are read by the arithmetic/control unit 50 . If the input source is a photograph print order acceptance machine of a network photograph service system or a photograph print order acceptance machine installed in front of a photograph shop, image data P 0 is received by the arithmetic/control unit 50 through the network adapter 54 . The image data P 0 obtained in the manner as described above are temporarily stored in the hard disk 58 .
  • the automatic image quality correction is performed on the image P 0 by the image correction means 2 . More specifically, known white balance, contrast, and sharpness corrections, noise reduction/removal, and the like are performed by the image processing program executed in the arithmetic/control unit 50 based on the setup conditions preset in the digital photograph printer, and processed image data P 1 are outputted.
  • the outputted image data P 1 are temporarily stored in the memory of the arithmetic/control unit 50 . Note that the image data P 1 may be temporarily stored in the hard disk 58 .
  • FIG. 3A illustrates an example screen displayed on the display 55 .
  • the operator confirms thumbnail images and selects an image that requires a manual image quality correction or an image with an image retouching order using the mouse 57 or the keyboard 56 (in FIG. 3A , an upper left image DSCF0001 is selected).
  • when an image is selected, as in the example screen shown in FIG. 3B , the selected thumbnail image is displayed on the display 55 in an enlarged form, in addition to the buttons for selecting a manual correction operation or a retouching operation.
  • the operator selects an intended operation from the displayed buttons using the mouse 57 or the keyboard 56 , and further sets a detail of the selected operation as required.
  • if the image recovery/addition operation according to the present invention is selected, a menu screen for selecting a detail of the recovery/addition process is displayed as shown in FIG. 4 .
  • the image retouching means 3 performs image processing according to the selected operation, and outputs processed image data P 2 .
  • the outputted image data P 2 are stored in the memory of the arithmetic/control unit 50 . Note that the image data P 2 may be temporarily stored in the hard disk 58 .
  • the image display on the display 55 by the image retouching means 3 , acceptance of input through the mouse 57 or the keyboard 56 , image processing for manual correction or retouching, and the like are controlled by the program executed in the arithmetic/control unit 50 .
  • the image P 2 is outputted by the image output means 4 .
  • the arithmetic/control unit 50 causes the display 55 to display a screen for selecting an output destination. The operator selects a desired output destination using the mouse 57 or the keyboard 56 , and the image data P 2 are sent to the selected destination by the arithmetic/control unit 50 . If a photograph printing is performed, the image data P 2 are sent to the photograph print output unit 59 , and the image data P 2 are outputted as a photograph print. If the output destination is a recording medium, such as a CD or the like, the image data P 2 are recorded on the CD or the like set on the media drive 53 .
  • the image recovery/addition process according to the present invention which is performed when “recovery/addition” is selected from the screen shown in FIG. 3B , and “Remove Cap Shadow” is selected from the screen shown in FIG. 4 , will now be described in detail.
  • the process for “Remove Cap Shadow” is a process for recovering a portion of an image around the eyes of the subject person being collapsed to dark due to the shadow of the cap of the person as shown in FIG. 3B (hereinafter referred to as “eye area” or simply “eyes”).
  • FIG. 5 is a block diagram illustrating in detail of the image recovery/addition process.
  • the image recovery/addition process is realized by: a face detection section 31 for detecting a face portion P 1 f of an image P 1 ; an image recovery/addition section 32 for recovering the eye area of the detected face portion P 1 f by applying the detected face portion P 1 f to a mathematical model M generated by the AAM scheme (refer to reference document 1) based on a plurality of sample images representing human face portions including the eyes, i.e., the recovery target; and an image reconstruction section 33 for reconstructing an image P 1 f ′ of the face portion with recovered eyes based on a parameter C (to be described later) corresponding to the face portion P 1 f obtained by applying the face portion P 1 f to the mathematical model, fitting the image P 1 f ′ into the face portion of the image P 1 , and merging it therein, thereby generating an image P 2 with the recovered eye area of the image P 1 .
  • the process steps are controlled by a program installed on the arithmetic/control unit 50 .
  • the mathematical model M is generated based on the flowchart shown in FIG. 6 , and preinstalled on the arithmetic/control unit 50 together with the aforementioned program. A generation process of the mathematical model M will now be described.
  • characteristic points representing a face shape are set on each of a plurality of face images (sample images) that includes the eye area, i.e., the recovery target, as shown in FIG. 7 (step # 1 ).
  • the number of characteristic points is 122 ( FIG. 7 indicates only 60 characteristic points for clarity).
  • Location of each of the characteristic points is predefined like, for example, the first characteristic point is at the left end of the left eye, the 38 th characteristic point is at the midpoint between the eyebrows, and the like.
  • Each characteristic point may be set either manually or automatically through a recognition process. It may be corrected manually as required after automatic setting.
  • an average face shape is calculated based on the characteristic points set on each of the sample images (step # 2 ). More specifically, average values of position coordinates of the characteristic points representing the same region on the respective sample images are obtained.
  • any face shape may be approximated by the following formula (1).
  • S is the shape vector on which the position coordinates of each characteristic point of a face shape are arranged (x 1 , y 1 , . . . , x 122 , y 122 ); S 0 is the average face shape vector on which the position coordinates of each characteristic point on the average face shape are arranged; p i is a unique vector (eigenvector) representing the i th principal component of the face shape obtained by the principal component analysis; and b i is a weighting factor for each unique vector p i .
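  • Formula (1) itself is not reproduced in this text. From the definitions above it corresponds to the standard AAM shape model, which may be written (a reconstruction consistent with the description, not a verbatim copy of the patent figure) as:

```latex
S = S_0 + \sum_{i} b_i \, p_i \qquad (1)
```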
  • FIG. 8 schematically illustrates how the face shape varies when the values of the weighting factors b 1 and b 2 for the unique vectors p 1 and p 2 of the top two principal components obtained by the principal component analysis are varied.
  • the variation width is from −3sd to +3sd, which is based on the standard deviation sd of the values of the weighting factors b 1 and b 2 when the face shape of each sample image is represented by the formula (1) above.
  • the face shape in the center of the three face shapes in each principal component is the average face shape. This example shows that a component contributing to the contour shape of the face is derived as the first principal component as a result of the principal component analysis, and the face shape varies from an elongated face (−3sd) to a round face (+3sd) by varying the weighting factor b 1 .
  • components contributing to open/close states of the mouth and jaw length are derived as the second principal component, and the face shape varies from a face with long jaw and open mouth (−3sd) to a face with short jaw and closed mouth (+3sd) by varying the weighting factor b 2 .
  • a principal component with a smaller value of i has stronger explanatory power for the shape. That is, it makes a greater contribution to the face shape.
  • each of the sample images is transformed (warped) to the average face shape obtained in step # 2 (step # 4 ). More specifically, the amount of shift for each of the characteristic points between each of the sample images and the average face shape is calculated, and based on the calculated amount of shift, the amount of shift of each of the sample images to the average face shape on a pixel basis is calculated by the following formulae (2) to (5) (two dimensional quintic polynomial), and each of the sample images is warped to the average face shape on a pixel basis.
  • x and y are the coordinates of each characteristic point in each sample image; x′ and y′ are coordinates on the average face shape to be warped; ⁇ x and ⁇ y are amounts of shift to the average shape; n is the order; and a ij and b ij are coefficients.
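  • Formulae (2) to (5) are likewise not reproduced in this text. A plausible reconstruction consistent with the definitions above (the exact grouping into the four numbered formulae is an assumption) is:

```latex
x' = x + \Delta x \qquad (2) \\
y' = y + \Delta y \qquad (3) \\
\Delta x = \sum_{i=0}^{n} \sum_{j=0}^{n-i} a_{ij}\, x^{i} y^{j} \qquad (4) \\
\Delta y = \sum_{i=0}^{n} \sum_{j=0}^{n-i} b_{ij}\, x^{i} y^{j} \qquad (5)
```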
  • the coefficients for the polynomial approximation are obtained by the least squares method.
  • when the coordinate after warping is not an integer value, the pixel value thereof is obtained from the four adjacent pixels by first order approximation. That is, the pixel value is distributed to the four surrounding pixels in proportion to the distance between the warped coordinate and each of those pixels.
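  • The distribution onto the four surrounding pixels can be sketched as below; this is a minimal illustration of first order (bilinear) weight splatting, with function and array names chosen for the example rather than taken from the patent.

```python
import numpy as np

def splat_pixel(dst, weight, x, y, value):
    """Distribute a pixel value warped to the non-integer position (x, y) onto
    the four surrounding integer pixels with first order (bilinear) weights."""
    x0, y0 = int(np.floor(x)), int(np.floor(y))
    fx, fy = x - x0, y - y0
    for dx, wx in ((0, 1.0 - fx), (1, fx)):
        for dy, wy in ((0, 1.0 - fy), (1, fy)):
            xi, yi = x0 + dx, y0 + dy
            if 0 <= yi < dst.shape[0] and 0 <= xi < dst.shape[1]:
                dst[yi, xi] += wx * wy * value   # accumulate weighted luminance
                weight[yi, xi] += wx * wy        # accumulate weights for later normalization

# after splatting every source pixel: dst /= np.maximum(weight, 1e-8)
```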
  • FIG. 9 illustrates the face shapes of three sample images transformed into the average face shape, indicating the luminance thereof under the average face shape.
  • then, a principal component analysis is performed on the luminance of each of the sample images after transformation into the average face shape (step # 5 ).
  • the luminance of any face image under the average face shape may be approximated by the following formula (6).
  • A is a luminance vector on which the luminance of each pixel under the average face shape is arranged (a 1 , . . . , a m ), where m is the total number of pixels on the average face shape;
  • a 0 is an average face luminance vector on which the average luminance value of each pixel of each sample image in the average face shape is arranged;
  • q i is a unique vector representing the i th principal component of the face luminance obtained by the principal component analysis; and
  • λ i is a weighting factor for each unique vector q i .
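  • Formula (6) is not reproduced in this text. From the definitions above it corresponds to the standard AAM luminance (texture) model, reconstructed here for readability:

```latex
A = A_0 + \sum_{i} \lambda_i \, q_i \qquad (6)
```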
  • the variation width is from −3sd to +3sd, which is based on the standard deviation sd of the values of the weighting factors λ i1 and λ i2 when the face luminance of each sample image is represented by the formula (6) above.
  • This example shows that a component contributing to the existence or nonexistence of beard is derived as the i 1 th principal component as a result of the principal component analysis, and the face luminance varies from a face with dense beard (−3sd) to a face without beard (+3sd) by varying the weighting factor λ i1 .
  • likewise, a component contributing to the shadow on the face is derived as the i 2 th principal component, and the face luminance varies from a face with shadows on the right side (−3sd) to a face with shadows on the left side (+3sd) by varying the weighting factor λ i2 .
  • a principal component with a smaller value of i has stronger explanatory power for the luminance. That is, it makes a greater contribution to the face luminance.
  • the mathematical model M of a face is generated through steps # 1 to # 5 described above. That is, the mathematical model M is represented by a plurality of unique vectors p i representing face shapes and unique vectors q i representing face luminance under the average face shape, and is dimension compressed in that the total number of each type of unique vectors is far smaller than the number of pixels forming the face image.
  • in the reference document 1, a mathematical model of a face image with 23 unique vectors for the face shape and 114 unique vectors for the face luminance is generated by setting 122 characteristic points for an image formed of 10,000 pixels and performing the aforementioned process, and it is described that 90% of the variations in the shape and luminance may be represented by varying the weighting factor for each unique vector.
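  • The model generation of steps # 1 to # 5 can be sketched roughly as follows (a simplified illustration, not the patented procedure: landmark setting and warping are assumed to have been done already, the 90% figure is used only as a retained-variance target, and all names are hypothetical).

```python
import numpy as np
from sklearn.decomposition import PCA

def build_face_model(shapes: np.ndarray, textures: np.ndarray):
    """Build a simple AAM-style model: average plus principal components for
    face shape and for face luminance under the average face shape.

    shapes   -- (n_samples, 244): 122 characteristic points (x, y) per sample image
    textures -- (n_samples, n_pixels): luminance of each sample warped to the average shape
    """
    shape_pca = PCA(n_components=0.90).fit(shapes)      # keep ~90% of the shape variation
    texture_pca = PCA(n_components=0.90).fit(textures)  # keep ~90% of the luminance variation
    return {
        "s0": shape_pca.mean_,   "p": shape_pca.components_,    # average shape, unique vectors p_i
        "a0": texture_pca.mean_, "q": texture_pca.components_,  # average luminance, unique vectors q_i
        "b_sd":   np.sqrt(shape_pca.explained_variance_),       # sd of weighting factors b_i
        "lam_sd": np.sqrt(texture_pca.explained_variance_),     # sd of weighting factors lambda_i
    }
```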
  • the face detection section 31 reads in the image data P 1 , and detects the face portion P 1 f ( FIG. 11B ) of the image P 1 ( FIG. 11A ). More specifically, the following method as described in Japanese Unexamined Patent Publication No. 2005-108195 (reference document 2) may be applicable. That is, a determination is made whether a face candidate region is present in the image P 1 by inputting a first characteristic amount indicating the direction of a gradient vector representing the direction and magnitude of an edge at each pixel to a plurality of first discriminators (to be described later). If a face candidate region is present, the region is extracted, and the magnitude of the gradient vector at each pixel on the extracted region is normalized. Then, a second characteristic amount indicating the direction and magnitude of the normalized gradient vector is inputted to a plurality of second discriminators (to be described later) to determine whether the extracted region is a true face region, and if so determined, the region is detected as the face portion P 1 f .
  • the first/second discriminators are generated through a learning process of a machine learning technique, such as AdaBoost or the like, which uses the first/second characteristic amounts calculated for a plurality of images known to be of faces and a plurality of images known to not be of faces, which are sample images for learning, as input.
  • in the present embodiment, images in which the eye areas are collapsed to dark due to a cap shadow are used as the sample images for learning with the AdaBoost machine learning technique.
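  • For illustration only (this is not the method of reference document 2), a boosted discriminator of the general kind described above could be trained as sketched below; the feature layout, sample counts, and labels are placeholders.

```python
import numpy as np
from sklearn.ensemble import AdaBoostClassifier

# X: characteristic amounts (e.g. gradient direction/magnitude features) computed from
#    learning sample patches; y: 1 for face samples (including faces whose eye areas are
#    collapsed to dark by a cap shadow), 0 for non-face samples.  Placeholder data only.
rng = np.random.default_rng(0)
X = rng.random((1000, 64))
y = rng.integers(0, 2, size=1000)

discriminator = AdaBoostClassifier(n_estimators=200).fit(X, y)

# at detection time, the same features are computed for each candidate region and classified:
# is_face = discriminator.predict(candidate_features)
```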
  • Various different known methods may be used for detecting the face portion P 1 f , which may include: a method that uses a correlation score between a unique face expression and the image itself as described in U.S. Patent Application Publication No. 20040170323 (reference document 3), knowledge database, characteristics extraction, flesh color detection, template matching, graph matching, statistical methods (neural network, SVM, HMM), and the like.
  • the face portion P 1 f may be manually specified using the mouse 57 or the keyboard 56 , or an automatic detection result may be manually corrected.
  • the image recovery/addition section 32 applies the face portion P 1 f to the mathematical model M. More specifically, it reconstructs the image based on the formulae (1) and (6) described above by varying the values of weighting factors for the unique vectors p i and q i from the top principal components in formulae (1) and (6) and obtains weighting factors b i and ⁇ i (collectively referred to as “parameter C”) when the difference between the reconstructed image and the face portion P 1 f is minimal (refer to the reference document 3 for detail).
  • it is preferable that the range of the values of the weighting factors b i and λ i be based on the standard deviations sd of the distributions of b i and λ i obtained when the sample images were represented by the formulae (1) and (6) at the time of generating the model, and be limited to, for example, from −3sd to +3sd; if a value does not fall in the range, the average value of the distribution is used instead. This may prevent an erroneous application of the model.
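  • A much simplified sketch of this fitting step is given below: instead of the iterative AAM search of reference documents 1 and 3, it simply projects a shape-normalized face onto the luminance model and applies the ±3sd rule described above (out-of-range factors are replaced with the distribution average, which is approximately zero for PCA weights). The model dictionary keys follow the hypothetical builder sketched earlier.

```python
import numpy as np

def fit_luminance_weights(face_pixels, model, limit=3.0):
    """Estimate luminance weighting factors (part of "parameter C") for a face
    already warped to the average face shape, applying the +/- limit*sd rule."""
    diff = face_pixels - model["a0"]
    lam = model["q"] @ diff                              # least-squares weights for orthonormal q_i
    out_of_range = np.abs(lam) > limit * model["lam_sd"]
    lam[out_of_range] = 0.0                              # fall back to the average of the distribution
    reconstruction = model["a0"] + model["q"].T @ lam    # recovered luminance under the average shape
    return lam, reconstruction
```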
  • the image reconstruction section 33 reconstructs a face image P 1 f ′ ( FIG. 11C ) by substituting the parameter C into the formulae (1) and (6), and further generates an image P 2 ( FIG. 11D ) by replacing the face portion P 1 f of the image P 1 with the reconstructed face image P 1 f ′ and merging it into the image P 1 , and outputs the image data P 2 .
  • the image recovery/addition section 32 applies the face portion P 1 f detected by the face detection section 31 from the image P 1 with the eye area thereof being collapsed to dark due to the shadow of the cap of the subject to the mathematical model M generated through the AAM scheme based on a plurality of sample images representing faces including the eye area.
  • the image reconstruction section 33 reconstructs the image P 1 f ′ recovered in the eye area based on the parameter C corresponding to the face portion P 1 f obtained by applying the face portion P 1 f to the mathematical model M, and further generates the image P 2 by fitting the face portion P 1 f ′ into the face portion of the image P 1 and merging therein.
  • the mathematical model M is generated from sample images representing the face portion including the eye area, so that the eye area, being collapsed to dark in the image P 1 , may be depicted in the model M. Accordingly, even if the eye area of the face portion of the input image P 1 is collapsed to dark, an image obtained by applying the face portion P 1 f of the input image P 1 to the model M is recovered in the eye area. In this way, according to the image recovery/addition process of the present invention, the eye area collapsed to dark due to the shadow of the cap of the subject may be recovered automatically without requiring manual operations. Consequently, the image P 2 with extremely high recovery accuracy may be obtained easily without requiring any operational skills.
  • the image recovery/addition process may be realized without modifying the configuration by appropriately changing the type of sample images for generating the mathematical model M, and of learning sample images used in the learning process for generating the discriminators in the face detection section 31 as required.
  • for example, to recover a face masked by a net pattern, it is only necessary for the image recovery/addition section 32 to apply a face portion P 1 f with a net pattern superimposed thereon, detected by the face detection section 31 from an image P 1 obtained by imaging the face over a net fence, to the mathematical model M generated through the AAM scheme based on a plurality of sample images without net patterns superimposed thereon; for the image reconstruction section 33 to reconstruct a face image P 1 f ′ with the net pattern portion recovered, based on the parameter C corresponding to the face portion P 1 f obtained by applying the face portion P 1 f to the mathematical model M ; and further to generate an image P 2 by fitting the image P 1 f ′ into the face portion of the image P 1 and merging therein.
  • a color face image may be produced from a monochrome or sepia face image.
  • it is also possible to process a face image of a person without any pimples to add some pimples thereon, which may be called a pimple recovery process.
  • in this case, face images with pimples are used as the sample images for generating the mathematical model M , and
  • face images without pimples are used as the sample images in the learning process for generating the discriminators in the face detection section 31 .
  • it is then only necessary for the image recovery/addition section 32 to apply a face portion P 1 f without pimples, detected by the face detection section 31 from an image P 1 , to the mathematical model M generated through the AAM scheme based on a plurality of sample images with pimples; for the image reconstruction section 33 to reconstruct a face image P 1 f ′ with pimples added (recovered) thereon, based on the parameter C corresponding to the face portion P 1 f obtained by applying the face portion P 1 f to the mathematical model M ; and further to generate an image P 2 by fitting the image P 1 f ′ into the face portion of the image P 1 and merging therein.
  • more generally, when some other element is to be added, face images that include the element to be added are used as the sample images for generating the mathematical model M , and face images that do not include the element to be added are used as the sample images in the learning process for generating the discriminators in the face detection section 31 , in the same manner as described above.
  • the image recovery/addition section 32 may apply a face portion P 1 f with a smaller number of gradations, detected by the face detection section 31 from an image P 1 , to the mathematical model M generated through the AAM scheme based on a plurality of sample images with a greater number of gradations; the image reconstruction section 33 may reconstruct a face image P 1 f ′ with a greater number of gradations based on the parameter C corresponding to the face portion P 1 f obtained by applying the face portion P 1 f to the mathematical model M , and further may generate an image P 2 by fitting the image P 1 f ′ into the face portion of the image P 1 and merging therein. This may be called a recovery process for the number of gradations.
  • similarly, the image recovery/addition section 32 may apply a blurry face portion P 1 f , detected by the face detection section 31 from an image P 1 , to the mathematical model M generated through the AAM scheme based on a plurality of non-blurry sample images, and
  • the image reconstruction section 33 may reconstruct a non-blurry face image P 1 f ′ based on the parameter C corresponding to the face portion P 1 f obtained by applying the face portion P 1 f to the mathematical model M , and further may generate an image P 2 by fitting the image P 1 f ′ into the face portion of the image P 1 and merging therein. This may be called a recovery process for the high frequency components of the image.
  • a menu item is provided for each recovery/addition target element as in the menu shown in FIG. 4 .
  • if face images including a plurality of recovery/addition target elements are used as the sample images for generating the mathematical model M , and face images that do not include these recovery/addition target elements are used as the sample images in the learning process for generating the discriminators in the face detection section 31 , a comprehensive recovery/addition process may be realized.
  • FIG. 12 is a block diagram illustrating the image recovery/addition process in this case. As illustrated, the embodiment differs from the embodiment shown in FIG. 5 in that it further includes: an attribute obtaining section 34 for obtaining attribute information A k of the subject of an image P 1 ; and a model selection section 35 for selecting a mathematical model M k generated from sample images of subjects having the attribute included in the obtained attribute information A k .
  • each of the plurality of mathematical models is generated, for example, from a sample image group representing subjects of the same race, age, gender, or the like based on the aforementioned method ( FIG. 6 ), and stored with attribute information A i that indicates a common attribute in the sample image group being associated therewith.
  • the attribute obtaining section 34 may be a section for obtaining the attribute information by determining the attribute of the subject of the image P 1 through a known recognition process, for example, the process described in Japanese Unexamined Patent Publication No. 11 (1999)-175724, or a section for obtaining the attribute of the subject recorded in the header of the image P 1 as appended information at the time of imaging. Further, it may have a function to estimate the attribute of the subject based on the accompanying information. For example, if GPS information for imaged location is available, the country or region corresponding to the GPS information may be identified. Noting that the race of the subject may be estimated from the GPS information to a certain degree, a reference table for associating the GPS information with race information may be provided in advance.
  • the GPS information recorded in the header region of the image P 1 is obtained, and the race of the subject may be estimated by referring to the reference table based on the obtained GPS information.
  • the model selection section 35 obtains the mathematical model M k associated with the attribute information A k obtained by the attribute obtaining section 34 .
  • the image recovery/addition section 32 applies the face portion P 1 f of the image P 1 to the mathematical model M k .
  • where the model selection section 35 selects a mathematical model M k associated with the attribute A k obtained by the attribute obtaining section 34 , and the image recovery/addition section 32 applies the face portion P 1 f to the selected mathematical model M k , no unique vector for explaining the variation in the face shape or luminance arising from the difference in the attributes A k is present in the mathematical model M k , so that the face portion P 1 f may be represented based only on the unique vectors representing the other factors that determine the face shape or luminance. This may improve the processing accuracy, and a higher quality image may be obtained.
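  • The attribute-based selection can be illustrated with a small lookup (the attribute keys and model placeholders below are assumptions for the example, not part of the patent):

```python
# hypothetical illustration of the attribute obtaining / model selection sections;
# each value stands in for a mathematical model M_k built from sample images
# sharing the corresponding attribute
models = {
    ("female", "20s"): "model_female_20s",
    ("male", "60s"):   "model_male_60s",
    "default":         "model_default",
}

def select_model(attribute_info):
    """Return the mathematical model M_k associated with attribute information A_k."""
    return models.get(attribute_info, models["default"])

model_k = select_model(("female", "20s"))   # attribute estimated e.g. from header or GPS data
```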
  • from the viewpoint of the processing accuracy, it is preferable that the attribute-based mathematical models be further specialized, and that individual-based mathematical models be generated. In this case, it is necessary to associate the images P 1 with individuals.
  • in the embodiments described above, the mathematical model is preinstalled on the digital photograph printer. However, from the viewpoint of improving the processing accuracy, it is also preferable to provide race-based mathematical models in advance, and to select the mathematical model according to the shipping country or the region of the printer.
  • the function to generate the mathematical models may also be implemented in the digital photograph printer. More specifically, it is only necessary to install a program for performing the process explained based on the flowchart shown in FIG. 6 on the arithmetic/control unit 50 . It is also possible to install a default mathematical model at the time of shipping, and then to customize the mathematical model, or to generate a new model different from the default mathematical model, using images inputted to the digital photograph printer. This is particularly effective when individual-based mathematical models are generated.
  • in the embodiments described above, face images are represented using separate weighting factors b i and λ i for the face shape and luminance, respectively.
  • however, an appearance parameter “c”, a new parameter for controlling both the face shape and luminance, may be obtained by performing a principal component analysis on a vector (b 1 , b 2 , . . . , b i , . . . , λ 1 , λ 2 , . . . ) obtained by combining the weighting factors b i and λ i , as shown in the following formulae (7) and (8).
  • here, the variance in the shape from the average face shape is represented by the appearance parameter “c” and the vector Q S , and the variance in the luminance from the average face luminance is represented by the appearance parameter “c” and the vector Q A .
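  • Formulae (7) and (8) are not reproduced in this text. Based on the description, they correspond to the standard AAM combined appearance model, reconstructed here (an assumption consistent with the definitions of Q S and Q A ):

```latex
S = S_0 + Q_S\, c \qquad (7) \\
A = A_0 + Q_A\, c \qquad (8)
```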
  • if this model is used, the image recovery/addition section 32 obtains the face luminance under the average face shape based on the formula (8) above by varying the value of the appearance parameter “c”, further reconstructs a face image by transforming the average face shape based on the formula (7) above, and obtains the appearance parameter “c” when the difference between the reconstructed image and the face portion P 1 f becomes minimal.
  • the image recovery/addition process according to the present invention may also be implemented in a digital camera. FIG. 13 schematically illustrates the configuration of such a digital camera.
  • the digital camera includes: an imaging section 71 , formed of a lens, an aperture diaphragm, a shutter, a CCD, and the like, for imaging a subject; an A/D conversion section 72 for obtaining digital image data P 0 by digitizing the analog signals of the charges stored in the CCD of the imaging section 71 ; an image processing section 73 for performing various image processing on the image data P 0 ; a compression/expansion section 74 for performing compression on image data to be recorded on a memory card, or performing expansion on compressed image data from a memory card; a strobe section 75 , formed of a strobe light and the like, for strobe emission; an operation section 76 , formed of various operation buttons and the like, for specifying imaging conditions, image processing conditions, and the like; a media recording section 77 , which is an interface for writing image data to and reading image data from a memory card; a display section 78 , formed of a liquid crystal display and the like; a control section 70 for controlling each of the sections; and an internal memory 79 for storing programs and data such as the mathematical model M .
  • each means shown in FIG. 2 is realized by the following sections under the control of the control section 70 using the internal memory 79 : the image input means 1 by the imaging section 71 and the A/D conversion section 72 ; the image correction means 2 by the image processing section 73 ; the image retouching means 3 by the image processing section 73 , the operation section 76 , and the display section 78 ; and the image output means 4 by the media recording section 77 .
  • the imaging section 71 focuses light, representing a subject inputted to the lens through a full depression of the shutter by the photographer, on the photoelectric surface of the CCD, which is photoelectrically converted and outputted as analog image signals, and the A/D conversion section 72 converts the outputted analog image signals to digital image signals and outputs the digital image signals as digital image data P 0 , thereby the function of the image input means 1 is realized.
  • the image processing section 73 performs automatic white balance correction, white balance correction according to the type of the light source, gradation correction, density correction, color correction, sharpness correction, and the like, and outputs processed image data P 1 , thereby the function of the image correction means 2 is realized.
  • the image P 1 is displayed on the liquid crystal display of the display section 78 .
  • the display layout may be like that shown in FIG. 3A in which a plurality of images in thumbnail format is displayed.
  • the photographer selects a retouching target image and displays it in an enlarged form using the operation buttons of the operation section 76 , and further performs manual image correction or retouching through menu selections.
  • when the image recovery/addition process according to the present invention is selected, the control section 70 activates an image recovery/addition program stored in the internal memory 79 , causes the image processing section 73 to perform the image recovery/addition process ( FIG. 5 , or the like) using the mathematical model M stored in the internal memory 79 in advance, and outputs processed image data P 2 .
  • in this way, the function of the image retouching means 3 is realized.
  • the compression/expansion section 74 performs compression on the image data P 2 based on a compression format, such as JPEG or the like, and records the compressed image data on a memory card loaded in the digital camera through the media recording section 77 , thereby the function of the image output section 4 is realized.
  • a configuration may be adopted in which the image recovery/addition process is also performed on the image data stored in a memory card. More specifically, the image data stored in the memory card are expanded (decompressed) by the compression/expansion section 74 , and the image represented by the expanded image data is displayed on the liquid crystal display of the display section 78 . Then, a desired process is selected by the photographer in the same manner as described above, and the selected process is performed by the image processing section 73 .
  • the attribute-based mathematical models explained in FIG. 12 and the like, or the mathematical model generation process explained in FIG. 6 may be implemented in a digital camera.
  • the persons who become the subjects of photographs taken by a specific digital camera are often limited to a certain extent. Therefore, if individual-based mathematical models are generated for the persons who often become the subjects of photographs taken by the digital camera, models without variations in the face due to individual differences may be generated. Consequently, the recovery/addition process for the faces of these persons may be performed with extremely high accuracy.
  • a program for causing a personal computer or the like to perform the image recovery/addition process of the present invention may be incorporated in retouching software.
  • This allows the user to install the software on a personal computer or the like from a recording medium, such as a CD-ROM or the like, including the software, or from a predetermined website by downloading the software therefrom.
  • the user may use the image recovery/addition process of the present invention as one of the image editing/retouching schemes in its own personal computer.

Abstract

A face portion of an input image, an example of a predetermined structure, is applied to a mathematical model by the image recovery/addition section to recover a missing element of the face portion in the input image. The mathematical model is generated by a predetermined statistical method, such as the AAM scheme or the like, based on a plurality of sample images representing the face portion including the recovery target element. Thereafter, the face portion is reconstructed to include the missing element based on the parameter corresponding to the face portion obtained by applying the face portion to the model, and the face portion of the input image is replaced by the reconstructed face portion to produce a restored image by the image reconstruction section.

Description

    BACKGROUND OF THE INVENTION
  • 1. Field of the Invention
  • The present invention relates to an image processing method and apparatus for recovering a missing element of a predetermined structure, such as a face in an image or the like. It also relates to a computer readable recording medium including a program for causing a computer to perform the method.
  • 2. Description of the Related Art
  • When taking photographs, there may be a case in which a photograph is obtained with the main subject partially shaded, depending on the imaging environment or the appearance of the subject. For example, if a person, the main subject, wearing a cap is imaged, the eyebrows or the eyes of the person may be shaded by the rim of the cap, and the eyebrows or the eyes of the person in the recorded image may be collapsed to dark. If a person, the main subject, is imaged over a net, the net pattern is superimposed on the face of the person in the recorded image.
  • Further, an excessively high contrast between the subject and the background may cause halation in the high signal region, or collapse in low signal region of the image.
  • Still further, if the subject is dark, or outside the autofocus area, the resultant image may be blurred, that is, an image missing its high-frequency components may be obtained.
  • Meanwhile, a method for correcting digital images obtained by digital cameras, or those obtained by scanning photograph prints with scanners, has been proposed as described, for example, in Japanese Unexamined Patent Publication No. 8(1996)-065519. In the method, the center of a defective area of an image is specified by the user, and a predetermined surrounding region including the defective area is specified as a removal region. In addition, an interpolation region required for interpolating the removal region is specified in the surrounding area of the removal region. Then, the interpolation region is extended gradually and linearly toward the defective area to replace the pixels within the removal region with the pixels of the interpolation region, and thereby the defective area is removed and corrected.
  • The aforementioned method is an effective image correction method when a defective area (removal region) is readily replaced by the surrounding region (interpolation region), such as a region of the sky or the ocean. However, when an image component with a unique shape, such as an eye, a nose, a mouth, or the like, is masked by a shield, or has halation or collapse therein, the method cannot recover the masked component, since no identical component exists in the surrounding region. Further, it cannot correct an image in which an image signal of a specific color has been lost, or a blurred image.
  • SUMMARY OF THE INVENTION
  • The present invention has been developed in view of the circumstances described above, and it is an object of the present invention to provide an image processing method and apparatus capable of accurately recovering various types of defects in an image. It is a further object of the present invention to provide a computer readable recording medium including a program therefor.
  • An image processing method of the present invention comprises the step of applying a predetermined structure in an input image, representing that an element of the structure is missing, to a model other than the missing element to recover the missing element of the structure in the input image, the model representing the predetermined structure by a statistical characteristic amount obtained by performing predetermined statistical processing on a plurality of images representing the predetermined structure including the recovery target element.
  • An image processing apparatus of the present invention is an apparatus for performing the image processing described above. That is, the apparatus comprises:
  • a model representing a predetermined structure by a statistical characteristic amount obtained by performing predetermined statistical processing on a plurality of images representing the predetermined structure including a recovery target element; and
  • a recovery means for applying the predetermined structure in an input image, representing that the element of the predetermined structure is missing, to the model other than the missing element to recover the missing element of the structure in the input image.
  • A computer readable recording medium of the present invention is a recording medium including a program for causing a computer to perform the image processing method described above.
  • That is, the features of the image processing method, apparatus, and program therefor are that a recovery target element of a predetermined structure does not exist in an input image but exists only in the model, and the recovery is performed by correlating the predetermined structure with the model other than the recovery target element and estimating the recovery target element based on the correlation.
  • The image processing method, apparatus, and program therefor will now be described in detail.
  • As a specific method for realizing image processing according to the present invention, an AAM (Active Appearance Models) scheme may be employed. The AAM scheme is one of the approaches for trying to interpret image contents based on a model. For example, if a face is the interpretation target object, a mathematical model of a face is generated by performing a principal component analysis on the shapes of the face portions of a plurality of learning images, or on the luminance data after normalization of the shapes. Then, the face portion of a new input image is represented by each of the principal components and a weighting parameter thereof to reconstruct the face image. The method is described in a non-patent literature “Active Appearance Models” by T. F. Cootes, et al., In Proc. 5th European Conference on Computer Vision, Springer, Germany, 1998, vol. 2, pp. 484-498 (reference document 1).
  • Preferably, the “predetermined structure” is a structure appropriate for modeling, i.e., a structure whose variation in shape or luminance in the image falls within a certain range. In particular, it is preferably a structure from which a statistical characteristic amount having a higher level of explanatory power for the shape or luminance thereof may be derived by performing the statistical processing thereon. One specific example is a human face.
  • The recovery target element, i.e., the missing element in an input image may include a portion of the region of a predetermined structure, a portion of color signals representing the predetermined structure, a portion of spatial frequency components representing the predetermined structure, the number of gradations representing the predetermined structure, or the like.
  • For example, if the predetermined structure is a human face, the loss of a portion of the region of the predetermined structure may include a portion of the face being collapsed to dark due to the shadow of the rim of the cap of the person, a portion of the face being masked by a shield standing between the face and the imaging device, or the like. Further, it may also include the case in which a signal of at least one channel of a portion of the region of the predetermined structure is lost due to a compression or transmission error of the imaging device. Still further, a face without glasses may be deemed as the loss of the glasses. Thus, the process for recovering a missing part of the region of the predetermined structure includes the addition (recovery) of appendages (ornaments) to the structure.
  • Specific examples of the loss of a portion of color signals representing the predetermined structure may include the loss of one of R, G, and B signals due to a compression or transmission error of the imaging device, or halation in high signal level component and collapse in low signal level component. Further, it may include the case where a monochrome image is deemed as the loss of color data and reconstructed as a color image. The color space is not limited to RGB, and YCC or the like may also be used. Preferably, however, the process is performed on RGB signals, since the halation in a high signal level component is more likely to occur in R signal.
  • Specific examples of the loss of a portion of spatial frequency components representing the predetermined structure may include the loss of a high frequency component such as an edge. Further, a face without pimples, wrinkles, or the like may be deemed as the loss of spatial frequency components representing the pimples, wrinkles, or the like.
  • The loss of the number of gradations means a state in which the image has a smaller number of gradations, and an image with a smaller number of gradations is reconstructed as an image with a greater number of gradations by the image processing of the present invention.
  • Further, the recovery target element may include the aforementioned elements mixed together. Specific examples include the case where a non-makeup face is deemed as the loss of makeup and reconstructed as a made-up face, and the case where an image obtained by a low performance camera, such as a cell phone camera, is reconstructed like an image obtained by a high performance camera.
  • “An image representing the predetermined structure including the recovery target element” may be an image of the predetermined structure that includes the recovery target element obtained by actual imaging, i.e., an image obtained by imaging the structure without any missing element, or an image obtained by artificially performing correction on an image not including the recovery target element so as to recover the target element.
  • As for the “predetermined statistical processing”, dimension compression processing capable of representing the structure by compressing it to a statistical amount with a number of dimensions less than the number of pixels representing the structure is preferable. A specific example is a multivariate statistical technique, such as the principal component analysis or the like. If the principal component analysis is performed as the “predetermined statistical processing”, the “statistical characteristic amount” means a plurality of principal components obtained by the principal component analysis.
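  • The following is a minimal sketch, not part of the original specification, of how such dimension compression may be realized with a principal component analysis in NumPy; the array names, the random stand-in data, and the choice of 23 retained components (a figure borrowed from reference document 1) are illustrative assumptions only.

```python
import numpy as np

def principal_components(samples, n_components):
    """samples: (num_samples, num_dims) array, one vectorized structure per row.
    Returns the average vector and the top principal-component (unique) vectors."""
    mean = samples.mean(axis=0)
    centered = samples - mean
    # The right singular vectors of the centered data are the eigenvectors of the
    # covariance matrix, i.e., the principal components.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return mean, vt[:n_components]

# Illustrative use: 100 sample faces, each described by 244 landmark coordinates
# (122 characteristic points x 2), compressed to 23 shape components.
shapes = np.random.rand(100, 244)        # stand-in for real characteristic-point data
s0, p = principal_components(shapes, 23)
```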
  • The level of the explanatory power described above means that a higher order principal component has a higher level of explanatory power, and a lower order principal component has a lower level of explanatory power, taking the case where the predetermined statistical processing is a principal component analysis as an example.
  • The “(predetermined) structure in an input image” may be detected either automatically or manually. Further, the present invention may further include a process for detecting the structure in an input image, or the region of the structure detected from an input image in advance may be used as the input for applying to the model.
  • The referent of “applying a predetermined structure in an input image to a model other than the missing element” as used herein means arithmetic operations or the like for representing the structure in the image by the model. More specifically, it means to obtain a weighting parameter value for weighting each of the principal components in the mathematical model to represent the structure in the input image by the mathematical model, taking the case where the AAM is used as an example.
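  • As a rough illustration of what “applying” amounts to computationally, the sketch below (an assumption-laden simplification, not the patented fitting procedure itself) obtains one weighting parameter per principal component by projecting the observed structure onto an orthonormal model basis, and rebuilds the structure from those parameters. In the actual fitting described later, the weights are instead searched for so that the reconstruction error against the observed, partially missing face is minimized.

```python
import numpy as np

def fit_weights(x, mean, components):
    """Obtain one weighting parameter per principal component for the observed
    structure x; assumes the component vectors (rows) are orthonormal, as
    produced by a principal component analysis."""
    return components @ (x - mean)

def rebuild(mean, components, weights):
    """Represent the structure by the model: the mean plus the weighted components."""
    return mean + components.T @ weights
```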
  • Further, an arrangement may be made in which a plurality of models of the present invention is provided, each for each attribute of the predetermined structure; information of the attribute of the structure in the input image is obtained; a process (means) for selecting the model according to the obtained attribute is additionally provided; and the predetermined structure in the input image is applied to the selected model.
  • The referent of “attribute” as used herein means, for example, the gender, age, or race, if the predetermined structure is a human face. Further, it may be information for identifying an individual. In this case, the model for each attribute means the model for each individual.
  • Specific methods for obtaining the “attribute” may include any known recognition process for images as described, for example, in Japanese Unexamined Patent Publication No. 11 (1999)-175724, or a method of estimating/obtaining it from the information appended to the image such as GPS information or the like.
  • The image processing method of the present invention may be used for reconstruction of a photograph for funeral, i.e., the photograph of a deceased person, reconstruction of a face image inputted for matching in face authentication, as well as reconstruction of general photograph images.
  • According to the image processing method and apparatus, and the computer readable recording medium including the program therefor, a missing element of a predetermined structure in an image may be recovered completely by applying the structure to a model representing the structure including the missing element as the recovery target element. Accordingly, various different types of defects in an image may be recovered accurately and easily without requiring any operational skills.
  • If an arrangement is made in which a plurality of models of the present invention is provided, each for each attribute of the predetermined structure; the attribute of the structure in the input image is obtained; a process (means) for selecting a model according to the obtained attribute is additionally provided; and the predetermined structure in the input image is applied to the model of the selected attribute, the structure in the input image may be applied to a more appropriate model, so that the processing accuracy is improved, and a higher quality image may be obtained.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 schematically illustrates a hardware configuration of a digital photograph printer according to an embodiment of the present invention.
  • FIG. 2 is a block diagram of the digital photograph printer and digital camera according to the embodiment of the present invention, illustrating the functions and process flow thereof.
  • FIGS. 3A and 3B illustrate example screens displayed on the display of the digital photograph printer and digital camera according to the embodiment of the present invention.
  • FIG. 4 illustrates an example menu screen for selecting details of an image recovery/addition process according to an embodiment of the present invention.
  • FIG. 5 is a block diagram illustrating details of the image recovery/addition process according to an embodiment of the present invention.
  • FIG. 6 is a flowchart illustrating a process flow for generating a mathematical model of a face image according to the present invention.
  • FIG. 7 illustrates example characteristic points set on a face.
  • FIG. 8 schematically illustrates the variation in face shapes when the value of the weighting factor for a unique vector of a principal component obtained by a principal component analysis for a face shape is varied.
  • FIG. 9 illustrates face shapes of sample images transformed into the average face shape, and the luminance thereof under the average face shape.
  • FIG. 10 schematically illustrates the variation in face luminance when the value of the weighting factor on a unique vector of a principal component obtained by principal component analysis for face luminance is varied.
  • FIGS. 11A to 11D illustrate that an input image is varied in a stepwise manner by the image recovery/addition process of the present invention.
  • FIG. 12 is a block diagram illustrating an evolutionary embodiment of the image recovery/addition process according to an embodiment of the present invention.
  • FIG. 13 schematically illustrates the configuration of a digital camera according to another embodiment of the present invention.
  • DESCRIPTION OF THE PREFERRED EMBODIMENTS
  • Hereinafter, an exemplary embodiment of the present invention will be described with reference to the accompanying drawings, taking as an example a case in which a portion of a face that is collapsed to dark in an image due to the shadow of the rim of the cap of the subject is recovered by the image processing of the present invention.
  • FIG. 1 schematically illustrates a hardware configuration of a digital photograph printer according to an embodiment of the present invention. As shown in the drawing, the digital photograph printer includes: a film scanner 51; a flat head scanner 52; a media drive 53; a network adapter 54; a display 55; a keyboard 56; a mouse 57; a hard disk 58; and a photograph print output unit 59, all of which are connected to an arithmetic/control unit 50.
  • The arithmetic/control unit 50 controls flows of image input, correction, retouching, and output, in addition to performing arithmetic operations in image processing for correction and retouching in collaboration with a CPU, a main memory, various output interfaces within the unit by executing a program installed from a storage medium, such as a CD-ROM or the like. The restoration process of the present invention is performed in the unit.
  • The film scanner 51 photoelectrically scans an APS negative film or a 135 negative film developed by a developing machine (not shown), and obtains digital image data P0 representing a photograph image recorded on the negative film.
  • The flat head scanner 52 photoelectrically scans a photograph image on a hard copy, such as an L-size photograph print or the like, and obtains digital image data P0.
  • The media drive 53 obtains image data P0 representing a photograph image recorded on a recording medium, such as a memory card, CD, DVD, or the like. The media drive 53 may also record output image data P2 on these recording media. Note that image data of an image obtained by, for example, a digital camera is recorded on the memory card. Further, for example, image data obtained by the film scanner 51 in the previous print order are recorded on the CD, DVD, and the like.
  • The network adapter 54 obtains image data P0 from an order acceptance machine (not shown) of a known network photograph service system. The image data P0 are image data based on a photograph print order placed by a user, and sent to the order acceptance machine from a personal computer of the user through the Internet. The image data P0 may also be those sent from the photograph print order acceptance machine installed in front of a lab shop.
  • The display 55 is a display for displaying operation screens for input, correction, retouching, and output of images in the digital photograph printer according to the present embodiment, and a menu for selecting an operation, a processing target image, or the like is displayed. The keyboard 56 and the mouse 57 are used for selecting a specific process.
  • The hard disk 58 includes a control program of the digital photograph printer. In addition, it temporarily stores image data P0 obtained by the film scanner 51, flat head scanner 52, media drive 53, or network adapter 54, corrected image data P1, and retouched image data P2 (output image data).
  • The photograph print output unit 59 performs laser scan exposure of photographic paper based on the image data P2 representing an output image, development, and drying. It also performs back printing, print-by-print paper cutting, and sorting for each order. The printing method may be a laser-exposure, thermal-development, dye-transfer process, or the like.
  • FIG. 2 is a block diagram of the digital photograph printer illustrating the functions and process flow thereof. As illustrated, from the aspect of function, the digital photograph printer includes: an image input means 1 for inputting image data of a print target image; an image correction means 2 for performing image processing on the image data P0 inputted thereto based on predetermined image processing conditions to automatically correct image quality of the image represented by the image data P0 (hereinafter, image data and the image represented by the image data are denoted by the same reference symbol); an image retouching means 3 for performing image processing on the automatically corrected image data P1 inputted thereto based on an instruction from the operator; and an image output means 4 for outputting a photograph print based on the retouched image data P2 or outputting the image data P2 to a recording medium.
  • The image correction means 2 performs white balance, contrast, and sharpness corrections, noise reduction/removal, and the like. The image retouching means 3 performs image retouching, such as manual correction of the processed results of the image correction means 2, trimming, enlargement/reduction, sepia conversion, monochrome conversion, and combining with a decorated frame, as well as the image recovery according to the present invention.
  • An operation and a process flow of the digital photograph printer will now be described.
  • First, image data P0 are inputted through the image input means 1. When outputting an image recorded on a developed film for printing or the like, the operator sets the film on the film scanner 51, or when outputting image data recorded on a recording medium, such as a memory card or the like, for printing or the like, the operator sets the recording medium on the media drive 53. In the mean time, a screen for selecting input source of the image data is displayed on the display 55, and the operator selects the input source using the keyboard 56 or the mouse 57. When the film is selected as the input source, the film set on the film scanner 51 is photoelectrically scanned by the film scanner 51, and the scanning result is subjected to digital conversion to generate image data P0, which are then sent to the arithmetic/control unit 50. If a hard copy script, such as a photograph print or the like, is selected, the hard copy script, such as a photograph print or the like, set on the flat head scanner 52 is photoelectrically scanned by the flat head scanner 52, and the scanning result is subjected to digital conversion to generate image data P0, which are then sent to the arithmetic/control unit 50. If a recording medium, such as a memory card or the like, is selected, image data P0 recorded on the recording medium, such as a memory card or the like, set on the media drive 53 are read by the arithmetic/control unit 50. If the input source is a photograph print order acceptance machine of a network photograph service system or a photograph print order acceptance machine installed in front of a photograph shop, image data P0 is received by the arithmetic/control unit 50 through the network adapter 54. The image data P0 obtained in the manner as described above are temporarily stored in the hard disk 58.
  • Then, the automatic image quality correction is performed on the image P0 by the image correction means 2. More specifically, known white balance, contrast, and sharpness corrections, noise reduction/removal, and the like are performed by the image processing program executed in the arithmetic/control unit 50 based on the setup conditions preset in the digital photograph printer, and processed image data P1 are outputted. The outputted image data P1 are temporarily stored in the memory of the arithmetic/control unit 50. Note that the image data P1 may be temporarily stored in the hard disk 58.
  • Thereafter, a thumbnail image of the corrected image P1 is generated and displayed on the display 55 by the image retouching means 3. FIG. 3A illustrates an example screen displayed on the display 55. The operator confirms the thumbnail images and selects an image that requires a manual image quality correction or an image with an image retouching order using the mouse 57 or the keyboard 56 (in FIG. 3A, the upper left image DSCF0001 is selected). Then, as in the example screen shown in FIG. 3B, the selected thumbnail image is displayed on the display 55 in an enlarged form, together with buttons for selecting a manual correction operation or a retouching operation. The operator selects an intended operation from the displayed buttons using the mouse 57 or the keyboard 56, and further sets details of the selected operation as required. In the present embodiment, if the “recovery/addition” button shown in FIG. 3B is selected/depressed by the operator, a menu screen for selecting a detail of the recovery/addition process is displayed as shown in FIG. 4. The image retouching means 3 performs image processing according to the selected operation, and outputs processed image data P2. The outputted image data P2 are stored in the memory of the arithmetic/control unit 50. Note that the image data P2 may also be temporarily stored in the hard disk 58. The image display on the display 55 by the image retouching means 3, acceptance of input through the mouse 57 or the keyboard 56, image processing for manual correction or retouching, and the like are controlled by the program executed in the arithmetic/control unit 50.
  • Finally, the image P2 is outputted by the image output means 4. Here, the arithmetic/control unit 50 causes the display 55 to display a screen for selecting an output destination. The operator selects a desired output destination using the mouse 57 or the keyboard 56, and the image data P2 are sent to the selected destination by the arithmetic/control unit 50. If a photograph printing is performed, the image data P2 are sent to the photograph print output unit 59, and the image data P2 are outputted as a photograph print. If the output destination is a recording medium, such as a CD or the like, the image data P2 are recorded on the CD or the like set on the media drive 53.
  • The image recovery/addition process according to the present invention, which is performed when “recovery/addition” is selected from the screen shown in FIG. 3B and “Remove Cap Shadow” is selected from the screen shown in FIG. 4, will now be described in detail. The process for “Remove Cap Shadow” is a process for recovering a portion of an image around the eyes of the subject person that is collapsed to dark due to the shadow of the cap of the person as shown in FIG. 3B (hereinafter referred to as the “eye area” or simply the “eyes”).
  • FIG. 5 is a block diagram illustrating details of the image recovery/addition process. As illustrated, the image recovery/addition process is realized by: a face detection section 31 for detecting a face portion P1 f of an image P1; an image recovery/addition section 32 for recovering the eye area of the detected face portion P1 f by applying the detected face portion P1 f to a mathematical model M generated by the AAM scheme (refer to reference document 1) based on a plurality of sample images representing human face portions including the eyes, i.e., the recovery target; and an image reconstruction section 33 for reconstructing an image P1 f′ of the face portion with recovered eyes based on a parameter C (to be described later) corresponding to the face portion P1 f obtained by applying the face portion P1 f to the mathematical model, fitting the image P1 f′ to the face portion of the image P1, and merging it therein, thereby generating an image P2 with the eye area of the image P1 recovered. The process steps are controlled by a program installed on the arithmetic/control unit 50.
  • The mathematical model M is generated based on the flowchart shown in FIG. 6, and preinstalled on the arithmetic/control unit 50 together with the aforementioned program. A generation process of the mathematical model M will now be described.
  • First, characteristic points representing a face shape are set on each of a plurality of face images (sample images) that include the eye area, i.e., the recovery target, as shown in FIG. 7 (step #1). Here, the number of characteristic points is 122 (FIG. 7 indicates only 60 characteristic points for clarity). The location of each characteristic point is predefined; for example, the first characteristic point is at the left end of the left eye, the 38th characteristic point is at the midpoint between the eyebrows, and so on. Each characteristic point may be set either manually or automatically through a recognition process, and may be corrected manually as required after automatic setting.
  • Then, an average face shape is calculated based on the characteristic points set on each of the sample images (step #2). More specifically, average values of position coordinates of the characteristic points representing the same region on the respective sample images are obtained.
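  • A minimal sketch of step #2 follows (not from the specification; the array layout is an assumption, and the shape alignment that a full AAM implementation would normally perform before averaging is omitted):

```python
import numpy as np

# landmarks: (num_samples, 122, 2) array holding the (x, y) coordinates of the
# 122 characteristic points set on each sample image (illustrative layout).
def average_face_shape(landmarks):
    """Step #2: average the position coordinates of corresponding characteristic points."""
    return landmarks.mean(axis=0)            # (122, 2) average face shape

landmarks = np.random.rand(100, 122, 2)      # stand-in for real sample data
s0_points = average_face_shape(landmarks)
```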
  • Further, a principal component analysis is performed based on the characteristic points representing the face shape in each of the sample images and the position coordinates of the average face shape (step #3). As a result, any face shape may be approximated by the following formula (1).
  • S = S0 + Σ (i = 1 to n) pi·bi  (1)
  • where: S is the shape vector on which the position coordinates of each characteristic point of a face shape are arranged (x1, y1, - - - x122, y122); S0 is the average face shape vector on which the position coordinates of each characteristic point on the average face shape are arranged; pi is a unique vector representing the ith principal component of the face shape obtained by the principal component analysis; and bi is a weighting factor for each unique vector pi. FIG. 8 schematically illustrates how the face shape varies when the values of the weighting factors b1 and b2 for the unique vectors p1 and p2 of the top two principal components obtained by the principal component analysis are varied. The variation width is from −3sd to +3sd, which is based on the standard deviation sd of the values of the weighting factors b1 and b2 when the face shape of each sample image is represented by the formula (1) above. The face shape in the center of the three face shapes in each principal component is the average face shape. This example shows that a component contributing to the contour shape of the face is derived as the first principal component as a result of the principal component analysis, and the face shape varies from an elongated face (−3sd) to a round face (+3sd) by varying the weighting factor b1. Similarly, components contributing to open/close states of the mouth and jaw length are derived as the second principal component, and the face shape varies from a face with long jaw and open mouth (−3sd) to a face with short jaw and closed mouth (+3sd) by varying the weighting factor b2. Note that a smaller value of i has stronger explanatory power of the shape. That is, it means a greater contribution to the face shape.
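  • The following sketch, offered only as an illustration and not as part of the specification, evaluates formula (1) and sweeps the first weighting factor between −3sd and +3sd, the same range used for FIG. 8; the average shape, unique vectors, and standard deviations are random stand-ins.

```python
import numpy as np

def shape_from_weights(s0, p, b):
    """Formula (1): S = S0 + sum_i b_i * p_i (p holds one unique vector per row)."""
    return s0 + p.T @ b

s0 = np.zeros(244)                      # average face shape vector (stand-in)
p = np.random.rand(2, 244)              # unique vectors p1 and p2 (stand-ins)
sd = np.array([5.0, 3.0])               # assumed standard deviations of b1 and b2

# Varying b1 from -3sd to +3sd reproduces the kind of variation shown in FIG. 8,
# e.g., from an elongated face to a round face.
for k in (-3, 0, 3):
    b = np.array([k * sd[0], 0.0])
    face_shape = shape_from_weights(s0, p, b)
```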
  • Next, each of the sample images is transformed (warped) into the average face shape obtained in step #2 (step #4). More specifically, the amount of shift for each of the characteristic points between each of the sample images and the average face shape is calculated, and based on the calculated amounts of shift, the amount of shift of each pixel of each of the sample images to the average face shape is calculated by the following formulae (2) to (5) (a two-dimensional quintic polynomial), and each of the sample images is warped to the average face shape on a pixel basis.

  • x′=x+Δx  (2)

  • y′=y+Δy  (3)
  • Δx = Σ (i = 0 to n) Σ (j = 0 to n−i) aij·x^i·y^j  (4)

  • Δy = Σ (i = 0 to n) Σ (j = 0 to n−i) bij·x^i·y^j  (5)
  • where: x and y are the coordinates of each characteristic point in each sample image; x′ and y′ are the corresponding coordinates on the average face shape to which the point is warped; Δx and Δy are the amounts of shift to the average shape; n is the order; and aij and bij are the coefficients. The coefficients for the polynomial approximation are obtained by the least squares method. Here, for a pixel whose coordinate value after warping is not an integer value but a position including a fractional part, the pixel value is determined by first order approximation; that is, the pixel value is distributed to the four surrounding pixels in proportion to the distances between the warped coordinate and each of those pixels. FIG. 9 illustrates face shapes of three sample images transformed into the average face shape, and the luminance thereof under the average face shape.
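  • The sketch below is a rough, assumption-based rendering of formulae (2) to (5): the polynomial coefficients aij and bij are fitted by least squares from the shifts of the 122 characteristic points (the inner summation is taken to run from 0 to n−i, the usual form of a two-dimensional polynomial of total order n), and the fitted warp is then evaluated at an arbitrary point.

```python
import numpy as np

def polynomial_terms(x, y, n=5):
    """Monomials x**i * y**j appearing in formulae (4) and (5) (quintic: n = 5)."""
    return np.array([x**i * y**j for i in range(n + 1) for j in range(n + 1 - i)])

def fit_warp(src_pts, dst_pts, n=5):
    """Least-squares fit of the coefficients a_ij and b_ij from the shifts of the
    characteristic points; src_pts and dst_pts are (num_points, 2) NumPy arrays."""
    basis = np.stack([polynomial_terms(x, y, n) for x, y in src_pts])
    shifts = dst_pts - src_pts                       # per-point (delta x, delta y)
    a = np.linalg.lstsq(basis, shifts[:, 0], rcond=None)[0]
    b = np.linalg.lstsq(basis, shifts[:, 1], rcond=None)[0]
    return a, b

def warp_point(x, y, a, b, n=5):
    """Formulae (2) and (3): x' = x + delta_x, y' = y + delta_y."""
    t = polynomial_terms(x, y, n)
    return x + t @ a, y + t @ b
```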
  • Further, a principal component analysis is performed based on the luminance of each of the sample images after being transformed into the average face shape (step #5). As a result, the luminance of any face image under the average face shape may be approximated by the following formula (6).
  • A = A0 + Σi qi·λi  (6)
  • where: A is a luminance vector on which the luminance of each pixel under the average face shape is arranged (a1, . . . , am, where m is the total number of pixels on the average face shape); A0 is an average face luminance vector on which the average luminance value of each pixel over the sample images in the average face shape is arranged; qi is a unique vector representing the ith principal component of the face luminance obtained by the principal component analysis; and λi is a weighting factor for each unique vector qi. FIG. 10 schematically illustrates how the face luminance varies when the values of the weighting factors λi1 and λi2 for the unique vectors qi1 and qi2, representing the i1-th and i2-th principal components respectively, obtained through the principal component analysis are varied. The variation width is from −3sd to +3sd, which is based on the standard deviation sd of the values of the weighting factors λi1 and λi2 when the face luminance of each sample image is represented by the formula (6) above. This example shows that a component contributing to the existence or nonexistence of beard is derived as the i1-th principal component as a result of the principal component analysis, and the face luminance varies from a face with dense beard (−3sd) to a face without beard (+3sd) by varying the weighting factor λi1. Similarly, a component contributing to the shadow on the face is derived as the i2-th principal component, and the face luminance varies from a face with shadows on the right side (−3sd) to a face with shadows on the left side (+3sd) by varying the weighting factor λi2. Note that a smaller value of i has stronger explanatory power of the luminance. That is, it means a greater contribution to the face luminance.
  • The mathematical model M of a face is generated through steps #1 to #5 described above. That is, the mathematical model M is represented by a plurality of unique vectors pi representing face shapes and unique vectors qi representing face luminance under the average face shape, and is dimension-compressed in that the total number of each type of unique vector is far smaller than the number of pixels forming the face image. Note that in the embodiment described in the reference document 1, a mathematical model of a face image with 23 unique vectors for the face shape and 114 unique vectors for the face luminance is generated by setting 122 characteristic points on an image formed of 10,000 pixels and performing the aforementioned process, and it is described that 90% of the variations in shape and luminance may be represented by varying the weighting factor for each unique vector.
  • An image recovery/addition process flow based on the AAM scheme using the mathematical model M will now be described with reference to FIGS. 5 and 11.
  • First, the face detection section 31 reads in the image data P1, and detects a face portion P1 f (FIG. 11B) of the image P1 (FIG. 11A). More specifically, the following method as described in Japanese Unexamined Patent Publication No. 2005-108195 (reference document 2) may be applicable. That is, a determination is made as to whether a face candidate region is present in the image P1 by inputting a first characteristic amount, indicating the direction of a gradient vector representing the direction and magnitude of an edge at each pixel, to a plurality of first discriminators (to be described later). If a face candidate region is present, the region is extracted, and the magnitude of the gradient vector at each pixel of the extracted region is normalized. Then, a determination is made as to whether the extracted face candidate region is a true face region by inputting a second characteristic amount, indicating the magnitude and direction of the normalized gradient vector, to a second discriminator (to be described later), and if it is determined to be a true face region, the region is extracted as the face portion P1 f. Here, the first/second discriminators are generated through a learning process of a machine learning technique, such as AdaBoost or the like, which uses as input the first/second characteristic amounts calculated for a plurality of images known to be of faces and a plurality of images known not to be of faces, which are sample images for learning. In the present embodiment, images in which the eye areas are collapsed to dark due to a cap shadow are used as the sample images for learning.
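  • Purely as an illustration of the kind of characteristic amount described above, and not as the procedure of reference document 2 itself, the sketch below computes a gradient vector (direction and magnitude) at each pixel and applies a simple global normalization; the actual discriminators would be trained with AdaBoost on such features.

```python
import numpy as np

def gradient_features(gray):
    """First characteristic amount (simplified): direction and magnitude of the
    gradient vector at each pixel of a grayscale image."""
    gy, gx = np.gradient(gray.astype(float))
    magnitude = np.hypot(gx, gy)
    direction = np.arctan2(gy, gx)
    return magnitude, direction

def normalize_magnitude(magnitude):
    """Normalization of the gradient magnitude over a candidate region before the
    second discriminator is applied (simple peak scaling assumed here)."""
    peak = magnitude.max()
    return magnitude / peak if peak > 0 else magnitude
```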
  • Various different known methods may be used for detecting the face portion P1 f, which may include: a method that uses a correlation score between a unique face expression and the image itself as described in U.S. Patent Application Publication No. 20040170323 (reference document 3), knowledge database, characteristics extraction, flesh color detection, template matching, graph matching, statistical methods (neural network, SVM, HMM), and the like. Further, when a face image P1 is displayed on the display 55, the face portion P1 f may be manually specified using the mouse 57 or the keyboard 56, or an automatic detection result may be manually corrected.
  • Next, the image recovery/addition section 32 applies the face portion P1 f to the mathematical model M. More specifically, it reconstructs the image based on the formulae (1) and (6) described above by varying the values of weighting factors for the unique vectors pi and qi from the top principal components in formulae (1) and (6) and obtains weighting factors bi and λi (collectively referred to as “parameter C”) when the difference between the reconstructed image and the face portion P1 f is minimal (refer to the reference document 3 for detail). Note that it is preferable that the range of the values of the weighting factors bi and λi be based on the standard deviations sd of the distributions of bi and λi when the sample images are represented by the formulae (1) and (6) at the time of generating the model, and limited to, for example, from −3sd to +3sd, and if the values do not fall in the range, the average values of the distributions be used. This may prevent an erroneous application of the model.
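  • The following fragment sketches only the constraint described above, namely keeping each weighting factor within ±3sd of its distribution over the sample images and falling back to the distribution average otherwise; the names and the array-based layout are assumptions, and the surrounding search loop that minimizes the difference from the face portion P1f is omitted.

```python
import numpy as np

def constrain_parameter_c(b, lam, b_sd, lam_sd, b_mean, lam_mean, limit=3.0):
    """Clamp the shape weights b_i and luminance weights lambda_i of parameter C.
    Values outside +/-limit standard deviations are replaced by the averages of
    the corresponding distributions, preventing an erroneous application of the model."""
    b = np.where(np.abs(b - b_mean) <= limit * b_sd, b, b_mean)
    lam = np.where(np.abs(lam - lam_mean) <= limit * lam_sd, lam, lam_mean)
    return b, lam
```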
  • Further, the image reconstruction section 33 reconstructs a face image P1 f′ (FIG. 11C) by substituting the parameter C to the formulae (1) and (6), and further, generates an image P2 (FIG. 11D) by replacing the face portion P1 f of the image P1 with the reconstructed face image P1 f′ and merging into the image P1, and outputs image data P2.
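  • A very small sketch of the reconstruction/merging step follows; the bounding-box representation of the face region and the hard (unblended) replacement are simplifying assumptions, since in practice the boundary would normally be blended.

```python
import numpy as np

def merge_face(image, face_region, reconstructed_face):
    """Replace the face portion P1f of image P1 with the reconstructed face P1f'
    (obtained by substituting parameter C into formulae (1) and (6)) to produce P2.
    face_region is a hypothetical (top, bottom, left, right) bounding box."""
    top, bottom, left, right = face_region
    restored = image.copy()
    restored[top:bottom, left:right] = reconstructed_face
    return restored
```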
  • As described above, according to the image recovery/addition process of the present embodiment, the image recovery/addition section 32 applies the face portion P1 f, detected by the face detection section 31 from the image P1 with the eye area thereof collapsed to dark due to the shadow of the cap of the subject, to the mathematical model M generated through the AAM scheme based on a plurality of sample images representing faces including the eye area. The image reconstruction section 33 reconstructs the image P1 f′ recovered in the eye area based on the parameter C corresponding to the face portion P1 f obtained by applying the face portion P1 f to the mathematical model M, and further generates the image P2 by fitting the face portion P1 f′ into the face portion of the image P1 and merging it therein. Here, the mathematical model M is generated from sample images representing the face portion including the eye area, so that the eye area, which is collapsed to dark in the image P1, may be depicted by the model M. Accordingly, even if the eye area of the face portion of the input image P1 is collapsed to dark, an image obtained by applying the face portion P1 f of the input image P1 to the model M is recovered in the eye area. In this way, according to the image recovery/addition process of the present invention, the eye area collapsed to dark due to the shadow of the cap of the subject may be recovered automatically without requiring manual operations. Consequently, the image P2 with extremely high recovery accuracy may be obtained easily without requiring any operational skills.
  • Note that, when another processing is selected on the menu screen in FIG. 4, the image recovery/addition process may be realized without modifying the configuration by appropriately changing the type of sample images for generating the mathematical model M, and of learning sample images used in the learning process for generating the discriminators in the face detection section 31 as required.
  • For example, when “Remove Net Fence” is selected on the menu screen in FIG. 4, an image of a person with a net pattern superimposed thereon obtained by imaging the person over a net fence is processed to remove the net pattern from the image and recover the face portion of the person. In this case, face images without net patterns superimposed thereon are used as the sample images for generating the mathematical model M, and face images with net patterns superimposed thereon are used as the sample images used in the learning process for generating the discriminators in the face detection section 31. This allows the image recovery/addition section 32 to apply a face portion P1 f with a net pattern superimposed thereon detected by the face detection section 31 from an image P1 obtained by imaging the face over a net fence to the mathematical model M generated through the AAM scheme based on a plurality of sample images without net patterns superimposed thereon, the image reconstruction section 33 to reconstruct a face image P1 f′ with the net pattern portion recovered based on the parameter C corresponding to the face portion P1 f obtained by applying the face portion P1 f to the mathematical model M, and further to generate an image P2 by fitting the image P1 f′ into the face portion of the image P1 and merging therein.
  • Further, when “Recover Lost Signal” is selected on the menu screen in FIG. 4, an image having halation in the high signal region or collapse in the low signal region due to an excessive contrast between the subject and the background, or an image with a lost image signal of a specific color due to a compression or transmission error of the imaging device is processed to correct the halation or collapse, or recover the lost signal. In this case, face images without halation, collapse, or a lost signal are used as the sample images for generating the mathematical model M, and face images with halation, collapse, or a lost signal are used as the sample images used in the learning process for generating the discriminators in the face detection section 31. Further, if color face images are used for generating the mathematical model M, and monochrome or sepia face images are used as the sample images used in the learning process for generating the discriminators in the face detection section 31, a color face image may be produced from a monochrome or sepia face image.
  • When “Pimple” is selected on the menu screen in FIG. 4, a face image of a person without any pimple is processed to add some pimples thereon, which may be called a pimple recovery process. In this case, face images with pimples are used as the sample images for generating the mathematical model M, and face images without pimples are used as the sample images used in the learning process for generating the discriminators in the face detection section 31. This allows the image recovery/addition section 32 to apply a face portion P1 f without pimples detected by the face detection section 31 from an image P1 to the mathematical model M generated through the AAM scheme based on a plurality of sample images with pimples, the image reconstruction section 33 to reconstruct a face image P1 f′ with pimples added (recovered) thereon based on the parameter C corresponding to the face portion P1 f obtained by applying the face portion P1 f to the mathematical model M, and further to generate an image P2 by fitting the image P1 f′ into the face portion of the image P1 and merging it therein. When adding other menu items, such as “Wrinkle”, “Makeup”, “Glasses”, or other elements, face images that include the element to be added are used as the sample images for generating the mathematical model M, and face images that do not include the element to be added are used as the sample images used in the learning process for generating the discriminators in the face detection section 31, in the same manner as described above.
  • Further, although not shown on the menu in FIG. 4, if images with a greater number of gradations are used as the sample images for generating the mathematical model M, and face images with a smaller number of gradations are used as the sample images used in the learning process for generating the discriminators in the face detection section 31, the image recovery/addition section 32 may apply a face portion P1 f with a smaller number of gradations detected by the face detection section 31 from an image P1 to the mathematical model M generated through the AAM scheme based on a plurality of sample images with a greater number of gradations, the image reconstruction section 33 may reconstruct a face image P1 f′ with a greater number of gradations based on the parameter C corresponding to the face portion P1 f obtained by applying the face portion P1 f to the mathematical model M, and further may generate an image P2 by fitting the image P1 f′ into the face portion of the image P1 and merging it therein. This may be called a recovery process for the number of gradations.
  • Still further, if non-blurry face images are used as the sample images for generating the mathematical model M, and blurry face images are used as the sample images used in the learning process for generating the discriminators in the face detection section 31, the image recovery/addition section 32 may apply a blurry face portion P1 f detected by the face detection section 31 from an image P1 to the mathematical model M generated through the AAM scheme based on a plurality of non-blurry sample images, the image reconstruction section 33 may reconstruct a non-blurry face image P1 f′ based on the parameter C corresponding to the face portion P1 f obtained by applying the face portion P1 f to the mathematical model M, and further may generate an image P2 by fitting the image P1 f′ into the face portion of the image P1 and merging it therein. This may be called a recovery process for the high-frequency components of the image.
  • In the present embodiment, a menu item is provided for each recovery/addition target element as in the menu shown in FIG. 4. But, if face images including a plurality of recovery/addition target elements are used as the sample images for generating the mathematical model M, and face images that do not include these recovery/addition target elements are used as the sample images used in the learning process for generating the discriminators in the face detection section 31, a comprehensive recovery/addition process may be realized.
  • In the embodiment described above, only a single mathematical model M is used, but a plurality of mathematical models Mi (i=1, 2, - - - ) may be generated, each for each attribute, such as human race, age, and gender. FIG. 12 is a block diagram illustrating the image recovery/addition process in this case. As illustrated, the embodiment differs from the embodiment shown in FIG. 5 in that it further includes: an attribute obtaining section 34 for obtaining attribute information Ak of the subject of an image P1; and a model selection section 35 for selecting a mathematical model Mk generated from sample images of subjects having the attribute included in the obtained attribute information Ak.
  • Here, each of the plurality of mathematical models is generated, for example, from a sample image group representing subjects of the same race, age, gender, or the like based on the aforementioned method (FIG. 6), and stored with attribute information Ai that indicates a common attribute in the sample image group being associated therewith.
  • The attribute obtaining section 34 may be a section for obtaining the attribute information by determining the attribute of the subject of the image P1 through a known recognition process, for example, the process described in Japanese Unexamined Patent Publication No. 11 (1999)-175724, or a section for obtaining the attribute of the subject recorded in the header of the image P1 as appended information at the time of imaging. Further, it may have a function to estimate the attribute of the subject based on the accompanying information. For example, if GPS information for imaged location is available, the country or region corresponding to the GPS information may be identified. Noting that the race of the subject may be estimated from the GPS information to a certain degree, a reference table for associating the GPS information with race information may be provided in advance. Then, using the image data P1 obtained by a digital camera capable of obtaining GPS information at the time of imaging and recording it in the header region of the image P1 (e.g., digital camera disclosed in Japanese Unexamined Patent Publication No. 2004-153428) as input, the GPS information recorded in the header region of the image P1 is obtained, and the race of the subject may be estimated by referring to the reference table based on the obtained GPS information.
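  • By way of illustration only, such a reference table could be as simple as the following bounding-box lookup; the coordinate ranges and attribute labels are invented placeholders, not values from the specification.

```python
# Hypothetical reference table mapping a GPS reading recorded in the image header
# to a region, which in turn serves as the attribute Ak used for model selection.
REGION_TABLE = [
    # (lat_min, lat_max, lon_min, lon_max, attribute label)
    (24.0, 46.0, 123.0, 146.0, "japan"),
    (36.0, 71.0, -10.0, 40.0, "europe"),
]

def attribute_from_gps(lat, lon, default="unknown"):
    """Estimate the attribute of the subject from GPS information, to a certain degree."""
    for lat_min, lat_max, lon_min, lon_max, label in REGION_TABLE:
        if lat_min <= lat <= lat_max and lon_min <= lon <= lon_max:
            return label
    return default
```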
  • The model selection section 35 obtains the mathematical model Mk associated with the attribute information Ak obtained by the attribute obtaining section 34. The image recovery/addition section 32 applies the face portion P1 f of the image P1 to the mathematical model Mk.
  • In this way, if a configuration is adopted in which a plurality of mathematical models Mi is provided, each for each attribute, in advance; the model selection section 35 selects a mathematical model Mk associated with the attribute Ak obtained by the attribute obtaining section 34; and the image recovery/addition section 32 applies the face shape P1 f to the selected mathematical model Mk, no unique vector for explaining the variation in the face shape or luminance arising from the difference in the attributes Ak is present in the mathematical model Mk, so that the face shape P1 f may be represented based only on the unique vectors representing other factors that determine the face shape or luminance. This may improve the processing accuracy and a higher quality image may be obtained.
  • Preferably, the attribute-based mathematical models are further specialized, and individual-based mathematical models are generated from the viewpoint of the processing accuracy. In this case, it is necessary to associate the images P1 with individuals.
  • In the embodiment described above, it is assumed that the mathematical model is preinstalled on the digital photograph printer. But, it is also preferable to provide race-based mathematical models in advance, and select the mathematical model according to the shipping country or the region of the printer from the viewpoint of improving the processing accuracy.
  • Further, the function to generate the mathematical models may be implemented in the digital photograph printer. More specifically, this simply means installing a program for performing the process explained based on the flowchart shown in FIG. 6 on the arithmetic/control unit 50. It is also possible to install a default mathematical model at the time of shipping, and then to customize the mathematical model, or to generate a new model different from the default mathematical model, using images input to the digital photograph printer. This is particularly effective when individual-based mathematical models are generated.
  • Still further, in the embodiment described above, face images are represented using separate weighting factors bi and λi for face shape and luminance respectively. There is a correlation between the face shape and luminance variations, so that an appearance parameter “c”, a new parameter, for controlling both the face shape and luminance may be obtained by performing a principal component analysis on a vector (b1, b2, . . . , bi, . . . , λ1, λ2, . . . ) obtained by combining the weighting factors bi and λi, as shown in the following formulae (7) and (8).

  • S = S0 + QS·c  (7)

  • A = A0 + QA·c  (8)
  • Here, the variance in the shape from the average face shape is represented by the appearance parameter “c” and the vector QS, and the variance in the luminance from the average face luminance is represented by the appearance parameter “c” and the vector QA.
  • When this model is used, the image recovery/addition section 32 obtains the face luminance under the average face shape based on the formula (8) above by varying the value of the appearance parameter “c”, and further reconstructs a face image by transforming the average face shape based on the formula (7) above, and obtains the appearance parameter “c” when the difference between the reconstructed image and the face portion P1 f becomes minimal.
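  • The sketch below, again only an assumption-laden illustration, evaluates formulae (7) and (8) from a given appearance parameter c and performs the coarse kind of search described above, varying c and keeping the value that minimizes the difference from the face portion; for brevity only the first component of c is scanned and the warp from the average shape is omitted.

```python
import numpy as np

def shape_and_luminance_from_c(s0, a0, qs, qa, c):
    """Formulae (7) and (8): a single appearance parameter vector c controls both
    the face shape and the face luminance through the matrices QS and QA."""
    return s0 + qs @ c, a0 + qa @ c

def fit_c(face_pixels, s0, a0, qs, qa, steps=50):
    """Keep the c giving the smallest difference between the rebuilt luminance and
    the observed face portion (one-dimensional scan over the first component only)."""
    best_c, best_err = None, np.inf
    for v in np.linspace(-3.0, 3.0, steps):
        c = np.zeros(qa.shape[1])
        c[0] = v
        _, luminance = shape_and_luminance_from_c(s0, a0, qs, qa, c)
        err = np.sum((luminance - face_pixels) ** 2)
        if err < best_err:
            best_c, best_err = c, err
    return best_c
```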
  • Another conceivable embodiment of the present invention is to implement the image recovery/addition process described above in a digital camera. FIG. 13 schematically illustrates the configuration of such a digital camera. As illustrated, the digital camera includes: an imaging section 71, formed of a lens, an aperture diaphragm, a shutter, a CCD, and the like, for imaging a subject; an A/D conversion section 72 for obtaining digital image data P0 by digitizing the analog signals of the charges stored in the CCD of the imaging section 71; an image processing section 73 for performing various image processing on the image data P0; a compression/expansion section 74 for performing compression on image data to be recorded on a memory card, or performing expansion on compressed image data from a memory card; a strobe section 75, formed of a strobe light and the like, for strobe emission; an operation section 76, formed of various operation buttons and the like, for specifying imaging conditions, image processing conditions, and the like; a media recording section 77, which is an interface for a memory card on which image data are recorded; a display section 78, formed of a liquid crystal display or the like, for displaying a through image, a recorded image, and various setting menus; a control section 70 for controlling the process of each of the sections described above; and an internal memory 79 including a control program, image data, and the like.
  • Here, the function of each means shown in FIG. 2 is realized by the following sections under the control of the control section 70 using the internal memory 79: the image input means 1 by the imaging section 71 and the A/D conversion section 72; the image correction means 2 by the image processing section 73; the image retouching means 3 by the image processing section 73, the operation section 76, and the display section 78; and the image output means 4 by the media recording section 77.
  • An operation and a process flow of the digital camera will now be described.
  • First, when the photographer fully depresses the shutter button, the imaging section 71 focuses light representing the subject, which enters through the lens, onto the photoelectric surface of the CCD, where it is photoelectrically converted and outputted as analog image signals, and the A/D conversion section 72 converts the outputted analog image signals to digital image signals and outputs the digital image signals as digital image data P0; thereby the function of the image input means 1 is realized.
  • Then, the image processing section 73 performs automatic white balance correction, white balance correction according to the type of the light source, gradation correction, density correction, color correction, sharpness correction, and the like, and outputs processed image data P1, thereby the function of the image correction means 2 is realized.
  • Here, the image P1 is displayed on the liquid crystal display of the display section 78. The display layout may be like that shown in FIG. 3A, in which a plurality of images is displayed in thumbnail format. The photographer selects a retouching target image and displays it in an enlarged form using the operation buttons of the operation section 76, and further performs manual image correction or retouching through menu selections. Here, if “Recovery/Addition” is selected, and a desired process is selected from the menu screen illustrated in FIG. 4, which appears when “Recovery/Addition” is selected, the control section 70 activates an image recovery/addition program stored in the internal memory 79, causes the image processing section 73 to perform the image recovery/addition process (FIG. 5, or the like) using the mathematical model M stored in advance in the internal memory 79, and outputs processed image data P2. Thereby the function of the image retouching means 3 is realized.
  • Then, the compression/expansion section 74 performs compression on the image data P2 based on a compression format, such as JPEG or the like, and records the compressed image data on a memory card loaded in the digital camera through the media recording section 77, thereby realizing the function of the image output means 4.
  • In this way, by implementing the image recovery/addition process according to the present invention as an image processing function of a digital camera, identical effects to those of the digital photograph printer may be obtained.
  • Here, a configuration may be adopted in which the image recovery/addition process is also performed on the image data stored in a memory card. More specifically, the image data stored in the memory card are expanded (decompressed) by the compression/expansion section 74, and the image represented by the expanded image data is displayed on the liquid crystal display of the display section 78. Then, a desired process is selected by the photographer in the same manner as described above, and the selected process is performed by the image processing section 73.
  • Further, the attribute-based mathematical models explained with reference to FIG. 12 and the like, or the mathematical model generation process explained with reference to FIG. 6, may be implemented in a digital camera. Here, it is often the case that the persons who become the subjects of photographs taken by a specific digital camera are limited to certain individuals. Therefore, if individual-based mathematical models are generated for the persons who often become the subjects of photographs taken by the digital camera, models without variations in the face due to personal difference may be obtained. Consequently, the recovery/addition process for the faces of these persons may be performed with extremely high accuracy.
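The sketch below illustrates how such attribute- or individual-based model selection could be wired up: a dictionary of pre-trained models, keyed by attribute or by a registered person's identity, is consulted using an attribute obtained from the input face, and the chosen model is passed to a recovery routine such as the one sketched earlier. The get_attribute callable, the dictionary keys, and the "generic" fallback model are assumptions for illustration; the patent does not prescribe a particular attribute-recognition method.

```python
from typing import Callable, Dict, Tuple
import numpy as np

# One pre-trained model per attribute (or per frequently photographed individual),
# stored as (model_mean, model_components) in the format used by the earlier recovery sketch.
Model = Tuple[np.ndarray, np.ndarray]
RecoverFn = Callable[[np.ndarray, np.ndarray, np.ndarray, np.ndarray], np.ndarray]

def recover_with_selected_model(face: np.ndarray,
                                missing_mask: np.ndarray,
                                models: Dict[str, Model],
                                get_attribute: Callable[[np.ndarray], str],
                                recover: RecoverFn,
                                default: str = "generic") -> np.ndarray:
    """Select the mathematical model matching the face's attribute, then run recovery with it."""
    attribute = get_attribute(face)                   # e.g. "adult_female" or a registered person's ID
    mean, components = models.get(attribute, models[default])
    return recover(face, missing_mask, mean, components)
```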
  • In addition to the embodiments described above, a program for causing a personal computer or the like to perform the image recovery/addition process of the present invention may be incorporated in retouching software. The user may install the software on a personal computer or the like from a recording medium, such as a CD-ROM, containing the software, or by downloading the software from a predetermined website. In this way, the user may use the image recovery/addition process of the present invention as one of the image editing/retouching schemes on his or her own personal computer.

Claims (8)

1. An image processing method, comprising the step of applying a predetermined structure in an input image, representing that an element of the structure is missing, to a model other than the missing element to recover the missing element of the structure in the input image, the model representing the predetermined structure by a statistical characteristic amount obtained by performing predetermined statistical processing on a plurality of images representing the predetermined structure including the recovery target element.
2. An image processing apparatus comprising:
a model representing a predetermined structure by a statistical characteristic amount obtained by performing predetermined statistical processing on a plurality of images representing the predetermined structure including a recovery target element; and
a recovery means for applying the predetermined structure in an input image, representing that the element of the predetermined structure is missing, to the model other than the missing element to recover the missing element of the structure in the input image.
3. The image processing apparatus according to claim 2, wherein:
the apparatus further comprises a detection means for detecting the predetermined structure in the input image; and
the recovery means is a means for applying the structure detected by the detection means to the model.
4. The image processing apparatus according to claim 2, wherein:
a plurality of models is provided, each for each attribute of the predetermined structure;
the apparatus further comprises a selection means for obtaining the attribute of the predetermined structure in the input image and selecting the model according to the obtained attribute; and
the recovery means is a means for performing the application to the model selected by the selection means.
5. The image processing apparatus according to claim 2, wherein the predetermined structure is a human face.
6. The image processing apparatus according to claim 2, wherein the recovery target element is at least one of a portion of the region of the structure, a portion of color signals representing the structure, a portion of spatial frequency components representing the structure, and the number of gradations representing the structure.
7. The image processing apparatus according to claim 2, wherein the model and the application to the model are realized by an active appearance model scheme.
8. A computer readable recording medium having recorded thereon an image processing program for causing a computer to perform a process which comprises the step of applying a predetermined structure in an input image, representing that an element of the structure is missing, to a model other than the missing element to recover the missing element of the structure in the input image, the model representing the predetermined structure by a statistical characteristic amount obtained by performing predetermined statistical processing on a plurality of images representing the predetermined structure including the recovery target element.
US11/709,708 2006-02-24 2007-02-23 Image processing method, apparatus, and computer readable recording medium including program therefor Active 2029-10-28 US7885477B2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2006048782A JP4767718B2 (en) 2006-02-24 2006-02-24 Image processing method, apparatus, and program
JP2006-048782 2006-02-24

Publications (2)

Publication Number Publication Date
US20070201750A1 true US20070201750A1 (en) 2007-08-30
US7885477B2 US7885477B2 (en) 2011-02-08

Family

ID=38444058

Family Applications (1)

Application Number Title Priority Date Filing Date
US11/709,708 Active 2029-10-28 US7885477B2 (en) 2006-02-24 2007-02-23 Image processing method, apparatus, and computer readable recording medium including program therefor

Country Status (2)

Country Link
US (1) US7885477B2 (en)
JP (1) JP4767718B2 (en)

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070071346A1 (en) * 2005-09-27 2007-03-29 Fuji Photo Film Co., Ltd. Method and apparatus for judging direction of blur and computer-readable recording medium storing a program therefor
US20090208052A1 (en) * 2008-02-14 2009-08-20 Ecole Polytechnique Federale De Lausanne (Epfl) Interactive device and method for transmitting commands from a user
US20090231420A1 (en) * 2008-03-14 2009-09-17 Tetsuya Kokufu Image pickup apparatus and image combining method of image pickup apparatus
US20100260438A1 (en) * 2009-04-08 2010-10-14 Nikon Corporation Image processing apparatus and medium storing image processing program
US20110019910A1 (en) * 2008-04-07 2011-01-27 Fujifilm Corporation Image processing system
US20110052045A1 (en) * 2008-04-04 2011-03-03 Fujifilm Corporation Image processing apparatus, image processing method, and computer readable medium
US20110055733A1 (en) * 2009-09-03 2011-03-03 International Business Machines Corporation System and Method for Locating Missing Items in a Virtual Universe
US7916971B2 (en) * 2007-05-24 2011-03-29 Tessera Technologies Ireland Limited Image processing method and apparatus
US20110090354A1 (en) * 2009-10-15 2011-04-21 Sony Corporation Block-based variational image processing method
US20120113116A1 (en) * 2010-11-08 2012-05-10 Cranial Technologies, Inc. Method and apparatus for preparing image representative data
US20120114201A1 (en) * 2010-11-08 2012-05-10 Cranial Technologies, Inc. Method and apparatus for processing image representative data
US8599209B2 (en) 2009-04-20 2013-12-03 Fujifilm Corporation Image processing apparatus, image processing method, and computer readable medium
US8750578B2 (en) 2008-01-29 2014-06-10 DigitalOptics Corporation Europe Limited Detecting facial expressions in digital images
US9251318B2 (en) 2009-09-03 2016-02-02 International Business Machines Corporation System and method for the designation of items in a virtual universe
US10242287B2 (en) * 2015-06-11 2019-03-26 Canon Kabushiki Kaisha Image processing apparatus, image processing method, and recording medium
CN110163794A (en) * 2018-05-02 2019-08-23 腾讯科技(深圳)有限公司 Conversion method, device, storage medium and the electronic device of image
US11328455B2 (en) * 2018-12-25 2022-05-10 Netease (Hangzhou) Network Co., Ltd. Method and apparatus for generating face model, storage medium, and terminal

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2009273117A (en) * 2008-04-07 2009-11-19 Fujifilm Corp Image processing system, image processing method, and program
JP5337969B2 (en) * 2008-04-08 2013-11-06 富士フイルム株式会社 Image processing system, image processing method, and program
JP5337970B2 (en) * 2008-04-08 2013-11-06 富士フイルム株式会社 Image processing system, image processing method, and program
WO2009125578A1 (en) * 2008-04-07 2009-10-15 富士フイルム株式会社 Image processing system, image processing method, and computer-readable medium
JP5399887B2 (en) * 2009-12-28 2014-01-29 本田技研工業株式会社 Vehicle periphery monitoring device
KR101919831B1 (en) 2012-01-11 2018-11-19 삼성전자주식회사 Object Recognition Apparatus, Classification Tree Learning Apparatus and Method thereof
US8977347B2 (en) * 2012-06-25 2015-03-10 Xerox Corporation Video-based estimation of heart rate variability
CN104573099B (en) * 2015-01-29 2018-05-22 作业帮教育科技(北京)有限公司 The searching method and device of topic
CN107330408B (en) * 2017-06-30 2021-04-20 北京乐蜜科技有限责任公司 Video processing method and device, electronic equipment and storage medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040170323A1 (en) * 2001-05-25 2004-09-02 Cootes Timothy F Object identification
US20040193789A1 (en) * 2002-08-29 2004-09-30 Paul Rudolf Associative memory device and method based on wave propagation
US20050036712A1 (en) * 2003-05-08 2005-02-17 Toshiaki Wada Image retrieving apparatus and image retrieving program
US20050169520A1 (en) * 2003-12-29 2005-08-04 Canon Kabushiki Kaisha Detecting human faces and detecting red eyes
US20050213820A1 (en) * 2000-03-09 2005-09-29 Microsoft Corporation Rapid computer modeling of faces for animation
US7068835B1 (en) * 1997-11-28 2006-06-27 Minolta Co., Ltd. Data processing apparatus for processing a 3-dimensional data of an object and a method therefor
US7471833B2 (en) * 2002-07-19 2008-12-30 Mitsubishi Electric Corporation Method and apparatus for processing image data

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0865519A (en) 1994-08-19 1996-03-08 Toppan Printing Co Ltd Method for correcting defect of picture and device therefor
JP4510556B2 (en) 2003-09-09 2010-07-28 富士フイルム株式会社 Object identification device and method, and program

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7068835B1 (en) * 1997-11-28 2006-06-27 Minolta Co., Ltd. Data processing apparatus for processing a 3-dimensional data of an object and a method therefor
US20050213820A1 (en) * 2000-03-09 2005-09-29 Microsoft Corporation Rapid computer modeling of faces for animation
US20040170323A1 (en) * 2001-05-25 2004-09-02 Cootes Timothy F Object identification
US7471833B2 (en) * 2002-07-19 2008-12-30 Mitsubishi Electric Corporation Method and apparatus for processing image data
US20040193789A1 (en) * 2002-08-29 2004-09-30 Paul Rudolf Associative memory device and method based on wave propagation
US20050036712A1 (en) * 2003-05-08 2005-02-17 Toshiaki Wada Image retrieving apparatus and image retrieving program
US20050169520A1 (en) * 2003-12-29 2005-08-04 Canon Kabushiki Kaisha Detecting human faces and detecting red eyes

Cited By (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7848588B2 (en) * 2005-09-27 2010-12-07 Fujifilm Corporation Method and apparatus for judging direction of blur and computer-readable recording medium storing a program therefor
US20070071346A1 (en) * 2005-09-27 2007-03-29 Fuji Photo Film Co., Ltd. Method and apparatus for judging direction of blur and computer-readable recording medium storing a program therefor
US7916971B2 (en) * 2007-05-24 2011-03-29 Tessera Technologies Ireland Limited Image processing method and apparatus
US8515138B2 (en) 2007-05-24 2013-08-20 DigitalOptics Corporation Europe Limited Image processing method and apparatus
US8494232B2 (en) 2007-05-24 2013-07-23 DigitalOptics Corporation Europe Limited Image processing method and apparatus
US9462180B2 (en) 2008-01-27 2016-10-04 Fotonation Limited Detecting facial expressions in digital images
US11689796B2 (en) 2008-01-27 2023-06-27 Adeia Imaging Llc Detecting facial expressions in digital images
US11470241B2 (en) 2008-01-27 2022-10-11 Fotonation Limited Detecting facial expressions in digital images
US8750578B2 (en) 2008-01-29 2014-06-10 DigitalOptics Corporation Europe Limited Detecting facial expressions in digital images
US20090208052A1 (en) * 2008-02-14 2009-08-20 Ecole Polytechnique Federale De Lausanne (Epfl) Interactive device and method for transmitting commands from a user
US8126221B2 (en) * 2008-02-14 2012-02-28 Ecole Polytechnique Federale De Lausanne (Epfl) Interactive device and method for transmitting commands from a user
US8405713B2 (en) * 2008-03-14 2013-03-26 Olympus Imaging Corp. Image pickup apparatus and image combining method of image pickup apparatus
US20090231420A1 (en) * 2008-03-14 2009-09-17 Tetsuya Kokufu Image pickup apparatus and image combining method of image pickup apparatus
US20110052045A1 (en) * 2008-04-04 2011-03-03 Fujifilm Corporation Image processing apparatus, image processing method, and computer readable medium
US20110019910A1 (en) * 2008-04-07 2011-01-27 Fujifilm Corporation Image processing system
US8447128B2 (en) 2008-04-07 2013-05-21 Fujifilm Corporation Image processing system
US20100260438A1 (en) * 2009-04-08 2010-10-14 Nikon Corporation Image processing apparatus and medium storing image processing program
US8599209B2 (en) 2009-04-20 2013-12-03 Fujifilm Corporation Image processing apparatus, image processing method, and computer readable medium
US20110055733A1 (en) * 2009-09-03 2011-03-03 International Business Machines Corporation System and Method for Locating Missing Items in a Virtual Universe
US8788952B2 (en) * 2009-09-03 2014-07-22 International Business Machines Corporation System and method for locating missing items in a virtual universe
US9251318B2 (en) 2009-09-03 2016-02-02 International Business Machines Corporation System and method for the designation of items in a virtual universe
US8633997B2 (en) * 2009-10-15 2014-01-21 Sony Corporation Block-based variational image processing method
US20110090354A1 (en) * 2009-10-15 2011-04-21 Sony Corporation Block-based variational image processing method
US8494237B2 (en) * 2010-11-08 2013-07-23 Cranial Technologies, Inc Method and apparatus for processing digital image representations of a head shape
US8442288B2 (en) * 2010-11-08 2013-05-14 Cranial Technologies, Inc. Method and apparatus for processing three-dimensional digital mesh image representative data of three-dimensional subjects
US20120114201A1 (en) * 2010-11-08 2012-05-10 Cranial Technologies, Inc. Method and apparatus for processing image representative data
US20120113116A1 (en) * 2010-11-08 2012-05-10 Cranial Technologies, Inc. Method and apparatus for preparing image representative data
US10242287B2 (en) * 2015-06-11 2019-03-26 Canon Kabushiki Kaisha Image processing apparatus, image processing method, and recording medium
CN110163794A (en) * 2018-05-02 2019-08-23 腾讯科技(深圳)有限公司 Conversion method, device, storage medium and the electronic device of image
US11328455B2 (en) * 2018-12-25 2022-05-10 Netease (Hangzhou) Network Co., Ltd. Method and apparatus for generating face model, storage medium, and terminal

Also Published As

Publication number Publication date
US7885477B2 (en) 2011-02-08
JP2007226655A (en) 2007-09-06
JP4767718B2 (en) 2011-09-07

Similar Documents

Publication Publication Date Title
US7885477B2 (en) Image processing method, apparatus, and computer readable recording medium including program therefor
US7756343B2 (en) Image processing method, image processing apparatus, and computer-readable recording medium storing image processing program
US7751640B2 (en) Image processing method, image processing apparatus, and computer-readable recording medium storing image processing program
US8107764B2 (en) Image processing apparatus, image processing method, and image processing program
US7848588B2 (en) Method and apparatus for judging direction of blur and computer-readable recording medium storing a program therefor
US20060280380A1 (en) Apparatus, method, and program for image processing
US8068645B2 (en) Apparatus, method, and program for image processing
US20060291739A1 (en) Apparatus, method and program for image processing
US7565073B2 (en) Photography apparatus, photography method, and photography program for obtaining an image of a subject
US8391595B2 (en) Image processing method and image processing apparatus
JP4856086B2 (en) Method and apparatus for detecting red eye in an acquired digital image
JP4367757B2 (en) Method and system for outputting formatted information relating to defects in one device of a chain, in particular relating to smearing
JP2004078912A (en) Method for positioning face in digital color image
US20050129331A1 (en) Pupil color estimating device
US20050220346A1 (en) Red eye detection device, red eye detection method, and recording medium with red eye detection program
JP2001309225A (en) Camera for detecting face and its method
JP2002245471A (en) Photograph finishing service for double print accompanied by second print corrected according to subject contents
JP2006295303A (en) Image processing method, apparatus and program
US20070014483A1 (en) Apparatus, method and program for image processing
JP2007122533A (en) Comment layout for image
JP2006350769A (en) Image processing device, method and program
JP2003187257A (en) Image creation system
JP2007124112A (en) Image processing method, device and program
JP2011188237A (en) Image processing method and image processing apparatus
JP2006023949A (en) Image processor and image processing method

Legal Events

Date Code Title Description
AS Assignment

Owner name: FUJIFILM CORPORATION, JAPAN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:ITO, WATARU;YAMAGISHI, HIDEKI;LI, YUANZHONG;SIGNING DATES FROM 20061225 TO 20070110;REEL/FRAME:018988/0309

Owner name: FUJIFILM CORPORATION, JAPAN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:ITO, WATARU;YAMAGISHI, HIDEKI;LI, YUANZHONG;REEL/FRAME:018988/0309;SIGNING DATES FROM 20061225 TO 20070110

FEPP Fee payment procedure

Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

STCF Information on status: patent grant

Free format text: PATENTED CASE

FPAY Fee payment

Year of fee payment: 4

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1552)

Year of fee payment: 8

MAFP Maintenance fee payment

Free format text: PAYMENT OF MAINTENANCE FEE, 12TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1553); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment: 12