Techno Blender
Digitally Yours.

Apple introduces groundbreaking AI image editing model: MGIE

0 25


Apple researchers have introduced a groundbreaking AI model, MLLM-Guided Image Editing (MGIE), capable of editing images based on text prompts. Developed in collaboration with researchers from the University of California, Santa Barbara, this model represents a significant advancement in image editing technology. Unlike existing models, MGIE reportedly handles a wide range of editing scenarios, from simple color adjustments to complex object manipulations.

The core of the MGIE is a Multimodal Large Language Model (MLLM), which interprets user requests and provides concise instructions for image editing. This approach enables the model to address ambiguous commands effectively, achieving reasonable editing results. For instance, the MLLM understands a request to “make a pizza more healthy”, and connects the term “healthy” with “vegetable toppings,” instructing the diffusion model to edit the image accordingly.

The MGIE can edit images from your text description

What sets MGIE apart from existing models like LLM-Guided Image Editing (LGIE) is its enhanced visual perception. While LGIE is confined to a single modality, MLLM within MGIE has access to the input image and cross-modal understanding, allowing for more descriptive instructions. This capability enables the model to identify specific regions in the image that need adjustment, such as brightening certain areas for a desired effect.

MGIE is now available as an open-source project on GitHub, offering code, data, and pre-trained models for download. Additionally, a web demo hosted on Hugging Face spaces allows users to experience the image editing capabilities of the model firsthand. However, Apple has not yet disclosed its plans for integrating MGIE into its products beyond research projects.

During Apple’s recent quarterly earnings call, CEO Tim Cook confirmed the company’s ongoing work on AI features for its devices. The company is likely to announce the results later this year. Business Standard expects these AI enhancements to extend to various Apple services, including Siri, Messages, and Apple Music. With the incorporation of generative AI features, users can anticipate improvements such as text summarization, personalized suggestions, and enhanced functionality across Apple’s ecosystem.


Apple researchers have introduced a groundbreaking AI model, MLLM-Guided Image Editing (MGIE), capable of editing images based on text prompts. Developed in collaboration with researchers from the University of California, Santa Barbara, this model represents a significant advancement in image editing technology. Unlike existing models, MGIE reportedly handles a wide range of editing scenarios, from simple color adjustments to complex object manipulations.

The core of the MGIE is a Multimodal Large Language Model (MLLM), which interprets user requests and provides concise instructions for image editing. This approach enables the model to address ambiguous commands effectively, achieving reasonable editing results. For instance, the MLLM understands a request to “make a pizza more healthy”, and connects the term “healthy” with “vegetable toppings,” instructing the diffusion model to edit the image accordingly.

The MGIE can edit images from your text description

What sets MGIE apart from existing models like LLM-Guided Image Editing (LGIE) is its enhanced visual perception. While LGIE is confined to a single modality, MLLM within MGIE has access to the input image and cross-modal understanding, allowing for more descriptive instructions. This capability enables the model to identify specific regions in the image that need adjustment, such as brightening certain areas for a desired effect.

MGIE is now available as an open-source project on GitHub, offering code, data, and pre-trained models for download. Additionally, a web demo hosted on Hugging Face spaces allows users to experience the image editing capabilities of the model firsthand. However, Apple has not yet disclosed its plans for integrating MGIE into its products beyond research projects.

During Apple’s recent quarterly earnings call, CEO Tim Cook confirmed the company’s ongoing work on AI features for its devices. The company is likely to announce the results later this year. Business Standard expects these AI enhancements to extend to various Apple services, including Siri, Messages, and Apple Music. With the incorporation of generative AI features, users can anticipate improvements such as text summarization, personalized suggestions, and enhanced functionality across Apple’s ecosystem.

FOLLOW US ON GOOGLE NEWS

Read original article here

Denial of responsibility! Techno Blender is an automatic aggregator of the all world’s media. In each content, the hyperlink to the primary source is specified. All trademarks belong to their rightful owners, all materials to their authors. If you are the owner of the content and do not want us to publish your materials, please contact us by email – [email protected]. The content will be deleted within 24 hours.

Leave a comment