New Apple AI Model Edits Images Based on Natural Language Input

Apple researchers have released a new open-source AI model that is capable of editing images based on a user's natural language instructions (via VentureBeat).

DALL%C2%B7E apple logo image editing ai

MacRumors image made with DALL·E

Called "MGIE," which stands for MLLM-Guided Image Editing, it uses multimodal large language models (MLLMs) to interpret user requests and perform pixel-level manipulations.

The model is capable of editing various aspects of images. Global photo enhancements can include brightness, contrast, or sharpness, or the application of artistic effects like sketching. Local editing can modify the shape, size, color, or texture of specific regions or objects in an image, while Photoshop-style modifications can include cropping, resizing, rotating, and adding filters, or even changing backgrounds and blending images.

A user input for a photo of a pizza could be to "make it look more healthy." Using common sense reasoning, the model can add vegetable toppings, such as tomatoes and herbs. A global optimization input request might take the form of "add contrast to simulate more light," while a Photoshop-style modification could be made by asking the model to remove people from the background of a photo, shifting the focus of the image to the subject's facial expression.

Apple collaborated with University of California researchers to create MGIE, which was presented in a paper at the International Conference on Learning Representations (ICLR) 2024. The model is available on GitHub, and includes the code, data, and pre-trained models.

MGIE apple AI model image editing
This is Apple's second breakthrough in AI research in as many months. In late December, Apple revealed that it had made strides in deploying large language models (LLMs) on iPhones and other Apple devices with limited memory by inventing an innovative flash memory utilization technique.

For the last several months, Apple has been testing an "Apple GPT" rival that could compete with ChatGPT. According to Bloomberg's Mark Gurman, work on AI is a priority for Apple, with the company designing an "Ajax" framework for large language models.

Both The Information and analyst Jeff Pu claim that Apple will have some kind of generative AI feature available on the ‌iPhone‌ and iPad around late 2024, which is when iOS 18 will be coming out. iOS 18 is said to include an enhanced version of Siri with ChatGPT-like generative AI functionality, and has the potential to be the "biggest" software update in the iPhone's history, according to Gurman.

Popular Stories

Generic iOS 18 Feature Real Mock

iOS 18 Available Now With These 8 New Features For Your iPhone

Sunday September 15, 2024 10:09 am PDT by
Following over three months of beta testing, iOS 18 was finally widely released to the public on Monday, September 16. The update is available in the Settings app under General → Software Update on the iPhone XS and newer. Below, we have highlighted eight key new features included in iOS 18, and Apple shared a complete list of new features and changes last week. Note that Apple...
iOS 18 Public Beta Thumb 1

Here's When iOS 18 Rolls Out Today in Every Time Zone

Monday September 16, 2024 3:56 am PDT by
It's that time of year again. Apple is about to release iOS 18, which promises to bring a range of new features and improvements to iPhones worldwide. It's Apple's biggest software update of the year, and the company is expected to release it sometime today – Monday, September 16. Based on past releases, the update is likely to drop at around 10:00 a.m. Pacific Time/1:00 p.m. Eastern...
Beyond iPhone 13 Better Blue Face ID Single Camera Hole

10 Reasons to Wait for Next Year's iPhone 17

Friday September 13, 2024 2:40 am PDT by
Apple's iPhone development roadmap runs several years into the future and the company is continually working with suppliers on several successive iPhone models simultaneously, which is why we sometimes get rumored feature leaks so far ahead of launch. The iPhone 17 series is no different – already we have some idea of what to expect from Apple's 2025 smartphone lineup. If you plan to skip...
apple silicon mac lineup wwdc 2022 feature purple

M4 Macs, New iPad Mini, and iPad 11 Expected at Upcoming Apple Event

Sunday September 15, 2024 5:29 am PDT by
Apple will likely hold another event in October this year to announce new Macs and iPads. If so, it would be the fourth time in the last five years that Apple has held an event in October. Last year, Apple held a virtual event on Monday, October 30 to announce new MacBook Pro and iMac models with the M3 series of chips. In his Power On newsletter today, Bloomberg's Mark Gurman reiterated...
16 pro

iPhone 16 Pro Demand Has Been Lower Than Expected, Analyst Says

Sunday September 15, 2024 3:58 pm PDT by
Apple analyst Ming-Chi Kuo today said demand for the iPhone 16 Pro and iPhone 16 Pro Max has been "lower than expected" since the devices became available to pre-order in the U.S. and dozens of other countries on Friday. Kuo said his data is based on a "supply chain survey" and shipping estimates listed on Apple's online store. Kuo estimated that sales of all four iPhone 16 models reached...
iphone 16 pro apple intelligence

Apple Intelligence Features Expected to Roll Out in This Order Between iOS 18.1 and iOS 18.4

Friday September 13, 2024 1:01 pm PDT by
iOS 18 will be released to the public on Monday, but the first Apple Intelligence features will not be available until iOS 18.1 is released in October. Apple Intelligence features will continue to roll out in iOS 18.2 and beyond, with the expected roadmap outlined below per Apple's website and rumors. Apple Intelligence requires an iPhone 15 Pro model or any iPhone 16 model, and it will...

Top Rated Comments

klasma Avatar
8 months ago
Apple really can’t make up their minds, can they?

Attachment Image
Score: 19 Votes (Like | Disagree)
AlmightyKang Avatar
8 months ago
Apple as always do something because it has a benefit to the user, not just for the sake of the technology. This is the ML approach we need.

Most of the VC and AI industry are just grifting and trying to sell it as an application for everything and trying to create new markets for it.
Score: 13 Votes (Like | Disagree)
AAPLbuyback Avatar
8 months ago

The main imagine is kinda creepy...
Creepy is part of the allure of AI.
Score: 12 Votes (Like | Disagree)
ThatGuyInLa Avatar
8 months ago
And yet even then we won’t be able to ask Siri, “Turn on kitchen lights and set to 35% and light blue.”

“Play my subscriptions for YouTube on AppleTV.”
Score: 11 Votes (Like | Disagree)
kirky29 Avatar
8 months ago
The main imagine is kinda creepy...
Score: 10 Votes (Like | Disagree)
Jason2000 Avatar
8 months ago

The new development is that Apple can now do it too, which means that it’s exciting and innovative now.
It's exciting if Apple is bringing this to their photos app. I don't want to have to use Google for this type stuff.
Score: 9 Votes (Like | Disagree)