当前位置：首页 >熱點 >【】

【】

2025-04-14 21:14:43 [綜合] 来源：有聲有色網

Apple is dabbling in AI image-editing with an open-source multimodal AI model.

Earlier this week, researchers from Apple and the University of California, Santa Barbara released MLLM-Guided Image Editing, or "MGIE;" a multimodal AI model that can edit images like Photoshop, based on simple text commands.

On the AI development front, Apple has been characteristically cautious about its plans. It was also one of the few companies that didn't announce any big AI plans in the wake of last year's ChatGPT hype. However, Apple reportedly has an in-house version of a ChatGPT-esque chatbot dubbed "Apple GPT" and Tim Cook said Apple will be making some major AI announcements later this year.

SEE ALSO:Tim Cook says big Apple AI announcement is coming later this year

Whether this announcement includes an AI image editing tool remains to be seen, but based on this model, Apple is definitely doing some research and development.

Mashable Light SpeedWant more out-of-this world tech, space and science stories?Sign up for Mashable's weekly Light Speed newsletter.By signing up you agree to our Terms of Use and Privacy Policy.Thanks for signing up!

While there are already AI image editing tools out there, "human instructions are sometimes too brief for current methods to capture and follow," said the research paper. This often leads to lackluster or failed results. MGIE is a different approach that uses MLLMs, or multimodal large language models, to understand the text prompts or "expressive instruction," as well as image training data. Effectively, learning from MLLMs helps MGIE understand natural language commands without the need for heavy description.

In examples from the research, MGIE can take an input image of a pepperoni pizza and using the prompt, "make this more healthy" infer that "this" is referring to the pepperoni pizza and "more healthy" can be interpreted as adding vegetables. Thus, the output image is a pepperoni pizza with some green vegetables scattered on top.

Related Stories

Apple Vision Pro teardown: What's inside the $3,500 headset
Apple is working on a foldable clamshell iPhone, report says
Apple Car may be coming much, much later than we hoped

In another example comparing MGIE to other models, the input image is a forested shoreline and a tranquil body of water. With the prompt "add lightning and make the water reflect the lightning," other models omit the lightning reflection, but MGIE successfully captures it.

MGIE is available as an open-source model on GitHub and as a demo version hosted on Hugging Face.

TopicsAppleArtificial Intelligence

(责任编辑：熱點)

相关内容

推荐文章

Two astronauts just installed a new parking spot on the International Space Station
UPDATE: Aug. 19, 2016, 2:04 p.m. EDT 。 Astronauts Kate Rubins and Jeff Williams are back in the Inter ...[详细]
國米首發：盧卡庫領銜鋒線勞塔羅埃裏克森出戰
國米首發：盧卡庫領銜鋒線勞塔羅埃裏克森出戰_博洛尼亞www.ty42.com 日期:2021-04-04 02:31:00| 評論(已有266921條評論) ...[详细]
記者：海港很難找到邁斯托羅維奇替代者申花鋒線是隱患
記者：海港很難找到邁斯托羅維奇替代者申花鋒線是隱患_外援www.ty42.com 日期:2021-04-05 23:01:00| 評論(已有267398條評論) ...[详细]
滬媒：國足熱身賽8連勝靠實力全力拚就配得上掌聲
滬媒：國足熱身賽8連勝靠實力全力拚就配得上掌聲_中國國家隊www.ty42.com 日期:2021-04-05 08:31:00| 評論(已有267211條評論) ...[详细]
Fake news reports from the Newseum are infinitely better than actual news
Actual investigative journalism: who needs it?At least, that's what some people will likely conclude ...[详细]
國米首發：盧卡庫領銜鋒線勞塔羅埃裏克森出戰
國米首發：盧卡庫領銜鋒線勞塔羅埃裏克森出戰_博洛尼亞www.ty42.com 日期:2021-04-04 02:31:00| 評論(已有266921條評論) ...[详细]
陳戌源保護國腳論被批幹涉俱樂部事務:下指令隻會讓人反感
陳戌源保護國腳論被批幹涉俱樂部事務:下指令隻會讓人反感_國足www.ty42.com 日期:2021-04-04 10:31:00| 評論(已有266980條評論) ...[详细]
火爆！法甲榜首大戰巴黎輸球又輸人內馬爾再染紅
火爆！法甲榜首大戰巴黎輸球又輸人內馬爾再染紅_比賽www.ty42.com 日期:2021-04-04 02:31:00| 評論(已有266919條評論) ...[详细]
Slack goes down again, prompting anxiety everywhere
Panic briefly took over on Tuesday when everyone's favorite messaging app/millstone went down tempor ...[详细]
名記：C羅為了進球太自私領袖作用不及盧卡庫伊布
名記：C羅為了進球太自私領袖作用不及盧卡庫伊布_全尤文www.ty42.com 日期:2021-04-06 08:31:00| 評論(已有267445條評論) ...[详细]

热点阅读

随机内容

友情链接

接受PR>=1、BR>=1，流量相当，内容相关类链接。

樓上骨討論區