イメージプロンプトの手作業は終わり：PromptPerfect で Midjourney 風の画像を逆エンジニアリング

はい、あなたです！この記事を読んでいるあなた。Midjourney や他の画像生成モデルに時間を費やしすぎているプロンプトエンジニアのあなたへ、この記事はあなたのためのものです。

「まさか ~~ヒョウ~~ AI が私の顔を食べるとは思わなかった」と、~~ヒョウ~~ AI が人々の顔を食べる党に投票した女性が泣きながら言った。

💡

Adrian Bott に謝意を表します

AI が多くの仕事を奪っている中で、こうも言えるかもしれません：

最初、AI はアーティストたちを奪っていったが、私は声を上げなかった - なぜなら私はアーティストではなかったから。次に AI はプロンプトエンジニア（AI を使ってアーティストを押しのけた人たち）を奪いにきた。そして私は困ってしまった - なぜならそれは私の仕事だったから。

💡

Martin Niemöller に謝意を表します

その通り、相棒。あなたは Midjourney に「mid（平凡）」を付け加えている。あなたの Stable Diffusion は不安定な混乱のようなもの。そしてあなたの DALL-E のスキルは実際には CRAP-E レベル。PromptPerfect のようなツールがあれば、誰でも既存の画像からプロンプトを逆エンジニアリングしたり、人間がループに入った状態でリアルタイムのステップバイステップのフィードバックを得ながらプロンプトを生成したりすることができます。

では、あなたの顔を食べたがっている AI のヒョウたちから（少なくとも今のところは）先手を打って身を守れるよう、画像からプロンプトを逆エンジニアリングする方法を見ていきましょう。

💡

PromptPerfect は Midjourney スタイルの画像だけでなく、DALL-E 3 や Stable Diffusion XL、さらに多くの LLM 向けに最適化されたプロンプトを生成することもできます。

tagPromptPerfect Interactive

PromptPerfect Interactive はコンテンツ生成と複雑なタスクへの取り組み方を変革します。次の 2 つのアプローチに基づいています：

専用アシスタント：あなたのニーズを理解し、効果的なプロンプトの作成を支援する AI パートナーで、コンテンツ生成プロセスをできるだけスムーズにします。
強力なオプティマイザー：プロンプトを最適な結果に向けて微調整する高度なツールで、クリエイティブで生産的な取り組みをより効果的にします。

PromptPerfect は最近 Midjourney スタイルの画像生成を導入したので、この記事では Interactive のアシスタントとオプティマイザーを使用して、画像からプロンプトを逆エンジニアリングし、それらのプロンプトに基づいて Midjourney スタイルの新しい画像を生成します。

tag画像プロンプトを逆エンジニアリングする方法

まず、作業する画像が必要です。以前 Midjourney で生成したこのパンクなアインシュタインの画像を使ってみましょう：

アインシュタインをパンクスタイルで描いた芸術的な描写。カラフルな髪と様式的な要素が、ぼやけた鮮やかな背景に配置されている

画像を生成するために Midjourney で使用したプロンプトは以下の通りです：

Realistic photo of Albert Einstein as a punk, retaining his recognizable facial
features, with a brightly colored mohawk, visible tattoos, facial piercings,
and wearing a spiked leather jacket, highly detailed, photorealistic, vibrant
colors, dramatic lighting --ar 4:5 --s 500 --v 6

次に、その画像を PromptPerfect の Interactive Assistant にアップロードし、その画像に基づいてプロンプトを生成するよう依頼しました：

プロンプト機能を含む AI オプティマイザーの記事のスクリーンショット。アインシュタインの鮮やかな画像とスタイリングオプションが含まれている。

返されたプロンプトは：

Colorful punk hairstyle, vibrant pink, orange, and gray hair, edgy jacket with
spikes and colorful geometric patterns, urban background, bokeh lighting,
cyberpunk aesthetic --ar 4:5 --v 6 --s 500

いいですね...ただし、一つ欠けているものがあります：アインシュタイン本人です。Interactive にはチャットベースのインターフェースがあるので、単純にプロンプトに彼を追加するよう伝えました：

Alex Cureton-Griffiths による指示を含む、アインシュタインの創造的な描写を特徴とするメール下書きのスクリーンショット。

すると以下のようになりました：

Albert Einstein with a colorful punk hairstyle in vibrant pink, orange, and
gray hues, wearing an edgy jacket adorned with spikes and colorful geometric
patterns, set against an urban background with bokeh lighting, cyberpunk
aesthetic --ar 4:5 --v 6 --stylize 500

最初の画像を作成するために使用したプロンプトとまったく同じではないことにお気付きでしょう。これは当然のことです - まず第一に、AI 生成画像はプロンプトで使用された特徴以外の要素も取り入れているからです。たとえば、最初の画像では、アインシュタインは右を向いており、襟元に赤い部分がありますが、これらはプロンプトで指定していなかったので、画像からプロンプトを逆エンジニアリングしても、最初に使用したのと同じプロンプトは得られません。第二の理由は、画像分析モデル（多くの AI と同様）が非決定論的であることです - 同じ画像からプロンプトを逆エンジニアリングするよう 2 回目に依頼すると、異なる詳細を拾い上げる可能性があります。

とにかく、プロンプトが得られたので、「アシスタントに送信」ボタンをクリックして Midjourney スタイルの 4 つの画像を生成できます：

スタイライズされたアインシュタインのプロンプトと「アシスタントに送信」ボタンを備えたテキスト生成インターフェース。

ピンク、オレンジ、グレーのパンクな髪型で、スパイク付きジャケットを着たアインシュタインの鮮やかな描写。ボケた背景が特徴。

ここでも最初の画像とは一致していないことがわかります。そしてそれは決して一致することはありません。同じプロンプトを画像生成モデルに 2 回入力してみるだけで、まったく異なる結果が得られます - 画像認識モデルと同様に非決定論的だからです。

左上の画像が本当に気に入りました。それをクリックしてアップスケールを選択すると、物理学のおじさんの最終的な画像が完成します：

グレーがかった髪、口ひげ、鼻ピアス、スパイク付きジャケットを着たアインシュタインのカラフルなポートレート。鮮やかなボケ背景が特徴。

もちろん、Midjourney 本体でもプロンプトをテストでき、同様の結果が得られます：

鮮やかな衣装とさまざまな髪色のアインシュタインの 4 つの生き生きとしたポートレートのコラージュ。表現豊かな都市の背景に配置されている。

tagその他の例

以下にいくつかの例を示します。内容の順序は:

最初のプロンプト
Midjourney で生成された画像
リバースエンジニアリングされたプロンプト
PromptPerfect Interactive で生成された Midjourney スタイルの画像

tagターボ鳩

abstract, minimalist mesh wireframe of A pigeon::4 , wearing a helmet and
carrying a turbo booster on its back, with a gradient of green, cyan, and blue
lines against a black background, Vanishing point, with minimal detailing::4 ,
--ar 16:9 --s 750 --v 6.0

Futuristic bird with neon pink, blue, and red features against a black background, creating a techno-artistic ambiance.

Futuristic bird with neon lights, intricate feather details, glowing pink and
blue colors, highly detailed, digital art, ethereal and luminous, dark
background, dynamic light streaks, cybernetic effect, hyper-realistic --ar
16:9 --v 6 --stylize 750

Digital art of a mystical red and blue bird with colorful lights and sparks against a gradient background.

tag溶ける脳

melting brain, floating in space, plain black background --ar 16:9 --niji 6
--s 750

Colorful digital art of a melting brain in pink and blue, with vein-like patterns and floating bubbles against a dark backgro

Surreal, melting brain suspended in space, dripping neon pink and blue colors,
abstract, fluid textures, hyper-detailed, futuristic, digital art, cosmic
background with stars, vibrant and glowing, soft lighting --ar 16:9 --v 6
--stylize 750

Abstract depiction of a glowing brain in pink hues against a dark, starry backdrop, evoking a mystical aura.

tagボリウッド版プリンセス・レイア

Bollywood Star Wars scene, close up shot of Princess Leia Organa in traditional
Indian attire, intricate jewelry, holding a defender sporting blaster pistol,
vibrant colors, futuristic elements, sci-fi, dramatic lighting, detailed
background, cinematic, 8K resolution, Unreal Engine, --ar 4:5 --v 6.0

Woman cosplaying as Princess Leia with a blaster, styled hair in buns, in a red-tinted room with hanging lanterns.

Princess Leia, holding a blaster, futuristic sci-fi setting, white robe,
detailed hair buns, dramatic lighting, heroic pose, vibrant colors, cinematic
scene, intricate background with glowing elements --ar 4:5 --s 500 --v 6

Digital painting of a woman styled like Princess Leia in white, holding a blaster, against a colorful bokeh background.

うーん...ボリウッド要素が本当に足りないですね。これはリバースエンジニアリングの事実として、画像分析アルゴリズムが人間なら見つけるものを見落とすことがあります。少し試行錯誤（プロンプトエンジニアリングの専門用語です）した後、プロンプトを以下のように改良しました：

Princess Leia, holding a blaster, futuristic sci-fi setting, dressed in a 
white robe with intricate Indian embroidery, ethnically Indian with 
traditional Indian facial features, detailed hair buns adorned with 
traditional Indian jewelry, dramatic lighting, heroic pose, vibrant colors, 
Bollywood-inspired design, charismatic expression, cinematic scene, intricate 
background with glowing elements and traditional Indian patterns --ar 4:5 --s 
500 --v 6

これで次の画像が生成されました：

Woman in Indian attire with braided hair and jewelry holding a gun, against a luminous background.

ここで対話型オプティマイザーが真価を発揮します。私一人だったら、単純に bollywood という単語をプロンプトに追加するだけでしたが、オプティマイザーに Refine this Midjourney-style prompt to include more Bollywood vibes と指示することで、PromptPerfect はプロンプトにより多くの説明的な単語（traditional Indian patterns など）を追加しました。ウェイトやスタイルをいじるよりも、特定の結果を示唆するより多くの単語や詳細を追加する方が、通常は生成される画像に影響を与えるより良い方法です。

tagパステルメダル

a medal is sitting on a podium against pastel colored confetti, in the style
of simplified forms and shapes, yellow and beige, columns and totems, playful
streamlined forms, nerdcore, contest winner, repetition and pattern --ar 64:39
--s 750 --v 6.0

Celebratory image featuring a bronze medal with a red ribbon and laurel wreath pattern against a rich blue background.

Award medal, intricate laurel design, suspended from a ribbon, celebratory
background, vibrant confetti, glowing lights, high detail, 3D render, soft
lighting, pink and blue color scheme, festive atmosphere --ar 16:9 --s 500
--v 6 --stylize 750

Mysterious medal with silver edges, suspended amidst red particles on a deep blue bokeh background with heart and star-shaped

tag画像のリバースエンジニアリングを始めましょう

PromptPerfect を使って画像プロンプトのリバースエンジニアリングを始めるには、サインアップして有料の PromptPerfect プランを 7 日間無料でお試しください。最初のログインから 24 時間以内にプランに登録すると、40% オフになります：

これが、飢えた AI のヒョウたちの先を行く唯一の方法だとお分かりでしょう！