Automatically Creating Prompts! A Workflow Combining the Best of Animagine-XL and Illustrious-XL

2025-10-102025-12-22

Caption with Florence-2
Create compositions with Animagine-XL
Polish with Illustrious-XL

Introduction

Hello, I'm Easygoing!

In this post, I’ll walk you through a workflow that automatically generates prompts from images, leveraging the strengths of both Animagine-XL and Illustrious-XL to create stunning illustrations.

Halloween witch girl illustration generated by Animagine-XL: A purple dress and hat — Happy Halloween!!

Animagine-XL Series: Exceptional Composition!

The Animagine-XL series is a wildly popular anime model based on SDXL. Its standout feature is its innovative composition.

Animagine XL 4.0 - v4 Opt | Stable Diffusion XL Checkpoint | Civitai

Nine Halloween witch variations by Animagine-XL: Diverse compositions of a girl in a purple dress, holding a basket or lantern, showcasing bold interpretations from simple prompts — Varied compositions from simple prompts

Example of flaws in Animagine-XL Halloween witch illustration: Purple dress girl with finger deformities (extra or misshapen fingers), highlighting creative composition vs. detail weaknesses — Inaccurate hand and finger rendering

The Animagine-XL series excels at interpreting simple prompts freely, producing bold and dynamic compositions.

However, compared to the newer Illustrious-XL series, it struggles with fine details, often resulting in hand and finger deformities.

Illustrious-XL Series: Precision in Details!

The Illustrious-XL series, another SDXL-based anime model, may not match Animagine-XL’s bold compositions but shines in generating clean, detailed illustrations.

BrDr XL - V03b | Illustrious Checkpoint | Civitai🌟 BrDr XL - Illustration Model 🌟 Hello, I'm BD. This is a mycivitai.com

Nine Halloween witch variations by brdrXL-v0.3b: Stable compositions with detailed illustrations of a purple dress girl holding a basket, showcasing Illustrious-XL’s precision — More restrained compositions compared to Animagine-XL

Strengths of brdrXL Halloween witch illustration: Accurate finger rendering in purple dress and hat, clear details from Illustrious-XL-based model output — Precise finger rendering

The brdrXL-v0.3b model, created by BD, is a custom model based on Illustrious-XL.

It stands out for its sharp resolution and high artistic expressiveness, making it particularly strong in detailed rendering.

Automatic Prompt ->Animagine-XL ->brdrXL


flowchart LR
A1(Reference Image)
subgraph Florence-2
B1(Caption<br>- Free Interpretation)
end
subgraph Animagine-XL
C1(Original Illustration<br>- Creative Composition)
end
subgraph brdrXL
D1(Redraw<br>- Detailed Rendering)
end
E1(Final Output)
A1-->B1
B1-->C1
C1-->D1
D1-->E1

Let’s explore a workflow that harnesses AI’s creativity by inputting a reference image to automatically generate prompts, then creating varied illustrations based on those prompts.

The prompts are generated by captioning images using the Florence-2 model, which excels at image recognition.

Generated Illustrations

Let’s dive into the illustrations we created.

Input Image

Generated Prompt

1girl, solo, smile, open mouth, black hair, dress, hat, holding, closed eyes, full body, black background, dress up, witch hat, purple dress, witch, holding basket, halloween

Original Illustration by Animagine-XL 4.0 Opt

Original Animagine-XL 4.0 Opt illustration: Dynamic composition of a Halloween witch, smiling girl in a purple dress with a basket, showcasing creative interpretation — Bold and fun composition

Redrawn by brdrXL-v0.3b

Color Adjustment + Upscaling

Final polished illustration: Color-adjusted (skin tone tweak) and upscaled brdrXL redraw, high-quality Halloween witch in purple dress — Color adjustment targeting skin tone

Using Florence-2 and Animagine-XL to freely interpret the input image, we created a bold composition, which was then redrawn by the brdrXL model to produce a highly polished illustration.

More Examples with the Same Prompt!

Let’s look at other illustrations generated from the same input image.

Variation 2 with the same prompt: Animagine-XL/brdrXL-generated Halloween witch, fun composition — Rich variety

Variation 3 with the same prompt: Unique AI-interpreted Halloween witch — A bacon-and-egg bowl?

Trying a Different Input Image!

Now, let’s generate illustrations using a different input image. The top image is the input, and the bottom is the generated illustration.

Woman with a Bouquet

Input image: Elegant illustration of a woman holding a bouquet, generated by HiDream, in a red dress with white flowers — Original from HiDream

Workflow output: Anime-style woman holding a bouquet, preserving the red dress and white flower vibe, generated by Animagine-XL/brdrXL — Captures the atmosphere

Photographer in a Meadow

Input image: Real photo of a female photographer in a black dress with a camera, meadow background, sourced from Pixabay — Downloaded from Pixabay

Workflow output: Anime-style conversion of the real photographer, woman with a camera in an outdoor scene, Animagine-XL/brdrXL real-to-anime example — Turning real photos into anime illustrations

Moon Rabbit

The Actual Workflow!

Here’s a breakdown of the actual workflow.

1. Captioning Images to Create Prompts

ComfyUI flowchart: Florence-2 captioning section, image input ->tag-based prompt output

Florence2_Caption_20251010.json

Models Used

Animagine-XL 4.0 Opt

The first part of the workflow involves captioning images to create tag-based prompts using the Florence-2 model, which is adept at image recognition.

Which Is the Best Auto-Prompt Model? A Thorough Comparison of TIPO, Cliption, and Florence-2! | AI Image Journey

Florence2Run node settings: prompt_gen_tags mode, max new tokens 1024, num beams 3, parameters for generating anime tags

The prompt_gen_tags mode in the Florence2Run node outputs tagged captions from input images, ideal for generating prompts for anime illustrations.

These varied prompts are then used with the Animagine-XL 4.0 Opt model to create bold compositions.

2. Full Workflow with Redrawing

The complete workflow includes redrawing with the brdrXL-v0.3b model, followed by color adjustments and AI upscaling for high resolution.

Full ComfyUI workflow: Animagine-XL generation ->brdrXL redrawing ->color adjustment/upscaling node diagram

Animagine-XL-4.0-Opt_brdrXL-v03b_20251010.json

Models Used

Custom Nodes

Here are the custom nodes used, along with their search screens in ComfyUI-Manager.

Bonus: Overcoming Illustrious-XL’s “Perfection”!

Both Animagine-XL and Illustrious-XL require quality prompts and negative prompts as inputs.

Is Negative Prompt Necessary? Unleashing AI’s Creativity! | AI Image Journey

The Illustrious-XL model excels at detailed rendering, but setting the CFG scale too high can reduce variety, leading to what’s called a “mask-like face”.

To counter this, we introduce the SDXL_Refiner model in the middle to disrupt the illustration and enhance variety.

Workflow diagram using three models: GhostXL ->SDXL_Refiner (disruption) ->brdrXL node connections, overcoming perfection example — Three-stage workflow

GhostXL_Refiner_brdrXL_20251010.json

Models Used

GhostXL: A derivative of Animagine-XL 3.0 released in February 2024, still highly expressive artistically
SDXL_Refiner: Disrupts the illustration to enhance completion
brdrXL_v03b: Used for final polishing

GhostXL

SDXL_Refiner

brdrXL_v03b + Finishing

brdrXL finishing: Halloween witch regenerated post-Refiner, softer expression illustration — Slightly offset eye positions create a softer expression

By using the Refiner to disrupt and then regenerate the illustration, we aim to reduce the overly perfect look typical of AI illustrations.

About Refiner

The True Value of the Refiner! What Emerges from Destructive Creation | AI Image Journey

Bonus 2: Mixing Models in Stable Diffusion webUI

While this post focuses on the ComfyUI workflow, you can also mix models to generate illustrations using default features in Stable Diffusion webUI (A1111, Forge, reForge, Forge Classic).

Example with Stable Diffusion webUI Forge

Stable Diffusion webUI Forge settings: Base/Refiner switch (0.8 timing), step count adjustment for mixing arbitrary models

Stable Diffusion webUI includes a feature to redraw with a Refiner model after the Base model, but you can also use any arbitrary model instead of the Refiner.

Set the total step count 20–30% higher than usual and the switch timing to around 0.7–0.8 for optimal results. Since the same prompt is used for both models, align the quality and negative prompts with the finishing model.

Conclusion: Combining the Best of Animagine-XL and Illustrious-XL!

Caption with Florence-2
Create compositions with Animagine-XL
Polish with Illustrious-XL

In this workflow, we combined Animagine-XL and Illustrious-XL to great effect. While Illustrious-XL excels in detailed rendering, I initially found its glossy skin and clothing textures less appealing.

However, discovering the brdrXL model with its matte, expressive quality has been a game-changer, and I’m excited to use it more.