Automatically Creating Prompts! A Workflow Combining the Best of Animagine-XL and Illustrious-XL

halloween_witch
  • Caption with Florence-2
  • Create compositions with Animagine-XL
  • Polish with Illustrious-XL

Introduction

Hello, I'm Easygoing!

In this post, I’ll walk you through a workflow that automatically generates prompts from images, leveraging the strengths of both Animagine-XL and Illustrious-XL to create stunning illustrations.

Halloween witch girl illustration generated by Animagine-XL: A purple dress and hat
Happy Halloween!!

Animagine-XL Series: Exceptional Composition!

The Animagine-XL series is a wildly popular anime model based on SDXL. Its standout feature is its innovative composition.

Nine Halloween witch variations by Animagine-XL: Diverse compositions of a girl in a purple dress, holding a basket or lantern, showcasing bold interpretations from simple prompts
Varied compositions from simple prompts
Example of flaws in Animagine-XL Halloween witch illustration: Purple dress girl with finger deformities (extra or misshapen fingers), highlighting creative composition vs. detail weaknesses
Inaccurate hand and finger rendering

The Animagine-XL series excels at interpreting simple prompts freely, producing bold and dynamic compositions.

However, compared to the newer Illustrious-XL series, it struggles with fine details, often resulting in hand and finger deformities.

Illustrious-XL Series: Precision in Details!

The Illustrious-XL series, another SDXL-based anime model, may not match Animagine-XL’s bold compositions but shines in generating clean, detailed illustrations.

Nine Halloween witch variations by brdrXL-v0.3b: Stable compositions with detailed illustrations of a purple dress girl holding a basket, showcasing Illustrious-XL’s precision
More restrained compositions compared to Animagine-XL
Strengths of brdrXL Halloween witch illustration: Accurate finger rendering in purple dress and hat, clear details from Illustrious-XL-based model output
Precise finger rendering

The brdrXL-v0.3b model, created by BD, is a custom model based on Illustrious-XL.

It stands out for its sharp resolution and high artistic expressiveness, making it particularly strong in detailed rendering.

Automatic Prompt -> Animagine-XL -> brdrXL


flowchart LR
A1(Reference Image)
subgraph Florence-2
B1(Caption<br>- Free Interpretation)
end
subgraph Animagine-XL
C1(Original Illustration<br>- Creative Composition)
end
subgraph brdrXL
D1(Redraw<br>- Detailed Rendering)
end
E1(Final Output)
A1-->B1
B1-->C1
C1-->D1
D1-->E1

Let’s explore a workflow that takes a reference image, automatically generates a prompt from it, and then creates varied illustrations from that prompt, letting the AI’s creativity do the work.

The prompts are generated by captioning images using the Florence-2 model, which excels at image recognition.

Generated Illustrations

Let’s dive into the illustrations we created.

Input Image

Input image: Simple Halloween witch girl illustration from Irasutoya, wearing a purple dress and holding a basket, used for Florence-2 captioning
Prompts were automatically generated from this illustration.

Generated Prompt

1girl, solo, smile, open mouth, black hair, dress, hat, holding, closed eyes, full body, black background, dress up, witch hat, purple dress, witch, holding basket, halloween

Original Illustration by Animagine-XL 4.0 Opt

Original Animagine-XL 4.0 Opt illustration: Dynamic composition of a Halloween witch, smiling girl in a purple dress with a basket, showcasing creative interpretation
Bold and fun composition

Redrawn by brdrXL-v0.3b

Redrawn by brdrXL-v0.3b: Halloween witch based on Animagine-XL original, with enhanced details in purple dress and accurate finger rendering
Highly detailed rendering

Color Adjustment + Upscaling

Final polished illustration: Color-adjusted (skin tone tweak) and upscaled brdrXL redraw, high-quality Halloween witch in purple dress
Color adjustment targeting skin tone

Using Florence-2 and Animagine-XL to freely interpret the input image, we created a bold composition, which was then redrawn by the brdrXL model to produce a highly polished illustration.

More Examples with the Same Prompt!

Let’s look at other illustrations generated from the same input image.

Variation 2 with the same prompt: Animagine-XL/brdrXL-generated Halloween witch, fun composition
Rich variety
Variation 3 with the same prompt: Unique AI-interpreted Halloween witch
A bacon-and-egg bowl?

Trying a Different Input Image!

Now, let’s generate illustrations using a different input image. The top image is the input, and the bottom is the generated illustration.

Woman with a Bouquet

Input image: Elegant illustration of a woman holding a bouquet, generated by HiDream, in a red dress with white flowers
Original from HiDream
Workflow output: Anime-style woman holding a bouquet, preserving the red dress and white flower vibe, generated by Animagine-XL/brdrXL
Captures the atmosphere

Photographer in a Meadow

Input image: Real photo of a female photographer in a black dress with a camera, meadow background, sourced from Pixabay
Downloaded from Pixabay
Workflow output: Anime-style conversion of the real photographer, woman with a camera in an outdoor scene, Animagine-XL/brdrXL real-to-anime example
Turning real photos into anime illustrations

Moon Rabbit

Input image: Irasutoya’s moon rabbit illustration, simple design with a white rabbit and moon on a blue background
Workflow output: Humorous moon rabbit-themed art, dynamic fantasy illustration of a rabbit and moon with AI interpretation
A fun AI interpretation!

The Actual Workflow!

Here’s a breakdown of the actual workflow.

1. Captioning Images to Create Prompts

ComfyUI flowchart: Florence-2 captioning section, image input -> tag-based prompt output

Models Used

The first part of the workflow involves captioning images to create tag-based prompts using the Florence-2 model, which is adept at image recognition.

Florence2Run node settings: prompt_gen_tags mode, max new tokens 1024, num beams 3, parameters for generating anime tags

The prompt_gen_tags mode in the Florence2Run node outputs tagged captions from input images, ideal for generating prompts for anime illustrations.

These varied prompts are then used with the Animagine-XL 4.0 Opt model to create bold compositions.
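As a rough illustration of what a tag-based prompt looks like once it leaves the captioning step, here is a minimal Python sketch that tidies a comma-separated tag caption before it is passed on. It does not call Florence-2 or ComfyUI. The helper name `clean_tag_caption` and the choice to lowercase and de-duplicate tags are my own assumptions, not part of the workflow above.

```python
def clean_tag_caption(caption: str) -> list[str]:
    """Split a comma-separated tag caption into tidy, unique tags.

    Hypothetical helper: lowercasing and de-duplication are assumptions,
    not something the captioning node is known to do.
    """
    seen = set()
    tags = []
    # Treat line breaks like commas so multi-line captions also work.
    for raw in caption.replace("\n", ",").split(","):
        # Collapse inner whitespace and normalise case.
        tag = " ".join(raw.strip().lower().split())
        if tag and tag not in seen:
            seen.add(tag)
            tags.append(tag)
    return tags


def to_prompt(tags: list[str]) -> str:
    """Join tags back into a single prompt string."""
    return ", ".join(tags)


if __name__ == "__main__":
    # The caption generated from the Halloween witch input image.
    caption = (
        "1girl, solo, smile, open mouth, black hair, dress, hat, holding, "
        "closed eyes, full body, black background, dress up, witch hat, "
        "purple dress, witch, holding basket, halloween"
    )
    print(to_prompt(clean_tag_caption(caption)))
```

Keeping the tags as a list makes it easy to drop or add individual tags before joining them back into the prompt text.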

2. Full Workflow with Redrawing

The complete workflow includes redrawing with the brdrXL-v0.3b model, followed by color adjustments and AI upscaling for high resolution.

Full ComfyUI workflow: Animagine-XL generation -> brdrXL redrawing -> color adjustment/upscaling node diagram

Models Used

Custom Nodes

Here are the custom nodes used, along with their search screens in ComfyUI-Manager.

ComfyUI-Florence2

ComfyUI-Manager: Search results for Florence2 custom node

cg-use-everywhere

ComfyUI-Manager: Installation screen for cg-use-everywhere custom node

ComfyUI-SuperBeasts

ComfyUI-Manager: Search and installation screen for ComfyUI-SuperBeasts custom node

ComfyUI-easygoing-nodes

ComfyUI-Manager: Search and installation screen for ComfyUI-easygoing-nodes

Bonus: Overcoming Illustrious-XL’s “Perfection”!

Both Animagine-XL and Illustrious-XL require quality prompts and negative prompts as inputs.
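Because each model family expects its own quality and negative prompts, it can help to keep them as presets and attach them to the generated tags per stage. The sketch below shows one way to do that in Python. The names `PromptPreset`, `PRESETS`, and `build_prompts` are hypothetical, and the tag strings are placeholder values for illustration only; check each model's card for the tags it actually recommends.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PromptPreset:
    quality: str   # quality tags added to the positive prompt
    negative: str  # negative prompt used with this model family


# Placeholder values (assumptions), not official recommendations.
PRESETS = {
    "animagine": PromptPreset(
        quality="masterpiece, high score, great score, absurdres",
        negative="lowres, bad anatomy, bad hands, worst quality, low quality",
    ),
    "illustrious": PromptPreset(
        quality="masterpiece, best quality, very aesthetic, absurdres",
        negative="lowres, worst quality, bad quality, bad anatomy",
    ),
}


def build_prompts(subject_tags: str, model_family: str) -> tuple[str, str]:
    """Return (positive, negative) prompts for the given model family."""
    preset = PRESETS[model_family]
    # Appending quality tags after the subject tags is an assumption;
    # some model cards ask for a different order.
    positive = f"{subject_tags}, {preset.quality}"
    return positive, preset.negative


if __name__ == "__main__":
    tags = "1girl, solo, witch hat, purple dress, holding basket, halloween"
    for family in ("animagine", "illustrious"):
        positive, negative = build_prompts(tags, family)
        print(family, "|", positive, "|", negative)
```

The subject tags stay the same across stages; only the quality and negative parts change with the model.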

The Illustrious-XL model excels at detailed rendering, but setting the CFG scale too high can reduce variety, leading to what’s called a “mask-like face”.

To counter this, we insert the SDXL_Refiner model between the composition model and the finishing model to disrupt the illustration and enhance variety.

Workflow diagram using three models: GhostXL -> SDXL_Refiner (disruption) -> brdrXL node connections, overcoming perfection example

Three-stage workflow

Models Used

  • GhostXL: A derivative of Animagine-XL 3.0 released in February 2024, still highly expressive artistically
  • SDXL_Refiner: Disrupts the illustration to add variety before the final pass
  • brdrXL_v03b: Used for final polishing

GhostXL

GhostXL generation: Halloween witch art illustration, example of overly perfect facial feature placement
High artistic expressiveness but overly perfect facial feature placement

SDXL_Refiner

After SDXL_Refiner: Example of disrupted illustration, shifted eye and nose positions creating variety
Disrupting the illustration slightly alters eye and nose positions

brdrXL_v03b + Finishing

brdrXL finishing: Halloween witch regenerated post-Refiner, softer expression illustration
Slightly offset eye positions create a softer expression

By using the Refiner to disrupt and then regenerate the illustration, we aim to reduce the overly perfect look typical of AI illustrations.

About Refiner

Bonus 2: Mixing Models in Stable Diffusion webUI

While this post focuses on the ComfyUI workflow, you can also mix models to generate illustrations using default features in Stable Diffusion webUI (A1111, Forge, reForge, Forge Classic).

Example with Stable Diffusion webUI Forge

Stable Diffusion webUI Forge settings: Base/Refiner switch (0.8 timing), step count adjustment for mixing arbitrary models

Stable Diffusion webUI includes a feature to redraw with a Refiner model after the Base model, and you can select any other model in place of the Refiner.

Set the total step count 20–30% higher than usual and the switch timing to around 0.7–0.8 for optimal results. Since the same prompt is used for both models, align the quality and negative prompts with the finishing model.
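To make the arithmetic concrete, here is a small Python sketch that turns a usual step count into a raised total and splits it at the switch timing. The function name `mixed_model_steps` and the rounding are my own assumptions; the exact step at which a given webUI switches models may differ by one.

```python
def mixed_model_steps(usual_steps: int, extra: float = 0.25,
                      switch_at: float = 0.8) -> tuple[int, int, int]:
    """Return (total, base_steps, finishing_steps) for a two-model run.

    extra:     fraction added to the usual step count (0.2 to 0.3 suggested above)
    switch_at: fraction of the run handled by the first model (0.7 to 0.8)
    """
    if usual_steps <= 0:
        raise ValueError("usual_steps must be positive")
    if not 0.0 < switch_at < 1.0:
        raise ValueError("switch_at must be between 0 and 1")
    total = round(usual_steps * (1 + extra))
    base_steps = round(total * switch_at)    # steps run by the first model
    finishing_steps = total - base_steps     # steps run by the finishing model
    return total, base_steps, finishing_steps


if __name__ == "__main__":
    # 28 usual steps, 25% more, switch at 0.8
    print(mixed_model_steps(28))  # (35, 28, 7)
```

With these example values, the first model still runs as many steps as it would alone, and the finishing model gets the extra steps on top.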

Conclusion: Combining the Best of Animagine-XL and Illustrious-XL!

  • Caption with Florence-2
  • Create compositions with Animagine-XL
  • Polish with Illustrious-XL

In this workflow, we combined Animagine-XL and Illustrious-XL to great effect. While Illustrious-XL excels in detailed rendering, I initially found its glossy skin and clothing textures less appealing.

However, discovering the brdrXL model with its matte, expressive quality has been a game-changer, and I’m excited to use it more.

Summary illustration: Halloween witch generated using three models
GhostXL is a stellar derivative of Animagine-XL

Animagine-XL and its derivatives produce unexpected compositions from simple prompts. They showcase AI’s creativity and remain timeless models.

I’ll continue exploring unique expressions through model combinations.

Thank you for reading!


Reference Articles

Color Adjustment Settings

Settings for the HDR Effects with LAB Adjust node, which this workflow uses for color adjustment.

ComfyUI Custom Nodes

An introduction to standard ComfyUI custom nodes, including those used in this workflow.