Anima Model’s Dual Licensing! Open Source Technology from Alibaba and NVIDIA’s Ambition

An anime-style illustration of a purple-haired woman standing in front of a computer screen displaying code, set within a futuristic, neon-lit data center.
  • Anima is an image generation AI whose roots trace back to Wan2.1.
  • It is subject to NVIDIA’s strict licensing restrictions.
  • It is safest to avoid commercial use for now.

Introduction

Hello, this is Easygoing.

This time, I’d like to explore the structure and licensing of the new anime-specialized model called Anima.

An anime illustration of a purple-haired woman wearing glasses and a lab coat standing inside a golden futuristic data center
Today’s topic is the Anima model.

What is the Anima Model?

The Anima model is a text-to-image AI model specialized in anime-style illustrations, released by CircleStone Labs.

gantt
    title Image Generative AI Roadmap
    dateFormat YYYY-MM-DD
    tickInterval 12month
    axisFormat %Y

    section Black Forest Labs
        Flux.1 : done, 2024-08-01, 2026-05-01

    section Alibaba
        Wan2.1 : 2025-02-25, 2026-05-01
        Wan2.2 : 2025-08-07, 2026-05-01
        Qwen-Image : 2025-08-04, 2026-05-01

    section NVIDIA
        Cosmos-Predict1 : done, 2025-01-06, 2026-05-01
        Cosmos-Predict2 : 2025-04-30, 2026-05-01
        Cosmos-Predict2.5 : 2025-10-06, 2026-05-01

    section CircleStone Labs
        Anima Preview 1 : 2026-01-26, 2026-05-01
        Anima Preview 2 : 2026-03-11, 2026-05-01
        Anima Preview 3-base : 2026-04-08, 2026-05-01

The Anima model was first released as Preview1 on January 26, 2026, and as of May 2026, it has progressed to Preview3-base.

The Roots of Anima: Alibaba’s Wan

The Anima model is composed of the following components:

Anima Developer Model License
Text Encoder Alibaba Qwen3-0.6B-base Apache-2.0
Transformer CircleStone Labs Cosmos-Predict2-2B
(Fine-tuned on anime illustrations)
NVIDIA Open Model License
CircleStone Labs Non-Commercial License
VAE Alibaba Qwen-Image_vae
(Wan_vae)
Apache-2.0
  • Text Encoder: Qwen3 lightweight version (Alibaba)
  • Transformer: Cosmos-Predict2-2B (NVIDIA)
  • VAE: Qwen-Image_vae (Alibaba)

The Anima model uses Alibaba’s lightweight Qwen3 as its text encoder, NVIDIA’s Cosmos-Predict2-2B as its transformer, and Alibaba’s Qwen-Image_vae as its VAE.

Anima Uses a Dual Licensing Structure

While the text encoder and VAE are released under the open-source Apache-2.0 license, the transformer part adopts a dual licensing structure that combines NVIDIA’s and CircleStone Labs’ respective proprietary licenses.

An anime illustration of a purple-haired woman in a lab coat pointing at machinery inside a golden futuristic data center
NVIDIA and CircleStone Labs Dual Licensing

To make the licensing of the Anima model easier to understand, let’s organize the path that led to its creation.

Organizing the Timeline by Focusing on the VAE

When looking at the origins of image generation AI models, focusing on the VAE (Variational Auto Encoder) makes things much clearer.

The VAE is a specialized model that generates the latent space for efficient AI computation. If you change the VAE, you generally need to restart training from scratch.

Stable Diffusion and Flux as the Foundation

Open-source image generation AI began with Stability AI’s Stable Diffusion.

gantt
    title VAE Roadmap
    dateFormat YYYY-MM-DD
    tickInterval 12month
	axisFormat %Y

    section Stability AI
        Stable Diffusion 1 : done, 2022-08-22, 2026-04-18
        Stable Diffusion XL 0.9 : done, 2023-06-22, 2026-04-18
        
    section Black Forest Labs
        Flux.1 : 2024-08-01, 2026-04-18
        Flux.2 : 2025-11-25, 2026-04-18
		

Later, former Stability AI staff founded Black Forest Labs and developed Flux.1.

The Flux.1_vae not only dramatically improved performance compared to previous versions but was also released under the Apache-2.0 license, allowing free development and commercial use.

As a result, it has been adopted by many image generation AI models.

VAE Performance Comparison

Image Generation AI Model License Overview

Alibaba Develops Wan_vae

China’s Alibaba then developed Wan_vae, which added temporal understanding based on Flux.1_vae. In February 2025, they released the video generation model Wan-2.1 that incorporated this VAE.

gantt
    title Image Generative AI Roadmap
    dateFormat YYYY-MM-DD
    tickInterval 12month
    axisFormat %Y

    section Black Forest Labs
        Flux.1 : done, 2024-08-01, 2026-05-01

    section Alibaba
        Wan2.1 : 2025-02-25, 2026-05-01
        Wan2.2 : 2025-08-07, 2026-05-01
        Qwen-Image : 2025-08-04, 2026-05-01

    section NVIDIA
        Cosmos-Predict1 : done, 2025-01-06, 2026-05-01
        Cosmos-Predict2 : 2025-04-30, 2026-05-01
        Cosmos-Predict2.5 : 2025-10-06, 2026-05-01

    section CircleStone Labs
        Anima Preview 1 : 2026-01-26, 2026-05-01
        Anima Preview 2 : 2026-03-11, 2026-05-01
        Anima Preview 3-base : 2026-04-08, 2026-05-01

Because Wan-2.1 was released under the Apache-2.0 license, which allows free development and commercial use, it became the most widely adopted model for local video generation AI, along with its successor Wan-2.2.

Furthermore, Alibaba adjusted the Wan_vae structure for still images, creating Qwen-Image_vae, which is also widely used in their image generation model Qwen-Image.

An anime illustration of a purple-haired woman in a lab coat looking toward the viewer inside a golden futuristic data center
Wan_vae and Qwen-Image_vae share the same structure.

NVIDIA’s Cosmos-Predict2 Is Based on Wan

Now let’s turn our attention to NVIDIA.

NVIDIA originally had a physics simulation AI model called Cosmos-Predict1, but it was not specialized for image or video generation and significantly lagged behind competitors in both performance and efficiency.

gantt
    title Image Generative AI Roadmap
    dateFormat YYYY-MM-DD
    tickInterval 12month
    axisFormat %Y

    section Black Forest Labs
        Flux.1 : done, 2024-08-01, 2026-05-01

    section Alibaba
        Wan2.1 : 2025-02-25, 2026-05-01
        Wan2.2 : 2025-08-07, 2026-05-01
        Qwen-Image : 2025-08-04, 2026-05-01

    section NVIDIA
        Cosmos-Predict1 : done, 2025-01-06, 2026-05-01
        Cosmos-Predict2 : 2025-04-30, 2026-05-01
        Cosmos-Predict2.5 : 2025-10-06, 2026-05-01

    section CircleStone Labs
        Anima Preview 1 : 2026-01-26, 2026-05-01
        Anima Preview 2 : 2026-03-11, 2026-05-01
        Anima Preview 3-base : 2026-04-08, 2026-05-01

Cosmos-Predict1 Model Composition

Cosmos-Predict1 Developer Model License
Text Encoder Google T5-XXL-v1.0 Apache-2.0
Transformer NVIDIA Cosmos-Predict1 NVIDIA Open Model License
VAE NVIDIA Cosmos-Tokenize1 NVIDIA Open Model License

Cosmos-Predict2 Model Composition

Cosmos-Predict2 Developer Model License
Text Encoder Google T5-XXL-v1.0 Apache-2.0
Transformer NVIDIA Cosmos-Predict2 NVIDIA Open Model License
VAE Alibaba Wan_vae Apache-2.0

In April 2025, NVIDIA developed Cosmos-Predict2 by adopting Wan_vae and retraining the model from scratch. In essence, it can be described as the NVIDIA version of Wan.

Of course, it wasn’t built entirely from scratch—some datasets and parts of the architecture were likely carried over from Cosmos-Predict1.

Cosmos-Predict2 Comes with Strict Licensing

While Alibaba’s Wan uses the permissive Apache-2.0 license, Cosmos-Predict2 switched to NVIDIA’s proprietary NVIDIA Open Model License.

Key Features of the NVIDIA Open Model License:

  • NVIDIA does not claim rights to the outputs.
  • Developers of derivative models can claim rights to their own additions, provided they acknowledge NVIDIA’s ownership of the base components.
  • If a third party makes a claim against NVIDIA related to a derivative model, the derivative model developer must bear the costs.
  • If the NVIDIA Open Model License is revised, the new license applies retroactively to existing models.

The license explicitly states that NVIDIA retains ownership of the parts it developed, even in derivative models.

If legal claims arise due to a derivative model, the developer is obligated to cover NVIDIA’s expenses in NVIDIA’s place.

An anime illustration of a purple-haired woman in a lab coat with a wry smile inside a golden futuristic data center
Derivative model developers become NVIDIA’s shield.

Furthermore, even existing models must comply with any future revisions to the license. Failure to do so may require the model to be taken down.

Cosmos-Predict2 can be described as a very strict license (essentially terms of use) that forces derivative developers to act as NVIDIA’s shield.

Anima Was Fine-Tuned on Millions of Anime Illustrations

Let’s look again at Anima’s structure.

Anima Developer Model License
Text Encoder Alibaba Qwen3-0.6B-base Apache-2.0
Transformer CircleStone Labs Cosmos-Predict2-2B
(Fine-tuned on anime illustrations)
NVIDIA Open Model License
CircleStone Labs Non-Commercial License
VAE Alibaba Qwen-Image_vae
(Wan_vae)
Apache-2.0

The Anima model replaces the text encoder in NVIDIA’s Cosmos-Predict2-2B with the lightweight Qwen3-0.6B-Base.

The transformer uses the Cosmos-Predict2-2B model as-is but has been further fine-tuned on millions of anime illustrations.

The VAE has been changed to Qwen-Image_vae, the still-image version of Wan_vae. Since Qwen-Image_vae maintains compatibility with Wan_vae, structural consistency with Cosmos-Predict2-4B is preserved.

Anima Uses a Dual License from NVIDIA and CircleStone Labs

The text encoder and VAE of the Anima model are under Alibaba’s Apache-2.0 license.

However, the transformer part inherits the NVIDIA Open Model License and adds CircleStone Labs’ own restrictions under the CircleStone Labs Non-Commercial License v1.0.

CircleStone Labs Non-Commercial License v1.0:

  • Commercial use of the model itself is prohibited.
  • Commercial use of the outputs is allowed.
  • Using the outputs to train other models is prohibited.

This license is similar to Black Forest Labs’ FLUX [dev] Non-Commercial License. It was likely modeled after it.

Both prohibit commercial use of the model while allowing commercial use of the generated outputs. However, even with FLUX [dev], opinions are divided on whether outputs can truly be used commercially (many interpret it negatively).

Therefore, at this point, it is safest to avoid commercial use of the Anima model.

Why Choose Cosmos-Predict2 Instead of Wan?

Here’s where my questions deepen.

As explained earlier, Anima is subject to dual licensing restrictions. If CircleStone Labs had chosen Alibaba’s Wan model—which offers comparable performance—instead of NVIDIA’s Cosmos-Predict2, they would not have been bound by NVIDIA’s restrictive license.

An anime illustration of a purple-haired woman in a lab coat shown in profile inside a golden futuristic data center
Why didn’t they choose Wan, which has similar performance?

Why would CircleStone Labs choose Cosmos-Predict2 as the base model, even if it meant taking on the obligation to indemnify NVIDIA against damages and accepting the risk of the model being taken down?

Anime Illustration Has a Strong Local/Offline Orientation

Anime-style illustrations are popular not only in East Asia but around the world.

Among image generation AI fields, anime has particularly high demand for NSFW content, making it a domain that strongly prefers local/offline usage rather than online services.

Because of its unique characteristics, active technical development continues, resulting in high-performance community models such as Illustrious-XL and Animagine-XL that far exceed what one would expect from the original base models.

ComfyUI Dominates Next-Generation Models

Currently, the most widely used tools for anime illustration are derivative models based on Stable Diffusion XL, with ownership held by Stability AI and the respective derivative developers.

An anime illustration of a purple-haired woman in a lab coat looking at monitors inside a golden futuristic data center
SDXL derivatives belongs to Stability AI and the derivative developers.

In contrast, for next-generation models such as Flux and Z-Image, ComfyUI has established a near-monopoly as the go-to generation tool.

Why ComfyUI is Dominant

If NVIDIA can popularize an anime model under its license to replace SDXL through ComfyUI, it would allow NVIDIA to exert overwhelming dominance even in the local development environment, which is normally difficult to control.

Anima Is Strongly Backed by Comfy Org

The development of the Anima model has received strong support from Comfy Org.

In January 2026, Comfy Org announced that it provided $1 million in funding to CircleStone Labs as part of its open-source model development support program.

Around the same time, on January 24, 2026, Comfy Org announced a 30% price reduction for Comfy Cloud, citing improved access to GPU compute power at lower costs.

An anime illustration of a purple-haired woman in a lab coat turning around to look at the viewer inside a golden futuristic data center
NVIDIA aims to dominate not only the cloud but also local environments!

For cloud AI service providers including Comfy Cloud, GPU rental costs directly impact their business. The allocation of GPU resources naturally involves the influence of NVIDIA, which supplies over 90% of industrial-grade GPUs worldwide.

Although there is no official document linking the Anima model directly to NVIDIA, I suspect that the decision to use Cosmos-Predict2 was not made by CircleStone Labs alone, but was likely predetermined from the beginning.

Summary: Open Source and NVIDIA’s Ambition

  • Anima is an image generation AI whose roots trace back to Wan2.1.
  • It is subject to NVIDIA’s strict licensing restrictions.
  • It is safest to avoid commercial use for now.

In this article, I examined the licensing of the Anima model.

When the excellent open-source video generation AI called Wan was released, NVIDIA quickly abandoned its previous assets, adopted the new technology, leveraged its overwhelming computing power to rapidly create a model of equal or greater performance, and then deployed it along with restrictive licensing to dominate the ecosystem. This move truly demonstrates the dignity of a champion.

An anime illustration of a purple-haired woman in a lab coat looking directly at the viewer inside a golden futuristic data center
Will excellent models spread even under strict licensing?

For now, I have no plans to develop using the high-risk Anima model. However, given the immense power of NVIDIA and Comfy Org, I may not be able to say the same in six months.

I will continue to observe closely to understand what “open source” truly means to NVIDIA and Comfy Org.

Thank you for reading until the end!