gantt
title HiDream.ai models
dateFormat YYYY-MM-DD
tickInterval 12month
axisFormat %Y
HiDream-I1-Full : 2025-04-06, 2026-05-15
HiDream-I1-Dev : 2025-04-06, 2026-05-15
HiDream-I1-Fast : 2025-04-06, 2026-05-15
HiDream-E1-Full : 2025-04-27, 2026-05-15
HiDream-E1.1 : 2025-07-16, 2026-05-15
HiDream-O1-Image : crit, 2026-05-08, 2026-05-15
flowchart TB
subgraph Checkpoint
A1(Text Encoder)
B1(Unet / Transformer)
C1(VAE)
end
flowchart TB
B1(Text Encoder)
subgraph latent [Latent space]
C1(Unet / Transformer)
end
D1(VAE)
A1(User)
A1--Draw a picture-->B1
B1--[1, 0, 128, 2, 4, 6, 0, 2]-->C1
C1--[4, 2, 0, 64, 8, 5, 7, 4]-->D1
D1--Done-->A1
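The data flow in the diagram above can be sketched in a few lines of Python. This is a toy illustration only, not how any real model is implemented: `Checkpoint`, `toy_text_encoder`, `toy_denoiser`, `toy_vae_decode`, and `generate` are hypothetical names invented here, and the arithmetic inside them is a placeholder for real neural networks.

```python
from dataclasses import dataclass
from typing import Callable, List

Vector = List[int]


@dataclass
class Checkpoint:
    """Hypothetical bundle of the three components shown in the diagram."""
    text_encoder: Callable[[str], Vector]
    denoiser: Callable[[Vector], Vector]  # stands in for the UNet / Transformer
    vae_decode: Callable[[Vector], str]


def toy_text_encoder(prompt: str) -> Vector:
    # Placeholder: turn the first 8 characters into integer "embeddings".
    return [ord(ch) % 256 for ch in prompt][:8]


def toy_denoiser(conditioning: Vector) -> Vector:
    # Placeholder: produce a latent vector with the same length as the input.
    return [(value * 3 + 1) % 256 for value in conditioning]


def toy_vae_decode(latent: Vector) -> str:
    # Placeholder: a real VAE would turn the latent into pixels.
    return f"<image decoded from {len(latent)} latent values>"


def generate(ckpt: Checkpoint, prompt: str) -> str:
    conditioning = ckpt.text_encoder(prompt)  # text -> numbers
    latent = ckpt.denoiser(conditioning)      # numbers -> latent (latent space)
    return ckpt.vae_decode(latent)            # latent -> image


checkpoint = Checkpoint(toy_text_encoder, toy_denoiser, toy_vae_decode)
print(generate(checkpoint, "Draw a picture"))
```

The point of the sketch is only the ordering: the user's text never reaches the denoiser directly, and the denoiser's output is not an image until the VAE decodes it.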
flowchart TB
A1(User)
B1(HiDream-O1-Image)
A1--Draw a picture-->B1
B1--Done-->A1
gantt
title Image Generative AI Roadmap
dateFormat YYYY-MM-DD
tickInterval 12month
axisFormat %Y
section Stability AI
Stable Diffusion 1 : 2022-08-22, 2026-05-15
Stable Diffusion XL : 2023-07-26, 2026-05-15
Stable Diffusion 3 : 2024-06-12, 2026-05-15
section Fal.ai
AuraFlow : 2024-07-12, 2026-05-15
section Black Forest Labs
Flux.1 : 2024-08-01, 2026-05-15
Flux.2 : 2025-11-25, 2026-05-15
section DeepSeek.ai
Janus-Pro : 2025-01-25, 2026-05-15
section Zhipu AI
CogVideoX : 2024-08-06, 2026-05-15
GLM-Image : 2026-01-12, 2026-05-15
section Rhymes AI
Allegro : 2024-10-22, 2026-05-15
section Genmo
Mochi : 2024-10-25, 2026-05-15
section Tencent
Hunyuan video : 2024-12-03, 2026-05-15
Hunyuan image : 2025-09-09, 2026-05-15
section lllyasviel
Framepack : 2025-04-17, 2026-05-15
section Lightricks
LTX : 2024-12-11, 2026-05-15
section StepFun
Step-Video-T2V : 2025-02-17, 2026-05-15
section Alibaba
Wan : 2025-02-25, 2026-05-15
Qwen-Image : 2025-08-04, 2026-05-15
Z-Image : 2025-11-25, 2026-05-15
section NVIDIA
Cosmos-Predict2 : 2025-04-30, 2026-05-15
section CircleStone Labs
Anima : 2026-01-26, 2026-05-15
section Baidu
ERNIE-Image : 2026-04-07, 2026-05-15
section HiDream.ai
HiDream-I1 : 2025-04-06, 2026-05-15
HiDream-O1-Image : crit, 2026-05-08, 2026-05-15
| Developer | Model | Text encoder | Encoder developer |
|---|---|---|---|
| CLIP generation (through 2023) | | | |
| Stability AI | Stable Diffusion 1.x | CLIP-L (0.1B) | OpenAI |
| Stability AI | Stable Diffusion XL | CLIP-L (0.1B), OpenCLIP-G (0.7B) | OpenAI, LAION |
| T5 generation (2024 onward) | | | |
| Stability AI | Stable Diffusion 3 | CLIP-L (0.1B), OpenCLIP-G (0.7B), T5-XXL-v1.1 (11B) | OpenAI, LAION |
| Fal.ai | AuraFlow | pile-T5-XL (3B) | EleutherAI / Google |
| Black Forest Labs | Flux.1 [schnell / dev] | CLIP-L (0.1B), T5-XXL-v1.1 (11B) | OpenAI |
| DeepSeek | Janus-Pro | SigLIP-L (0.4B), DeepSeek-LLM (7B) | Google, DeepSeek |
| Zhipu AI | CogVideoX | T5-XXL (11B) | |
| Zhipu AI | GLM-Image | GLM-4-9B (9B), Glyph Encoder | Zhipu AI |
| Genmo | Mochi | T5-XXL-v1.1 (11B) | |
| Rhymes AI | Allegro | T5-XXL (11B) | |
| Lightricks | LTX-Video | T5-XXL-v1.1 (11B) | |
| NVIDIA | Cosmos-Predict2 | T5-XXL (11B) | |
| Custom LLM generation (2025 onward) | | | |
| HiDream-ai | HiDream-I1 | CLIP-L (0.1B), OpenCLIP-G (0.7B), T5-XXL-v1.1 (11B), Llama-3.1-Instruct (8B) | OpenAI, LAION, Meta |
| Tencent | Hunyuan Video | LLaVA-LLaMA-3 (8B), CLIP-L (0.1B) | Xtuner / Meta, OpenAI |
| Tencent | Hunyuan Image | In-house MLLM | Tencent |
| StepFun | Step-Video-T2V | Hunyuan-CLIP, Step-LLM | Tencent, StepFun |
| Alibaba | Wan (2.1 / 2.2) | UMT5-XXL (13B) | |
| Alibaba | Qwen-Image | Qwen2.5-VL (7B) | Alibaba |
| Alibaba | Z-Image | Qwen3 (4B) | Alibaba |
| Black Forest Labs | Flux.2 [dev] | Mistral-small-3.2 / Pixtral (24B) | Mistral AI |
| Black Forest Labs | Flux.2 [klein] 9B | Qwen3 (8B) | Alibaba |
| Black Forest Labs | Flux.2 [klein] 4B | Qwen3 (4B) | Alibaba |
| CircleStone Labs | Anima | Qwen3-Base (0.6B) | Alibaba |
| Baidu | ERNIE-Image | Mistral3 / Pixtral (3.3B) | Mistral AI |
| HiDream-ai | HiDream-O1-Image | Qwen3-VL (8B) | Alibaba |