Parameter number 63M as NCSN, yang, et.al.
Iteration 100k
GPU number,training time 2day8GPU for EDM
LR
Optimizer
batch size

from EDM:

Untitled

from U-VIT, https://arxiv.org/pdf/2209.12152.pdf:

Untitled

Untitled