Number of parameters
63M, as in NCSN (Yang et al.)
Iteration
100k
GPU count, training time
2 days on 8 GPUs for EDM
LR
Optimizer
batch size
from EDM:
from U-ViT (https://arxiv.org/pdf/2209.12152.pdf):