diff --git a/_images/img_001.png b/_images/img_001.png
new file mode 100755
index 00000000..c6135ca4
Binary files /dev/null and b/_images/img_001.png differ
diff --git a/_sources/docs/review/BBDM.md b/_sources/docs/review/BBDM.md
index 373fd736..ff31572a 100755
--- a/_sources/docs/review/BBDM.md
+++ b/_sources/docs/review/BBDM.md
@@ -31,16 +31,17 @@
- **Brownian Motion Process (Wiener Process) 소개**
- **Brownian Motion**
- 유체의 미소입자가 불규칙하게 운동하는 현상
-
- :::{figure-md}
+
+ :::{figure-md}
+
굴뚝에서 퍼져나간 연기 사진을 오른쪽으로 90도 회전시킨 사진
:::
- **Brownian Motion Process (Wiener Process)**
- Brownian Motion 을 연속 시간 확률 과정으로 모델링한 것
- :::{figure-md}
+ :::{figure-md}
$W_0$ = 0 이고 max time T=1000 인 Wiener Process 를 100번 Sampling 한 결과
@@ -105,12 +106,11 @@
Source : [https://www.youtube.com/watch?v=ld0rxwAJpkM&ab_channel=finRGB](https://www.youtube.com/watch?v=ld0rxwAJpkM&ab_channel=finRGB)
:::
- 파란색 점들은, Brownian Motion Process 를 진행한 특정한 경우
- (one representation) 를 나타냄
+ 파란색 점들은, Brownian Motion Process 를 진행한 특정한 경우 (one representation) 를 나타냄
보라색 점처럼, W_T 는 확률에 의해 여러 경우의 수가 존재할 수 있음
- :::{figure-md}
-
+ :::{figure-md}
+
Source : [https://www.youtube.com/watch?v=ld0rxwAJpkM&ab_channel=finRGB](https://www.youtube.com/watch?v=ld0rxwAJpkM&ab_channel=finRGB)
:::
@@ -127,11 +127,11 @@
0
+ :::{figure-md}
+
- Source : [https://sine-qua-none.tistory.com/158](https://sine-qua-none.tistory.com/158)
- :::
+ Source : [https://sine-qua-none.tistory.com/158](https://sine-qua-none.tistory.com/158)
+ :::
- **가장 간단한 Bridge 는, 선형으로 연결된 Bridge 일 것**
- 위의 Bridge 는 다음과 같이 표현할 수 있다.
@@ -159,11 +159,11 @@
- Linear Bridge 의 우변에 $W(t)−{t\over T}W(T)$ 를 더해보자.
$W$ 는 $Z$ 와 독립인 새로운 Wiener Process 이다.
- :::{figure-md}
-
+ :::{figure-md}
+
- Source : [https://sine-qua-none.tistory.com/158](https://sine-qua-none.tistory.com/158)
- :::
+ Source : [https://sine-qua-none.tistory.com/158](https://sine-qua-none.tistory.com/158)
+ :::
- 위 식에는
t = 0 을 대입해도 0 이 나오고,
@@ -195,6 +195,7 @@
:::{figure-md}
+
:::
따라서,
@@ -204,7 +205,7 @@
이므로, $B(t)$ 는 Wiener Process 이다.
:::{figure-md}
-
+
$W_0$ = 0 에서 $W_1000$ = 123 까지 100개의 Brownian Bridge 를 샘플링한 결과
:::
@@ -215,11 +216,11 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
$B(t) = Z(T_0) + {(t - T_0)\over (T - T_0)}(Z(T)-Z(T_0)) + W(t-T_0) - {(t - T_0)\over (T - T_0)}W(T - T_0)$
- 아래 그림 참고
- :::{figure-md}
-
+:::{figure-md}
+
- Source : [https://sine-qua-none.tistory.com/158](https://sine-qua-none.tistory.com/158)
- :::
+Source : [https://sine-qua-none.tistory.com/158](https://sine-qua-none.tistory.com/158)
+:::
- **Abstrcat**
@@ -235,7 +236,7 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
- **BBDM** 은 Conditional generation process 가 아닌
**Stochastic Brownian Bridge Process** 로 두 도메인 사이의 변환을 모델링하므로, **Bidirectional Diffusion Process** 임.
- Brownian Bridge diffusion process 를 Image-to-Image 변환에 접목한 최초의 논문임
- - BBDM 모델의 훌륭한 성능을 실험적으로 증명함
+ - BBDM 모델의 훌륭한 성능을 실험적으로 증명함
1. **Introduction**
- I2I 변환에서 **Non-diffusion models 의 한계**
- Pix2Pix 와 같은 **conditional GANs** 는 **fideltiy 가 높았으나,**
@@ -255,28 +256,31 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
- **LDM** 의 경우, **복잡한 attention mechanism 으로 multi-modal condition** 이 주어지므로, **이론적 근거를 제시하기가 더 힘듦**
- **본 논문에서 제안하는 BBDM 모델**
- :::{figure-md}
+ :::{figure-md}
+
:::
- **BBDM** 모델은 **input 과 output 도메인 간의 mapping** 을
**Brownian Bridge stochastic process 를 통해 구축**함
- - 가속을 위해 Latent space 에서 diffusion process 를 수행함
+ - 가속을 위해 Latent space 에서 diffusion process 를 수행함
1. **Related Work**
- **2.1. Image-to-Image Translation**
- introduction 참고
- **2,2. Duffusion Models**
- **Diffusion Models** 의 simplified **objective** 는 다음과 같음
- :::{figure-md}
-
- :::
+ :::{figure-md}
+
+
+ :::
- 대부분의 **conditional Diffusion Models** 는 **condition 을 objective 에 직접 “주입”**
- :::{figure-md}
-
- :::
+ :::{figure-md}
+
+
+ :::
- $p(x_t|y)$ 가 objective 에 드러나 있지 않으므로,
**desired conditional distribution 에 도달할 수 있을 것**이라는 **이론적 보장이 없음**
@@ -287,15 +291,15 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
:::{figure-md}
+
:::
- $T_0 ≤ t ≤ T$ 일 때,
$B(t) = Z(T_0) + {(t - T_0)\over (T - T_0)}(Z(T)-Z(T_0)) + W(t-T_0) - {(t - T_0)\over (T - T_0)}W(T - T_0)$
이었다.
- ****
- $T_0 = 0, Z(t) = x_t$ 로 바꿔보자.
- - $B(t) = x_0 - {t\over T}x_0 + {t\over T}x_T + W(t) - {t\over T}W(T)$ 가 된다.
+ - $B(t) = x_0 - {t\over T}x_0 + {t\over T}x_T + W(t) - {t\over T}W(T)$ 가 된다.
2. **Method**
- **3.1. Brownian Bridge Diffusion Model (BBDM)**
- **Forward diffusion process**
@@ -303,11 +307,12 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
- **BBDM : conditional input y 자체를 향해 Brownian Bridge process 진행**
- VQGAN 의 latent space 에서 diffusion process 를 수행
- **x** 가 **A 도메인 영상의 latent features** 이고,
- **y** 가 **B 도메인 영상의 latent features** 일 때,
+ **y** 가 **B 도메인 영상의 latent features** 일 때,
**Forward diffusion process 는 다음과 같이 정의**됨
:::{figure-md}
+
:::
- **T** 는 diffusion process 의 **total steps** 이다.
@@ -321,6 +326,7 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
:::{figure-md}
+
:::
- 만약 t 는 양의 정수의 discrete time 이고, 그 최댓값인 T=1000 이라면
@@ -328,6 +334,7 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
:::{figure-md}
+
:::
- diffusion process 가 시작하는 **t = 0 에서는, $m_0$ = 0** 이고,
@@ -346,6 +353,7 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
:::{figure-md}
+
:::
- 이 논문에서 **s 의 디폴트 값은 1**
@@ -354,17 +362,20 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
- **training 과 inference process 를 위해**서는 **forward transition probability** 인 $q_{BB}(x_t|x_{t-1}, y)$ 를 알아야함
- **식 (4) 에 의해, $x_0$ 와 $y$ 가 주어졌을 때의 $x_t$ 와** $x_{t-1}$ 은 다음과 같이 쓸 수 있음
- :::{figure-md}
-
- :::
-
- :::{figure-md}
-
- :::
-
- :::{figure-md}
-
- :::
+ :::{figure-md}
+
+
+ :::
+
+ :::{figure-md}
+
+
+ :::
+
+ :::{figure-md}
+
+
+ :::
- 참고. 위 식 (7) 의 $m_ty$ 는 $m_{t-1}y$ 로 쓰는 것이 옳음
- **식 (6) 의 $x_0$ 를 식 (7) 의 $x_0$ 로 대체**하면,
@@ -372,6 +383,7 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
:::{figure-md}
+
:::
- 증명
@@ -384,6 +396,7 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
:::{figure-md}
+
:::
- 식(8) 에 의해, t=T 가 될 때 $m_T = 1$, $x_T = y$ 임.
@@ -400,6 +413,7 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
:::{figure-md}
+
:::
- $\mu_\theta (x_t,t)$ 는 U-Net 에 의해 예측된 노이즈 평균값이며, $\tilde{\delta_t}$ 는 노이즈의 분산
@@ -411,48 +425,55 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
:::{figure-md}
+
:::
- - **Diffusion Models** 의 simplified **objective** 는 다음과 같음
+ - **Diffusion Models** 의 simplified **objective** 는 다음과 같음
- :::{figure-md}
+ :::{figure-md}
+
:::
- **Brownian Bridge diffusion process** 의 **ELBO**
:::{figure-md}
+
:::
- **첫 번째 term :** $x_T$ 가 곧 y 이므로 무시할 수 있음
- **세 번째 term** : 매우 작은 값이 되므로 무시할 수 있음
- **베이즈 이론과 Markov chain property 를 식 (4) 와 식 (8) 에 적용**하여,
다음과 같이 **식 (11) 이 도출**된다.
- - 참고. Markovian Chain
- - $q(x_t|x_{t-1}) = q(x_t|x_{t-1}, x_{t-2}, … , x_0)$
- - Markov chain property 에 의해,
- $q_{BB}(x_t|x_{t-1},y) = q_{BB}(x_t|x_{t-1},x_0,y)$ 가 성립됨을 활용
- - 식(4)
+ - 참고. Markovian Chain
+ - $q(x_t|x_{t-1}) = q(x_t|x_{t-1}, x_{t-2}, … , x_0)$
+ - Markov chain property 에 의해,
+ $q_{BB}(x_t|x_{t-1},y) = q_{BB}(x_t|x_{t-1},x_0,y)$ 가 성립됨을 활용
+ - 식(4)
:::{figure-md}
+
:::
- - 식(8)
+ - 식(8)
:::{figure-md}
+
:::
- - 식(11) & 식(13)
+ - 식(11) & 식(13)
:::{figure-md}
+
:::
:::{figure-md}
+
:::
- 증명
@@ -463,25 +484,26 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
- $= {q_{BB}(x_{t},x_{t-1},x_{0},y)\over q_{BB}(x_{t},x_{0},y)}$
- $= q_{BB}(x_{t-1}|x_{t},x_{0},y)$
- ---
-
- 위 식 (11) 의 평균은, 식 (12) 와 같이 정리됨
- :::{figure-md}
-
- :::
+ :::{figure-md}
+
+
+ :::
- 식(4) 와 식(12) 를 통합하고Reparameterization method 를 사용해서
$\tilde {\mu_t}$ 를 다음과 같이 변형할 수 있음
:::{figure-md}
+
:::
- 참고. 식(4)
:::{figure-md}
+
:::
@@ -494,16 +516,19 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
:::{figure-md}
+
:::
:::{figure-md}
+
:::
- $\epsilon_\theta (x_t,t)$ 는 $m_t(y-x_0)+\sqrt {\delta_t}\epsilon$ 을 근사하도록 학습되어야겠네 !
:::{figure-md}
+
:::
- ELBO 의 두 번째 term 을 다시 살펴보면,
@@ -511,10 +536,12 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
:::{figure-md}
+
:::
:::{figure-md}
+
:::
- $arg \space min_\theta \space D_{KL}(q_{BB}(x_{t-1}|x_t, x_0, y)||p_\theta (x_{t-1}|x_t,y))$
@@ -523,16 +550,18 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
=$arg \space min_\theta \space (c_{\epsilon_t} (m_t(y-x_0) + \sqrt {\delta_t}\epsilon - \epsilon_\theta(x_t,t)))$
- 따라서, ELBO 는 다음과 같이 단순화될 수 있음
- :::{figure-md}
-
- :::
+ :::{figure-md}
+
+
+ :::
- **Training Algorithm 정리**
- :::{figure-md}
-
- :::
+ :::{figure-md}
+
+
+ :::
- **3.2. Accelerated Sampling Processes**
@@ -544,18 +573,21 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
:::{figure-md}
+
:::
- **inference process**
:::{figure-md}
+
:::
- **Sampling Algorithm**
:::{figure-md}
+
:::
- 본 논문에서는 **S 값의 디폴트**를 **200** 으로 두었음
@@ -585,14 +617,17 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
:::{figure-md}
+
:::
:::{figure-md}
+
:::
:::{figure-md}
+
:::
- Pix2Pix 는 지도 학습 방식으로 학습하므로, 괜찮은 결과를 냄
@@ -612,22 +647,25 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
- **4.3. Quantitative Comparison**
- Table 1 과 2 를 보면, BBDM 이 모든 실험에서 가장 좋은 FID 값을 기록했으며, 훌륭한 LPIPS 값을 기록함
- :::{figure-md}
-
- :::
-
- :::{figure-md}
-
- :::
+ :::{figure-md}
+
+
+ :::
+
+ :::{figure-md}
+
+
+ :::
- **4.4. 다른 Translation Tasks**
- **BBDM 의 generalization 성능을 검증**하기 위해서, 다른 tasks 에 대해서도 실험했음
- 아래 그림과 같이, **다른 tasks 에서도 camparable 한 성능을 기**록함
- :::{figure-md}
-
- :::
+ :::{figure-md}
+
+
+ :::
- **4.5. Ablation Study**
@@ -635,6 +673,7 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
:::{figure-md}
+
:::
- **BBDM 과 LDM** 에 대해서,
@@ -644,6 +683,7 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
:::{figure-md}
+
:::
- **Sampling steps 가 작을 때 (200 이하) 는, 조금만 늘려도 성능이 크게 증가**
@@ -651,6 +691,7 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
:::{figure-md}
+
:::
- 식 (5) 에 나타난 것처럼, **scaling factor s 의 값을 변경**함으로써,
@@ -659,6 +700,7 @@ $W_0 ≠ 0$ 인 **두 점 사이의 Brownian Bridge 를 만들 때는?**
:::{figure-md}
+
:::
1. **Conclusion and Future Work**
diff --git a/_sources/docs/review/Your_Diffusion_Model_is_Secretly_a_Zero_Shot_Classifier.md b/_sources/docs/review/Your_Diffusion_Model_is_Secretly_a_Zero_Shot_Classifier.md
new file mode 100755
index 00000000..2022e57a
--- /dev/null
+++ b/_sources/docs/review/Your_Diffusion_Model_is_Secretly_a_Zero_Shot_Classifier.md
@@ -0,0 +1,256 @@
+``` {admonition} Information
+- **Title:** {Your Diffusion Model is Secretly a Zero-Shot Classifier}, {ICCV 2023}
+
+- **Reference**
+ - Paper: [https://arxiv.org/pdf/2303.16203.pdf](https://arxiv.org/pdf/2303.16203.pdf)
+ - Github io: [https://diffusion-classifier.github.io/](https://diffusion-classifier.github.io/)
+ - Code: [https://github.com/diffusion-classifier/diffusion-classifier](https://github.com/diffusion-classifier/diffusion-classifier)
+
+- **Author:** SeonHoon Kim
+- **Edited by:** SeonHoon Kim
+
+- **Last updated on Nov. 09, 2023**
+```
+
+# YDMSZC 발표 자료
+
+- **핵심**
+ - 학습된 **Diffusion Models 에서 Classifier 를 추가 학습 없이 획득**할 수 있다.
+ - **Stable Diffusion** 같은 거대 모델로부터 **Zero-shot classifier** 를 얻을 수 있다.
+ - **Class-conditional Diffusion Models** 에서는 **일반적인 (non Zero-shot) classifier** 를 얻을 수 있다.
+- **결과 요약**
+ - **Classification 성능이 나쁘지 않았다.**
+ - **Zero-shot classifier 는 Multimodal Compositional reasoning ability 가 매우 훌륭**했다.
+ - 이렇게 Diffusion 모델에서 추출된 Classifiers 는 **Distribution shift 에 대해 Robust** 한 성능을 보여주었다.
+
+- **Classifier 구현 방법**
+
+:::{figure-md}
+
+
+:::
+
+ - **예시로 먼저 살펴보기.**
+ - 예를 들어, 어떤 동물 이미지 X 를 Stable Diffusion 으로 Classification 하고 싶다면..
+ 1. 일단 해당 동물의 클래스를 포함하고 있을 만한 데이터셋을 구한다.
+ 37개의 동물 클래스가 존재하는 Pets 데이터셋을 사용한다고 치자.
+ 2. text prompts 로 “호랑이” 가 주어진 Stable Diffusion 으로, X 의 Noised Image 에서 Reverse process 를 진행한다. 그럼 Loss 를 획득할 수 있을 것이다.
+ 3. 37개의 모든 Pets Classes 에 대해서 이를 수행해서,
+ 가장 Loss 가 작은 Class 를 판별한다.
+ 이 Class 가 바로 이미지 X 의 클래스이다.
+
+:::{figure-md}
+
+
+:::
+
+ 1. `n_samples` 에 지정된 수 만큼 t 와 noise 를 각각 샘플링해 벡터를 만든다.
+ 2. 클래스 판별이 필요한 이미지 X 의 t-step Noised image 인 X_t 를 구한다.
+ 3. X_t 를 Diffusion Model 에 Input 으로 주어 Noise 를 출력한다.
+ 4. **loss** 를 구한다.
+ - 위 과정을, 여러 번 (`n_trials` 만큼) 시도해서 평균낼 수도 있다.
+ 5. loss 가 가장 낮은 Class 를 찾을 때 까지, 가능한 모든 Class 에 대해 추론한다.
+ 6. 최종 남은 Class 를 X 의 Class 라고 판정한다.
+ - Zero-shot classification 도 위와 동일한 과정으로 진행된다.
+ 다만 추론할 Class list 가 필요하다.
+ - 예를 들어서, Stable Diffusion 의 Zero-shot classification 을 수행하기 위해서는, (Stable Diffusion 이 학습하지는 않았지만) 37개의 클래스가 정의되어 있는
+ Pets 와 같은 데이터셋으로 Classification 을 수행할 수 있다.
+ - 하지만, Class 마다 n_samples 수 만큼 t 를 샘플링하고,
+ 또 X_t 를 구하고,
+ Diffusion Model 로 노이즈를 추론하고,
+ loss 를 구하는 것은 Inference times 가 많이 소모됨.
+ 따라서 다음의 방법을 활용해 inference times 을 줄인다.
+
+:::{figure-md}
+
+
+:::
+
+1. **일단 작은 수의 n_samples 로 error 가 높은 class 들을 걸러낸다.**
+2. **소수의 class 만 남았다면,
+이제는 정확한 추론을 위해서 더 큰 n_samples 를 설정해 추론한다.
+(large n_samples 로 t 와 $\epsilon$ 을 sampling 한다.)**
+- c.f.
+
+```markdown
+### Oxford-IIIT Pets
+```bash
+python eval_prob_adaptive.py --dataset pets --split test --n_trials 1 \
+ --to_keep **5 1** --n_samples **25 250** --loss l1 \
+ --prompt_path prompts/pets_prompts.csv
+```
+
+- 왜 이렇게까지 inference time 을 줄이려고 하지??
+ - 위의 스크립트 그대로 RTX 3090 에서 돌리면,
+ Pets 이미지 1장 Classification 하는데 18초 걸린다.
+ - ImageNet 은 Class 1,000 개 있는데,
+ 512x512 이미지 1장 Classification 하려면 1,000 초 걸린다.
+- **c.f. Loss 계산 코드 (eval_prob_adaptive.py)**
+
+```python
+**all_noise** = torch.randn((**max_n_samples * args.n_trials**, 4, latent_size, latent_size), device=latent.device)
+
+def eval_error(unet, scheduler, latent, all_noise, ts, noise_idxs,
+ text_embeds, text_embed_idxs, batch_size=32, dtype='float32', loss='l2'):
+ assert len(ts) == len(noise_idxs) == len(text_embed_idxs)
+ pred_errors = torch.zeros(len(ts), device='cpu')
+ idx = 0
+ with torch.inference_mode():
+ for _ in tqdm.trange(len(ts) // batch_size + int(len(ts) % batch_size != 0), leave=False):
+ batch_ts = torch.tensor(ts[idx: idx + batch_size])
+ **noise** = **all_noise**[noise_idxs[idx: idx + batch_size]]
+ noised_latent = latent * (scheduler.alphas_cumprod[batch_ts] ** 0.5).view(-1, 1, 1, 1).to(device) + \
+ noise * ((1 - scheduler.alphas_cumprod[batch_ts]) ** 0.5).view(-1, 1, 1, 1).to(device)
+ t_input = batch_ts.to(device).half() if dtype == 'float16' else batch_ts.to(device)
+ text_input = text_embeds[text_embed_idxs[idx: idx + batch_size]]
+ **noise_pred** = unet(noised_latent, t_input, encoder_hidden_states=text_input).sample
+ if loss == 'l2':
+ error = F.mse_loss(**noise**, **noise_pred**, reduction='none').mean(dim=(1, 2, 3))
+ elif loss == 'l1':
+ error = F.l1_loss(noise, noise_pred, reduction='none').mean(dim=(1, 2, 3))
+ elif loss == 'huber':
+ error = F.huber_loss(noise, noise_pred, reduction='none').mean(dim=(1, 2, 3))
+ else:
+ raise NotImplementedError
+ pred_errors[idx: idx + len(batch_ts)] = error.detach().cpu()
+ idx += len(batch_ts)
+ return pred_errors
+```
+
+
+- **실험 결과**
+ - **Figure 2**
+
+ :::{figure-md}
+
+
+ :::
+
+ - 특정한 이미지 x 의 모든 클래스에 대해서 loss 를 추론하게 될텐데,
+ **모든 클래스에 대해서
+ 동일한 $\epsilon$** (즉 sampled noise) **과 동일한 t** (즉 sampled time steps) **를 사용해야** 한다.
+ **이 두 변수에 따라 loss 가 크게 달라지기 때문.**
+
+- **Figure 3 & Figure 4**
+ - **Figure 3**
+ - t 에 따라서, Classification 성능이 달라졌다.
+ - **Figure 4**
+ - Figure 3 의 결과에 따라서,
+ intermediate timesteps 를 더 많이 sampling 하면 성능이 올라가는지 실험해보았다.
+ - 그렇지 않았다.
+ timesteps 를 Uniform 하게 sampling 했을 때 성능이 가장 좋았다.
+
+:::{figure-md}
+
+
+:::
+
+:::{figure-md}
+
+
+:::
+
+- **Table 1** (+ F. Additional Implementation Details 참고)
+
+:::{figure-md}
+
+
+:::
+
+- 본 논문에서 제시한 Diffusion Classifier 가 Classification 능력이 나쁘지 않았다.
+1. Diffusion 모델에서 knowledge 를 추출해내는 다른 방법들보다 성능이 뛰어났다.
+ - Diffusion Classifier 는 **Zero-shot 성능**이,
+ **“Stable Diffusion 으로 생성된 영상을“ 학습한** **ResNet-50** **classifier** 보다 뛰어났다.
+ - **Synthetic SD data :**
+ Class 마다 10,000 장의 이미지를 Stable Diffusion 2.0 으로 생성해
+ 데이터셋을 구축하고 (90% train / 10% validation),
+ 해당 데이터셋으로 ResNet-50 classifier 를 학습시켜서 classification 수행한 결과
+ - Diffusion Classifier 는 **Classification 성능**이,
+ **Stable Diffusion 의 intermediate U-Net layer 를 추출해 학습시킨
+ ResNet-based 모델**보다 뛰어났다.
+ - **SD features :**
+ Input 이미지에 따른 Stable Diffusion 의 Intermediate U-Net features 를
+ ResNet 기반의 classifier 에 전달해서 추론.
+ 이 때 classifier 는 모든 데이터셋을 직접 학습한다. 따라서 zero-shot 은 아니다.
+2. **CLIP ResNet-50 모델보다도 성능이 뛰어났다.**
+3. **OpenCLIP ViT-H/14 모델에 competitive** 했다. (비벼볼 만 했다.)
+
+- **Table 2**
+
+:::{figure-md}
+
+
+:::
+
+- **Stable Diffusion 은**
+Resolution 이 높은지, Aesthetic 한지, Safe-for-work 한지에 따라서 **filtered 된
+LAION-5B 데이터셋을 학습**했다.
+- 이와 같은 기준으로 filtering 하면, **CIFAR10, Pets, Flowers, STL10, ImageNet 데이터셋의 test set 은 97~100% 가 filtered out** 된다.
+- 따라서, **이들 데이터셋은 Stable Diffusion 에게 완전한 out-of-distribution 데이터**이다.
+- 따라서, **필터링이 안된 데이터로 Stable Diffusion 을 추가 학습시키면
+classification 성능도 올라갈 것**이다.
+
+- **Figure 5 & Table 3**
+
+:::{figure-md}
+
+
+:::
+
+:::{figure-md}
+
+
+:::
+
+- 본 논문에서는 Winoground 데이터셋을 활용해
+visio-linguistic compositional reasoning abilities 를 측정했다.
+ - 주어진 captions 를 적절한 이미지에 매치시키는 능력을 측정하는 것이다.
+ - Winoground 데이터셋
+ - Object 는 명사절끼리 뒤바뀐 경우
+ - Relation 은 동사끼리 or 형용사끼리 or 부사끼리 뒤바뀐 경우
+ - Both 는 다른 품사끼리 서로 뒤바뀐 경우
+- Stable Diffusion 의 Diffusion Classifier 가 최고의 성능을 보여주었다.
+- 본 논문에서 제시한 method 를 통해서 **추가 학습 없이,**
+여느 diffusion 모델처럼 sample generation 만을 학습했음에도,
+**Stable Diffusion 모델을 훌륭한 classifier 이자 reasoner 로 변모**시킬 수 있었다.
+
+- **Table 4**
+
+:::{figure-md}
+
+
+:::
+
+- ImageNet 에 존재하는 **1,000 개의 클래스를 활용해**
+Pretrained **DiT** (Diffusion Transformer) 를 활용한 **Diffusion Classifier 의 성능**을,
+**Discriminative Classifiers** (ResNet-101 and ViT-B/16) **와 비교**했다.
+- **ImageNet** 에 대해서, **79.1% 의 top-1 accuracy 를 기록하며 ViT-L/32 을 능가**했다.
+- **더 적은 augmentation 기법**을 사용하였고,
+**regularization 은 사용하지 않았음에도** Discriminative Classifiers 의 성능을 능가했다.
+
+- **Figure 6**
+
+:::{figure-md}
+
+
+:::
+
+- ImageNet 데이터셋에서,
+ImageNet-A 와 겹치는 클래스에 대해서만 Classification 을 수행한다.
+- 일반적인 **discriminative classifiers 는 신뢰구간 과 함께 파란 점**으로 찍혀 있다.
+- **Diffusion Classifiers 는 신뢰구간 과 함께 별 모양의 점**으로 찍혀 있다.
+- Diffusion Classifiers 는 In-distribution (ImageNet) 에서 획득한 Accuracy 에 따라
+기대되는 것보다,
+훨씬 Out-of-distribution (ImageNet-A) 에서의 성능이 뛰어났다.
+ - 즉, OOD 에 훨씬 Robust 하다.
+
+- 결론
+ - Diffusion Models 에서 **Diffusion Classifier 를 추출하는 방법을 제시**함
+ - Stable Diffusion 에서 추출한 **Diffusion Classifier 가 Zero-shot 능력이 우수함을 확인**
+ - DiT 에서 추출한 **Diffusion Classifier 가 Standard Classification 능력이 우수함을 확인**
+ - Diffusion Classifiers 의 **Compositional Reasoning 능력이 우수함을 확인**
+ - Diffusion Classifiers 가 **OOD 에 매우 Robust 함**
+ - **Filtering 되지 않은 데이터도 학습시킬 수 있다면,
+ Stable Diffusion 의 Diffusion Classifier 성능은 더 개선될 것**임.
+ - Imagen 의 경우 OpenCLIP 보다 훨씬 큰 거대 언어 모델인, T5-XXL 을 활용했음.
+ **Imagen 의 Classification 능력은 Stable Diffusion 보다 뛰어날 것으로 예상**됨.
\ No newline at end of file
diff --git a/docs/review/BBDM.html b/docs/review/BBDM.html
index f5a6bbd4..dfc33c6a 100755
--- a/docs/review/BBDM.html
+++ b/docs/review/BBDM.html
@@ -69,7 +69,7 @@
-
+
@@ -228,6 +228,7 @@
GLIDE
BBDM
+YDMSZC 발표 자료
Experiments
@@ -475,15 +476,21 @@ BBDM
-
+
-Fig. 295 Source : https://www.youtube.com/watch?v=ld0rxwAJpkM&ab_channel=finRGB
+Fig. 296 Source : https://www.youtube.com/watch?v=ld0rxwAJpkM&ab_channel=finRGB
-파란색 점들은, Brownian Motion Process 를 진행한 특정한 경우
-(one representation) 를 나타냄
+
파란색 점들은, Brownian Motion Process 를 진행한 특정한 경우 (one representation) 를 나타냄
보라색 점처럼, W_T 는 확률에 의해 여러 경우의 수가 존재할 수 있음
-
-
+
+
-Fig. 296 Source : https://www.youtube.com/watch?v=ld0rxwAJpkM&ab_channel=finRGB
+Fig. 297 Source : https://www.youtube.com/watch?v=ld0rxwAJpkM&ab_channel=finRGB
@@ -599,18 +605,19 @@ BBDM
Linear Bridge between Standard Wiener Process
-
+
+
-Fig. 297 Source : https://sine-qua-none.tistory.com/158
+Fig. 298 Source : https://sine-qua-none.tistory.com/158
-
+
가장 간단한 Bridge 는, 선형으로 연결된 Bridge 일 것
위의 Bridge 는 다음과 같이 표현할 수 있다.
@@ -643,16 +650,17 @@
BBDM
next
-
Synthetic Data with Stable Diffusion for Foliar Disease Classification
+
YDMSZC 발표 자료
diff --git a/docs/review/Your_Diffusion_Model_is_Secretly_a_Zero_Shot_Classifier.html b/docs/review/Your_Diffusion_Model_is_Secretly_a_Zero_Shot_Classifier.html
new file mode 100755
index 00000000..8d8613b3
--- /dev/null
+++ b/docs/review/Your_Diffusion_Model_is_Secretly_a_Zero_Shot_Classifier.html
@@ -0,0 +1,776 @@
+
+
+
+
+
+
+
+
+
+
+
+ YDMSZC 발표 자료 — Text-to-Image Generation-feat-Diffusion
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+ Skip to main content
+
+
+
+
+
+
+ Back to top
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+YDMSZC 발표 자료
+
+핵심
+
+
+결과 요약
+
+Classification 성능이 나쁘지 않았다.
+Zero-shot classifier 는 Multimodal Compositional reasoning ability 가 매우 훌륭 했다.
+이렇게 Diffusion 모델에서 추출된 Classifiers 는 Distribution shift 에 대해 Robust 한 성능을 보여주었다.
+
+
+Classifier 구현 방법
+
+ - **예시로 먼저 살펴보기.**
+ - 예를 들어, 어떤 동물 이미지 X 를 Stable Diffusion 으로 Classification 하고 싶다면..
+ 1. 일단 해당 동물의 클래스를 포함하고 있을 만한 데이터셋을 구한다.
+ 37개의 동물 클래스가 존재하는 Pets 데이터셋을 사용한다고 치자.
+ 2. text prompts 로 “호랑이” 가 주어진 Stable Diffusion 으로, X 의 Noised Image 에서 Reverse process 를 진행한다. 그럼 Loss 를 획득할 수 있을 것이다.
+ 3. 37개의 모든 Pets Classes 에 대해서 이를 수행해서,
+ 가장 Loss 가 작은 Class 를 판별한다.
+ 이 Class 가 바로 이미지 X 의 클래스이다.
+
+
+ 1. `n_samples` 에 지정된 수 만큼 t 와 noise 를 각각 샘플링해 벡터를 만든다.
+2. 클래스 판별이 필요한 이미지 X 의 t-step Noised image 인 X_t 를 구한다.
+3. X_t 를 Diffusion Model 에 Input 으로 주어 Noise 를 출력한다.
+4. **loss** 를 구한다.
+ - 위 과정을, 여러 번 (`n_trials` 만큼) 시도해서 평균낼 수도 있다.
+5. loss 가 가장 낮은 Class 를 찾을 때 까지, 가능한 모든 Class 에 대해 추론한다.
+6. 최종 남은 Class 를 X 의 Class 라고 판정한다.
+ - Zero-shot classification 도 위와 동일한 과정으로 진행된다.
+ 다만 추론할 Class list 가 필요하다.
+ - 예를 들어서, Stable Diffusion 의 Zero-shot classification 을 수행하기 위해서는, (Stable Diffusion 이 학습하지는 않았지만) 37개의 클래스가 정의되어 있는
+ Pets 와 같은 데이터셋으로 Classification 을 수행할 수 있다.
+- 하지만, Class 마다 n_samples 수 만큼 t 를 샘플링하고,
+또 X_t 를 구하고,
+Diffusion Model 로 노이즈를 추론하고,
+loss 를 구하는 것은 Inference times 가 많이 소모됨.
+따라서 다음의 방법을 활용해 inference times 을 줄인다.
+
+
+
+일단 작은 수의 n_samples 로 error 가 높은 class 들을 걸러낸다.
+소수의 class 만 남았다면,
+이제는 정확한 추론을 위해서 더 큰 n_samples 를 설정해 추론한다.
+(large n_samples 로 t 와 \(\epsilon\) 을 sampling 한다.)
+
+
+### Oxford-IIIT Pets
+```bash
+python eval_prob_adaptive.py --dataset pets --split test --n_trials 1 \
+ --to_keep **5 1** --n_samples **25 250** --loss l1 \
+ --prompt_path prompts/pets_prompts.csv
+
+
+
+왜 이렇게까지 inference time 을 줄이려고 하지??
+- 위의 스크립트 그대로 RTX 3090 에서 돌리면,
+Pets 이미지 1장 Classification 하는데 18초 걸린다.
+- ImageNet 은 Class 1,000 개 있는데,
+512x512 이미지 1장 Classification 하려면 1,000 초 걸린다.
+c.f. Loss 계산 코드 (eval_prob_adaptive.py)
+
+** all_noise ** = torch . randn (( ** max_n_samples * args . n_trials ** , 4 , latent_size , latent_size ), device = latent . device )
+
+def eval_error ( unet , scheduler , latent , all_noise , ts , noise_idxs ,
+ text_embeds , text_embed_idxs , batch_size = 32 , dtype = 'float32' , loss = 'l2' ):
+ assert len ( ts ) == len ( noise_idxs ) == len ( text_embed_idxs )
+ pred_errors = torch . zeros ( len ( ts ), device = 'cpu' )
+ idx = 0
+ with torch . inference_mode ():
+ for _ in tqdm . trange ( len ( ts ) // batch_size + int ( len ( ts ) % batch_size != 0 ), leave = False ):
+ batch_ts = torch . tensor ( ts [ idx : idx + batch_size ])
+ ** noise ** = ** all_noise ** [ noise_idxs [ idx : idx + batch_size ]]
+ noised_latent = latent * ( scheduler . alphas_cumprod [ batch_ts ] ** 0.5 ) . view ( - 1 , 1 , 1 , 1 ) . to ( device ) + \
+ noise * (( 1 - scheduler . alphas_cumprod [ batch_ts ]) ** 0.5 ) . view ( - 1 , 1 , 1 , 1 ) . to ( device )
+ t_input = batch_ts . to ( device ) . half () if dtype == 'float16' else batch_ts . to ( device )
+ text_input = text_embeds [ text_embed_idxs [ idx : idx + batch_size ]]
+ ** noise_pred ** = unet ( noised_latent , t_input , encoder_hidden_states = text_input ) . sample
+ if loss == 'l2' :
+ error = F . mse_loss ( ** noise ** , ** noise_pred ** , reduction = 'none' ) . mean ( dim = ( 1 , 2 , 3 ))
+ elif loss == 'l1' :
+ error = F . l1_loss ( noise , noise_pred , reduction = 'none' ) . mean ( dim = ( 1 , 2 , 3 ))
+ elif loss == 'huber' :
+ error = F . huber_loss ( noise , noise_pred , reduction = 'none' ) . mean ( dim = ( 1 , 2 , 3 ))
+ else :
+ raise NotImplementedError
+ pred_errors [ idx : idx + len ( batch_ts )] = error . detach () . cpu ()
+ idx += len ( batch_ts )
+ return pred_errors
+
+
+
+실험 결과
+
+
+Figure 3 & Figure 4
+
+Figure 3
+
+
+Figure 4
+
+Figure 3 의 결과에 따라서,
+intermediate timesteps 를 더 많이 sampling 하면 성능이 올라가는지 실험해보았다.
+그렇지 않았다.
+timesteps 를 Uniform 하게 sampling 했을 때 성능이 가장 좋았다.
+
+
+
+
+
+
+
+
+Diffusion 모델에서 knowledge 를 추출해내는 다른 방법들보다 성능이 뛰어났다.
+- Diffusion Classifier 는 Zero-shot 성능 이,
+“Stable Diffusion 으로 생성된 영상을“ 학습한 ResNet-50 classifier 보다 뛰어났다.
+- Synthetic SD data :
+Class 마다 10,000 장의 이미지를 Stable Diffusion 2.0 으로 생성해
+데이터셋을 구축하고 (90% train / 10% validation),
+해당 데이터셋으로 ResNet-50 classifier 를 학습시켜서 classification 수행한 결과
+- Diffusion Classifier 는 Classification 성능 이,
+Stable Diffusion 의 intermediate U-Net layer 를 추출해 학습시킨
+ResNet-based 모델 보다 뛰어났다.
+- SD features :
+Input 이미지에 따른 Stable Diffusion 의 Intermediate U-Net features 를
+ResNet 기반의 classifier 에 전달해서 추론.
+이 때 classifier 는 모든 데이터셋을 직접 학습한다. 따라서 zero-shot 은 아니다.
+CLIP ResNet-50 모델보다도 성능이 뛰어났다.
+OpenCLIP ViT-H/14 모델에 competitive 했다. (비벼볼 만 했다.)
+
+
+
+Stable Diffusion 은
+Resolution 이 높은지, Aesthetic 한지, Safe-for-work 한지에 따라서 filtered 된
+LAION-5B 데이터셋을 학습 했다.
+이와 같은 기준으로 filtering 하면, CIFAR10, Pets, Flowers, STL10, ImageNet 데이터셋의 test set 은 97~100% 가 filtered out 된다.
+따라서, 이들 데이터셋은 Stable Diffusion 에게 완전한 out-of-distribution 데이터 이다.
+따라서, 필터링이 안된 데이터로 Stable Diffusion 을 추가 학습시키면
+classification 성능도 올라갈 것 이다.
+Figure 5 & Table 3
+
+
+본 논문에서는 Winoground 데이터셋을 활용해
+visio-linguistic compositional reasoning abilities 를 측정했다.
+
+
+Stable Diffusion 의 Diffusion Classifier 가 최고의 성능을 보여주었다.
+본 논문에서 제시한 method 를 통해서 추가 학습 없이,
+여느 diffusion 모델처럼 sample generation 만을 학습했음에도,
+Stable Diffusion 모델을 훌륭한 classifier 이자 reasoner 로 변모 시킬 수 있었다.
+Table 4
+
+
+ImageNet 에 존재하는 1,000 개의 클래스를 활용해
+Pretrained DiT (Diffusion Transformer) 를 활용한 Diffusion Classifier 의 성능 을,
+Discriminative Classifiers (ResNet-101 and ViT-B/16) 와 비교 했다.
+ImageNet 에 대해서, 79.1% 의 top-1 accuracy 를 기록하며 ViT-L/32 을 능가 했다.
+더 적은 augmentation 기법 을 사용하였고,
+regularization 은 사용하지 않았음에도 Discriminative Classifiers 의 성능을 능가했다.
+Figure 6
+
+
+ImageNet 데이터셋에서,
+ImageNet-A 와 겹치는 클래스에 대해서만 Classification 을 수행한다.
+일반적인 discriminative classifiers 는 신뢰구간 과 함께 파란 점 으로 찍혀 있다.
+Diffusion Classifiers 는 신뢰구간 과 함께 별 모양의 점 으로 찍혀 있다.
+Diffusion Classifiers 는 In-distribution (ImageNet) 에서 획득한 Accuracy 에 따라
+기대되는 것보다,
+훨씬 Out-of-distribution (ImageNet-A) 에서의 성능이 뛰어났다.
+- 즉, OOD 에 훨씬 Robust 하다.
+결론
+
+Diffusion Models 에서 Diffusion Classifier 를 추출하는 방법을 제시 함
+Stable Diffusion 에서 추출한 Diffusion Classifier 가 Zero-shot 능력이 우수함을 확인
+DiT 에서 추출한 Diffusion Classifier 가 Standard Classification 능력이 우수함을 확인
+Diffusion Classifiers 의 Compositional Reasoning 능력이 우수함을 확인
+Diffusion Classifiers 가 OOD 에 매우 Robust 함
+Filtering 되지 않은 데이터도 학습시킬 수 있다면,
+Stable Diffusion 의 Diffusion Classifier 성능은 더 개선될 것 임.
+Imagen 의 경우 OpenCLIP 보다 훨씬 큰 거대 언어 모델인, T5-XXL 을 활용했음.
+Imagen 의 Classification 능력은 Stable Diffusion 보다 뛰어날 것으로 예상 됨.
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
\ No newline at end of file
diff --git a/genindex.html b/genindex.html
index 14315120..9eba6921 100755
--- a/genindex.html
+++ b/genindex.html
@@ -223,6 +223,7 @@
GLIDE
BBDM
+YDMSZC 발표 자료
Experiments
diff --git a/intro.html b/intro.html
index 723e7134..13a3d0e0 100755
--- a/intro.html
+++ b/intro.html
@@ -225,6 +225,7 @@
GLIDE
BBDM
+YDMSZC 발표 자료
Experiments
diff --git a/objects.inv b/objects.inv
index 70bb97fd..c4282d1b 100755
Binary files a/objects.inv and b/objects.inv differ
diff --git a/search.html b/search.html
index 6b0020ab..1a314c94 100755
--- a/search.html
+++ b/search.html
@@ -225,6 +225,7 @@
GLIDE
BBDM
+YDMSZC 발표 자료
Experiments
diff --git a/searchindex.js b/searchindex.js
index d5d2f773..90c1071e 100755
--- a/searchindex.js
+++ b/searchindex.js
@@ -1 +1 @@
-Search.setIndex({"docnames": ["docs/experiments/js_exp", "docs/experiments/swjo_exp", "docs/review/BBDM", "docs/review/CM3leon", "docs/review/ControlNet", "docs/review/CustomDiffusion", "docs/review/DALLE2", "docs/review/DDIM", "docs/review/DDPM", "docs/review/GLIDE", "docs/review/HyperDreamBooth", "docs/review/I-DDPM", "docs/review/Latent_Diffusion_Model", "docs/review/LoRA", "docs/review/SDEdit", "docs/review/SDXL", "docs/review/StyO", "docs/review/StyleGAN", "docs/review/Synthetic_Data_from_Diffusion_Models_Improves_ImageNet_Classification", "docs/review/Textual_Inversion", "docs/review/cycleGAN", "docs/review/dalle", "docs/review/diffusion_beats_GANs", "docs/review/dreambooth", "docs/review/gan", "docs/review/imagen", "docs/review/imagen_editor", "docs/review/t2i_adapter", "docs/review/vae", "intro"], "filenames": ["docs\\experiments\\js_exp.md", "docs\\experiments\\swjo_exp.md", "docs\\review\\BBDM.md", "docs\\review\\CM3leon.md", "docs\\review\\ControlNet.md", "docs\\review\\CustomDiffusion.md", "docs\\review\\DALLE2.md", "docs\\review\\DDIM.md", "docs\\review\\DDPM.md", "docs\\review\\GLIDE.md", "docs\\review\\HyperDreamBooth.md", "docs\\review\\I-DDPM.md", "docs\\review\\Latent_Diffusion_Model.md", "docs\\review\\LoRA.md", "docs\\review\\SDEdit.md", "docs\\review\\SDXL.md", "docs\\review\\StyO.md", "docs\\review\\StyleGAN.md", "docs\\review\\Synthetic_Data_from_Diffusion_Models_Improves_ImageNet_Classification.md", "docs\\review\\Textual_Inversion.md", "docs\\review\\cycleGAN.md", "docs\\review\\dalle.md", "docs\\review\\diffusion_beats_GANs.md", "docs\\review\\dreambooth.md", "docs\\review\\gan.md", "docs\\review\\imagen.md", "docs\\review\\imagen_editor.md", "docs\\review\\t2i_adapter.md", "docs\\review\\vae.md", "intro.md"], "titles": ["Synthetic Data with Stable Diffusion for Foliar Disease Classification", "Training DreamBooth on Naver Webtoon Face Dataset", "BBDM", "CM3leon", "ControlNet", "Custom Diffusion", "DALLE2", "DDIM", "DDPM", "GLIDE", "HyperDreamBooth", "I-DDPM", "Latent Diffusion Model", "LoRA", "SDEdit", "SDXL", "StyO", "StyleGAN", "Synthetic Data from Diffusion Models Improves ImageNet Classification", "Textual Inversion", "CycleGAN", "DALL-E", "Diffusion Models Beat GANs on Image Synthesis", "DreamBooth", "GAN", "Imagen", "Imagen Editor", "T2I-Adapter", "VAE", "[PseudoLab] Text-to-Image Generation (feat. Diffusion)"], "terms": {"titl": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28], "author": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28], "jisu": [0, 4, 17], "kim": [0, 2, 4, 6, 17], "last": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28], "updat": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28], "jul": [0, 1], "05": [0, 15], "2023": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28], "\uc0ac\uacfc": 0, "\ub098\ubb34\uc758": 0, "\uc78e\uc5d0": 0, "\uc0dd\uae30\ub294": [0, 18], "\uc9c8\ubcd1\uc744": 0, "\uc774\ubbf8\uc9c0\ub85c": [0, 1, 5, 10, 15, 16, 25, 26, 27], "\ud310\ubcc4\ud558\ub294": 0, "kaggl": 0, "competit": [0, 22], "\ub9c1\ud06c": [0, 4], "\uc5d0\uc11c": [0, 2, 4, 6, 8, 9, 11, 13, 18, 21, 23, 25, 26, 27, 28], "\uc544\uc774\ub514\uc5b4\ub97c": 0, "\uc5bb\uc5b4\uc11c": 0, "\uc9c4\ud589\ud55c": [0, 2, 9], "\ud504\ub85c\uc81d\ud2b8\uc785\ub2c8\ub2e4": 0, "\ud574\ub2f9": [0, 5, 8, 9, 10, 12, 14, 18, 19, 23, 27, 28], "competition\uc740": 0, "\uc0ac\uacfc\ub098\ubb34": 0, "\uac78\ub9b0": 0, "\uc9c8\ubcd1\uc5d0": 0, "\ub530\ub77c": [0, 2, 3, 6, 9, 10, 11, 13, 15, 18, 19, 20, 21, 22, 23, 28], "\uc78e": 0, "\uc774\ubbf8\uc9c0\ub97c": [0, 3, 4, 5, 6, 7, 8, 9, 10, 11, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 25, 27], "4\uac1c\uc758": [0, 6, 19, 27], "class\ub85c": 0, "\ubd84\ub958\ud558\ub294": [0, 9, 27], "task\uc785\ub2c8\ub2e4": 0, "class": [0, 4, 5, 7, 8, 9, 11, 17, 18, 20, 22, 23, 24, 25, 27, 28], "leav": 0, "competition\uc744": 0, "\uc124\uba85\ud55c": [0, 27], "articl": 0, "\uc804\uccb4\uc801\uc778": [0, 6, 17], "accuracy\ub294": 0, "97": 0, "\uc774\uc9c0\ub9cc": 0, "multipl": [0, 27], "class\uc758": [0, 22], "\uacbd\uc6b0": [0, 1, 2, 4, 5, 10, 11, 12, 13, 14, 15, 16, 17, 18, 20, 24, 27], "accuracy\uac00": 0, "51": 0, "\uc5d0": [0, 1, 2, 3, 4, 5, 6, 8, 9, 10, 11, 12, 13, 16, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28], "\ubd88\uacfc\ud588\ub2e4\uace0": 0, "\uc5b8\uae09\ud569\ub2c8\ub2e4": 0, "\uc774\ubbf8\uc9c0": [0, 3, 4, 5, 6, 7, 9, 10, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27], "\uac1c\uc218\uac00": 0, "\ub2e4\ub978": [0, 2, 5, 7, 8, 9, 10, 11, 13, 16, 17, 18, 19, 20, 21, 23, 25, 26, 27, 28], "class\uc5d0": [0, 9], "\ube44\ud574": [0, 3, 4, 5, 7, 9, 10, 11, 16, 18, 22, 26], "\uc801\uc740": [0, 3, 4, 5, 7, 8, 9, 11, 13, 19, 22], "\uc810\uc5d0": 0, "\uc8fc\ubaa9\ud588\uace0": 0, "diffusion\uc744": [0, 14], "\uc0ac\uc6a9\ud558\uc5ec": [0, 8, 10, 17, 18, 20, 21, 23, 25], "\ud074\ub798\uc2a4\uc758": [0, 18], "\ub370\uc774\ud130": [0, 14, 15, 18, 19, 20, 21, 24, 25, 28], "\uac1c\uc218\ub97c": [0, 8], "\ub298\ub824\uc11c": 0, "classifi": [0, 18, 24, 26, 27], "\ud559\uc2b5\uc5d0": [0, 3, 11, 13, 18, 20], "\uc0ac\uc6a9\ud558\uba74": [0, 11, 19, 21], "\ub354": [0, 1, 2, 4, 5, 6, 7, 8, 9, 10, 11, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 25, 26, 27, 28], "\uc88b\uc740": [0, 1, 2, 5, 9, 14, 15, 16, 18, 20, 22, 23, 25, 27], "\uc131\ub2a5\uc758": [0, 18], "classifier\ub97c": [0, 9], "\uc5bb\uc744": [0, 15, 18, 19, 20], "\uc218": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28], "\uc788\uc744": [0, 1, 2, 4, 8, 13, 17, 18, 20], "\uac83\uc73c\ub85c": [0, 9, 13, 18, 19], "\uae30\ub300\ud588\uc2b5\ub2c8\ub2e4": 0, "\ubb38\uc81c": [0, 27], "\uc0c1\ud669\uc744": 0, "\uc7ac\ud604\ud558\uae30": 0, "\uc704\ud574": [0, 2, 3, 4, 5, 6, 8, 9, 10, 13, 15, 16, 17, 19, 20, 21, 23, 26, 27], "\uae30\uc874": [0, 3, 4, 5, 9, 10, 11, 12, 13, 15, 16, 17, 18, 19, 20, 22, 25, 27], "\ub370\uc774\ud130\ub85c": [0, 1, 4, 15, 18], "imag": [0, 1, 2, 6, 7, 10, 12, 14, 15, 16, 17, 18, 19, 21, 24, 25, 26, 27, 28], "\ud559\uc2b5\ud558\uc5ec": [0, 21, 22], "baseline\uc73c\ub85c": 0, "\uc7a1\uc558\uc2b5\ub2c8\ub2e4": 0, "\ubaa8\ub378\uc740": [0, 2, 3, 6, 10, 11, 12, 13, 14, 15, 17, 18, 19, 20, 27], "pretrained\ub41c": 0, "resnet18\uc5d0": 0, "linear": [0, 2, 8, 9, 11, 13, 17, 22, 24, 28], "layer\ub97c": [0, 13, 17], "\ubd99\uc5ec\uc11c": 0, "\uc0ac\uc6a9\ud588\uc2b5\ub2c8\ub2e4": [0, 6, 10, 26], "\uc804\uccb4": [0, 2, 4, 5, 9, 10, 11], "7": [0, 1, 2, 3, 7, 8, 11, 14, 25, 27], "class\ubcc4": 0, "healthi": 0, "99": 0, "73": [0, 19], "rust": 0, "scab": 0, "98": 0, "class\ub294": 0, "\uac1c\uc218": 0, "91\uac1c\ub85c": 0, "\ud074\ub798\uc2a4\ub4e4\uc5d0": 0, "\ube44\ud574\uc11c": [0, 6], "\uc801\uc2b5\ub2c8\ub2e4": 0, "imbalance\uac00": 0, "\uc131\ub2a5\uc744": [0, 2, 3, 4, 5, 7, 9, 11, 12, 13, 15, 16, 17, 18, 20, 21, 22, 23, 25, 26, 27], "\ub0ae\ucd94\ub294": 0, "\uc6d0\uc778\uc77c": [0, 18], "\uac83\uc774\ub77c": [0, 13], "\uac00\uc815\ud558\uace0": 0, "diffusion\uc73c\ub85c": [0, 18], "data\ub97c": [0, 5, 20], "\ucd94\uac00\ub85c": [0, 3, 11, 15, 16, 20], "\uc0dd\uc131\ud574\ubcf4\uae30\ub85c": 0, "\ud588\uc2b5\ub2c8\ub2e4": [0, 1, 6, 17, 18], "\uc608\uc2dc": [0, 3, 20, 25, 26, 27], "pretran": 0, "diffusion\uc758": 0, "\ub300\ud55c": [0, 1, 4, 5, 6, 8, 9, 10, 11, 15, 16, 18, 19, 23, 24, 27], "\uc815\ubcf4\uac00": [0, 6, 10, 16, 23], "\uc5c6\uc5b4\uc11c": 0, "\uc0dd\uc131\ud560": [0, 1, 3, 4, 6, 10, 14, 15, 18, 20, 23], "\uc544\ub798\uc640": [0, 4, 12, 17, 18, 20, 22], "\uac19\uc774": [0, 2, 4, 5, 6, 8, 10, 12, 17, 19, 20, 21, 22, 23, 24, 27, 28], "\uad00\ub828\uc5c6\ub294": 0, "\uc774\ubbf8\uc9c0\uac00": [0, 4, 6, 8, 10, 11, 14, 15, 16, 18, 19, 20, 22, 24, 25], "\uc0dd\uc131\ub429\ub2c8\ub2e4": 0, "prompt": [0, 4, 5, 6, 9, 10, 16, 19, 23, 25, 26, 27], "photo": [0, 1, 2, 5, 19], "\ub530\ub77c\uc11c": [0, 2, 3, 4, 6, 8, 9, 10, 11, 14, 15, 16, 18, 19, 20, 26, 27], "model": [0, 2, 4, 6, 7, 10, 13, 21, 24, 26, 29], "\uc815\ubcf4\ub97c": [0, 4, 6, 8, 10, 16, 18, 19, 20, 23, 27], "\ub123\uc5b4\uc8fc\uae30": 0, "dreambooth": [0, 5], "\ub97c": [0, 1, 2, 3, 4, 5, 6, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28], "tuning\ud588\uc2b5\ub2c8\ub2e4": 0, "training\uc5d0": [0, 7, 22], "\uc0ac\uc6a9\ud55c": [0, 3, 10, 15, 17, 18, 19, 22], "prompt\ub294": [0, 10], "disea": 0, "leaf": 0, "\uc774\uba70": [0, 2, 19], "\uc0dd\uc131\ud55c": [0, 4, 6, 21, 23, 25, 27], "\uc774\ubbf8\uc9c0\uc758": [0, 1, 4, 5, 6, 8, 16, 17, 18, 19, 20, 21, 23, 25], "\uc608\uc2dc\ub294": [0, 25, 27], "\uac19\uc2b5\ub2c8\ub2e4": [0, 1, 4, 6, 17, 18, 20, 23, 24, 27], "\uc0dd\uc131": [0, 3, 6, 7, 8, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 21, 22, 23, 25, 27], "engineering\uc744": 0, "\uc218\ud589\ud558\ub358": 0, "\uc911": [0, 3, 5, 7, 10, 11, 13, 14, 15, 16, 17, 18, 20, 21, 22, 23, 25, 27, 28], "\uc758\ub3c4\ud558\uc9c0\uc54a\uc740": 0, "\uacb0\uacfc\ub97c": [0, 2, 3, 4, 6, 7, 9, 10, 11, 15, 16, 17, 18, 19, 20, 25, 26], "\ubc1c\uacac\ud588\uc2b5\ub2c8\ub2e4": [0, 1, 6], "\uc544\ub798\ub294": [0, 4, 14, 28], "\uc774\uc5d0": [0, 2, 4, 10, 16, 18, 23, 27], "\uc608\uc2dc\ub85c": 0, "\uc804\uc758": [0, 15], "model\uc758": [0, 4, 5, 8, 10, 11, 13, 18, 19, 20, 22], "\uacb0\uacfc\uc640": [0, 6], "\ube44\uad50\uc785\ub2c8\ub2e4": 0, "\uc0c1\ud6691": 0, "\uc804": [0, 8, 13, 15, 18, 22], "\ud6c4": [0, 1, 2, 3, 6, 8, 9, 13, 14, 15, 20, 21, 23, 25, 26, 27], "\uc0c1\ud6691\uc744": 0, "\ubcf4\uba74": [0, 2, 5, 9, 11, 15, 17, 18, 19, 20, 21], "\ub2f4\uc740": 0, "uniqu": [0, 1, 23], "identifi": [0, 1, 16, 23], "\uac00": [0, 1, 2, 4, 6, 8, 9, 10, 11, 13, 15, 20, 22, 23, 24, 25, 26, 27, 28], "\uc5c6\uc74c\uc5d0\ub3c4": [0, 9], "diseases\uc758": 0, "\uc78e\ub4e4\ub9cc": 0, "\uc774\ub294": [0, 3, 4, 7, 10, 15, 17, 18, 19, 24, 26, 27, 28], "\uac19\uc740": [0, 1, 2, 3, 4, 6, 8, 9, 10, 11, 12, 13, 15, 17, 18, 19, 20, 22, 23, 26, 27, 28], "\uc18d\ud558\ub294": 0, "\uc774\ubbf8\uc9c0\ub4e4\uc744": [0, 1, 2, 4, 6, 23, 26], "\uc0dd\uc131\ud574\ub0b4\uc9c0": [0, 5], "\ubabb\ud558\uace0": [0, 8], "\uc788\ub2e4\ub294": [0, 9, 10, 13, 17, 19, 25, 26], "\uac83\uc785\ub2c8\ub2e4": [0, 4, 6, 10, 17, 18, 20, 26], "\uc774": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 27, 28], "\ud604\uc0c1\uc744": [0, 5, 17], "languag": [0, 3, 5, 6, 13, 18, 19, 21, 23, 25], "drift\ub77c\uace0": 0, "\ud558\uba70": [0, 2, 21], "\ubaa8\ub378\uc774": [0, 1, 3, 4, 5, 7, 8, 10, 15, 16, 17, 18, 22, 23, 26, 27], "leaf\uac00": 0, "\uc544\ub2cc": [0, 1, 2, 4, 7, 13, 19, 20, 22], "\uc77c\ubc18\uc801\uc778": [0, 3, 5, 10, 15, 19, 22], "\uad00\ud55c": [0, 11, 16, 17], "\uc78a\uc5b4\ubc84\ub838\uae30": 0, "\ub54c\ubb38\uc785\ub2c8\ub2e4": [0, 20], "\uc0c1\ud6692": 0, "\uc0c1\ud6692\ub97c": 0, "photo\ub77c\ub294": 0, "prompt\ub9cc": [0, 16], "\uc0ac\uc6a9\ud558\uc600\ub294\ub370\ub3c4": 0, "\uc774\ubbf8\uc9c0\ub4e4\uc5d0": [0, 6], "\ud2b9\uc9d5\ub4e4\uc774": 0, "\ub098\ud0c0\ub0a9\ub2c8\ub2e4": 0, "dreambooth\uc5d0\uc11c\ub294": 0, "drift\ub97c": 0, "prior": [0, 6, 23, 28], "preserv": [0, 23], "loss\ub97c": [0, 3, 8, 19, 20], "\uc0ac\uc6a9\ud574\uc11c": [0, 2, 4, 6, 9, 21, 22, 25], "\ud574\uacb0\ud558\uc600\uc73c\ubbc0\ub85c": 0, "\ubc29\ubc95\uc744": [0, 2, 3, 9, 10, 11, 15, 17, 18, 19, 20, 22, 27], "\ud574\uacb0\ud558\uae30": [0, 2, 13, 15, 19, 23, 26, 27], "train": [0, 2, 6, 7, 10, 11, 13, 16, 17, 19, 22, 23, 25, 27], "prompt\uc5d0\uc11c": 0, "\uc81c\uc678\ud558\uace0": [0, 13, 15], "\ucd5c\ub300\ud55c": [0, 15, 19, 20, 27, 28], "\ub2e8\uc21c\ud55c": [0, 5], "model\uc744": [0, 4, 5, 7, 9, 10, 11, 13, 15, 16, 19, 22], "\ub2e4\uc2dc": [0, 2, 8, 13, 14, 17, 20, 23, 24, 27, 28], "\uacb0\uacfc": [0, 1, 2, 3, 5, 6, 10, 11, 12, 15, 18, 22, 25, 26, 27], "\uc7ac\ud6c8\ub828": 0, "\uc774\ud6c4\uc5d0\ub3c4": 0, "model\ub85c": [0, 9], "\uc0dd\uc131\ud558\uc600\uc744": 0, "\ub54c\uc640": [0, 20], "\ube44\uc2b7\ud55c": [0, 2, 3, 5, 11, 19, 20, 22, 23], "\uc758": [0, 1, 2, 4, 6, 7, 8, 9, 10, 11, 13, 15, 17, 20, 21, 22, 23, 24, 25, 26, 27, 28], "\uacbd\uc6b0\uc5d0\ub294": [0, 18], "\uc5ec\uc804\ud788": [0, 2, 5, 10, 25], "\uc601\ud5a5\uc744": [0, 3, 8, 11, 16, 17, 18, 22, 25], "\ubc1b\uc740": [0, 19], "\uac83\uac19\uc740": 0, "\uc774\ubbf8\uc9c0\ub4e4\uc774": [0, 2, 4], "photo\uc758": 0, "\uc5ec\ub7ec": [0, 2, 10, 12, 18, 19, 23, 27], "\ub300\uc0c1\ub4e4\uacfc": 0, "\uc0ac\uc6a9\ub418\ub294": [0, 10, 18, 19, 23], "\ud2b9\uc131\uc744": [0, 10, 20], "\uac00\uc9c0\uace0\uc788\uc5b4\uc11c": 0, "\uadf8\ub7f0": [0, 14, 17, 20], "\uac83\uc774\ub77c\ub294": [0, 18], "\uc0dd\uac01\uc774": [0, 18], "\ub4e4\uc5c8\uace0": 0, "\uc774\ub97c": [0, 2, 4, 8, 10, 12, 13, 15, 17, 18, 19, 20, 23, 24, 26, 27, 28], "\uccb4\ud06c\ud574\ubcf4\uae30": 0, "\ud2b9\uc815\ud55c": [0, 2, 6, 17, 19], "photo\uc640": 0, "\uc6a9\ub3c4\ub85c": 0, "prompt\ub4e4\ub85c": 0, "\uc0dd\uc131\ubcf4\uc558\uc2b5\ub2c8\ub2e4": 0, "\ub300\uc0c1": [0, 20], "\uc138\uac00\uc9c0\ub85c\ub294": 0, "cat": [0, 8, 26, 27], "sea": 0, "pirate\uc744": 0, "\uc0ac\uc6a9\ud588\uace0": [0, 3, 9, 15, 20], "\ube44\uc2b7\ud558\uac8c": [0, 2, 19], "\ud14d\uc2a4\ud2b8": [0, 3, 6, 10, 18, 19, 25], "\uc138\uac00\uc9c0\ub294": 0, "illustr": 0, "anim": [0, 21], "wallpaper\ub97c": 0, "\uc774\ubbf8\uc9c0\ub294": [0, 5, 10, 15, 16, 20, 25], "\uae00": 0, "\ub9c8\uc9c0\ub9c9": [0, 8, 9, 17, 18], "\ubd80\ubd84\uc758": 0, "appendix\uc5d0": 0, "\uc788\uc2b5\ub2c8\ub2e4": [0, 1, 4, 6, 10, 17, 18, 20, 23, 24, 26, 27, 28], "\ub300\uc0c1\uc744": [0, 20], "\uc9c0\uce6d\ud558\ub294": 0, "\ud14d\uc2a4\ud2b8\uc758": 0, "\ub300\uc0c1\uc758": [0, 23], "\ud2b9\uc9d5\uc774": 0, "\uc798": [0, 1, 2, 3, 4, 5, 9, 10, 11, 14, 15, 16, 17, 18, 19, 20, 23, 24, 28], "\ub4dc\ub7ec\ub098\ub294": 0, "\uc0dd\uc131\ub418\uc5c8\uc9c0\ub9cc": 0, "\ub300\uc0c1\uacfc": [0, 10, 20], "\ud568\uaed8": [0, 9, 10, 13, 15, 20, 28], "\uc4f0\uc774\ub294": [0, 18, 20, 24], "\uc78e\uc0ac\uadc0\uc758": 0, "\ud2b9\uc9d5\uc744": [0, 4, 23], "\uac00\uc9c0\ub294": [0, 1, 17], "\uc77c\ubd80": [0, 3, 9, 10, 13, 17], "\uc0dd\uc131\ub418\uc5c8\uc2b5\ub2c8\ub2e4": 0, "tuning\ud55c": 0, "400\uc7a5": 0, "\uc0dd\uc131\ud558\uc5ec": 0, "\ud6c8\ub828\ud588\uc2b5\ub2c8\ub2e4": 0, "result_bas": 0, "\ucd94\uac00": [0, 5, 8, 10, 14, 15, 27], "\ud65c\uc6a9\ud55c": [0, 6, 9, 23, 24], "9": [0, 2, 3, 11, 14, 15, 20], "84": 0, "result_now": 0, "kaggle\uc5d0\uc11c": 0, "\uc81c\uacf5\ud558\ub294": [0, 6, 19], "test": [0, 2, 19, 20, 25], "set\uc5d0": [0, 18], "\uc801\uc6a9\ud588\uc744": 0, "\ub54c\ub294": [0, 2, 18], "baseline\uc774": [0, 19], "94": 0, "\uacbd\uc6b0\uac00": [0, 4, 7, 20], "93": 0, "\uc5ec\uc11c": 0, "baseline\ubcf4\ub2e4": 0, "\uc5bb\uc9c0\ub294": 0, "\ubabb": 0, "\ud6c8\ub828": [0, 4, 15, 18, 20, 25], "\uc911\uac04\uc911\uac04\uc5d0": 0, "\uc77c\uc815": 0, "step\ub9c8\ub2e4": 0, "\uc0dd\uc131\ud558\uac8c\ud574\uc11c": 0, "\ud6c8\ub828\uc5d0": [0, 17], "\ubaa8\ub2c8\ud130\ub9c1\uc774": 0, "\uc788\uc73c\uba74": 0, "\uc88b\uaca0\ub2e4\ub294": 0, "\uc0dd\uac01\uc744": 0, "\ud6c8\ub828\uc2dc": 0, "hyperparamet": [0, 7, 10, 16, 22, 27], "tuning\uc744": [0, 4, 10, 13, 18, 19], "\uc880": [0, 4, 6, 16, 25], "\ucca0\uc800\ud558\uac8c": 0, "\ud574\uc57c\uaca0\ub2e4\ub294": 0, "\uc2e4\uc81c\ub85c": [0, 2, 3, 11, 15, 17, 18, 20, 24, 28], "\uc870\uac74\uc744": [0, 10, 19], "\ub9cc\uc871\ud558\ub294\uc9c0": 0, "\uac80\uc218\ud560": 0, "\ubc29\uc548\uc774": 0, "\ud544\uc694\ud569\ub2c8\ub2e4": 0, "\ub0b4\uc5d0\uc11c\ub3c4": 0, "\uce74\ud14c\uace0\ub9ac\ub97c": 0, "\ub098\ub20c": 0, "\uc788\ub2e4\uba74": [0, 6, 8], "\ub098\ub220\uc11c": [0, 25], "\uac01\uac01\uc5d0": [0, 6, 17, 18], "tuning\ud560": [0, 5, 13], "\uc218\ub3c4": [0, 6, 17, 20, 27], "\ud65c\uc6a9\ud574\ubcfc": 0, "submiss": 0, "score\uc5d0\uc11c": [0, 18], "baseline\uc744": 0, "\uc774\uae30\uc9c0": 0, "\ud588\uc9c0\ub9cc": 0, "text": [0, 1, 2, 4, 6, 8, 10, 12, 15, 16, 17, 18, 21, 25, 26, 27], "\uc774\uc6a9\ud55c": [0, 16, 18], "data\uc758": [0, 11, 16], "\uac00\ub2a5\uc131\uc744": [0, 7], "\ubcfc": [0, 1, 6, 10, 15, 17, 18, 19, 20, 21, 22, 26], "\uc788\uc5c8\ub2e4\uace0": [0, 13, 26, 27], "\uc0dd\uac01\ud569\ub2c8\ub2e4": [0, 17], "\uc55e\uc5d0\uc11c": 0, "\uc5b8\uae09\ud55c": [0, 4, 15, 26], "prompt\uc5d0": [0, 5, 9], "\uc608\uc2dc\uc785\ub2c8\ub2e4": [0, 1], "nsfw\ub85c": 0, "\ud310\ub2e8\ub418\uc5b4": 0, "\uac80\uc740\uc0c9\uc73c\ub85c": 0, "\ub098\uc654\uc2b5\ub2c8\ub2e4": [0, 17], "pirat": 0, "wallpap": 0, "sangwoo": [1, 23, 24, 26, 27, 28], "jo": [1, 23, 24, 26, 27, 28], "09": 1, "\uc774\ubc88": [1, 26, 27], "\ud3ec\uc2a4\ud305\uc5d0\uc11c\ub294": [1, 6], "\uc9c1\uc811": [1, 2, 11, 14, 24, 28], "\ud559\uc2b5\ud574\ubcf4\uace0": 1, "\uc2e4\ud5d8\ud55c": [1, 10], "\uacb0\uacfc\ub4e4\uc744": [1, 6, 23, 27], "\uacf5\uc720\ud560\ub824\uace0": 1, "\ud569\ub2c8\ub2e4": [1, 4, 6, 10, 18, 20, 23, 24, 26, 27, 28], "\uc6b0\uc120\uc801\uc73c\ub85c": [1, 21, 27, 28], "\ud559\uc2b5\ub370\uc774\ud130\ub294": 1, "bryandle": 1, "data": [1, 13, 17, 20, 24], "\uacf5\uac1c\ub41c": [1, 13, 26], "yolov5": 1, "\ubaa8\ub378": [1, 2, 3, 5, 6, 7, 9, 10, 11, 13, 14, 15, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27], "\ubc0f": [1, 3, 6, 10, 13, 15, 18, 19, 20, 22, 24, 25, 26, 27], "waifu2x": 1, "\ud6c4\ucc98\ub9ac": [1, 15], "\uae30\ubc95\uc744": [1, 5, 9, 11, 26], "\ud65c\uc6a9\ud558\uc5ec": [1, 9, 10, 15, 21, 22, 23], "\ud504\ub9ac\ub4dc\ub85c\uc6b0\uc5d0": 1, "\ub4f1\uc7a5\ud558\ub294": 1, "\uc778\ubb3c": [1, 20], "\uc0ac\uc9c4\ub4e4\uc744": [1, 23], "\uc218\uc9d1\ud588\uc2b5\ub2c8\ub2e4": 1, "\ub17c\ubb38\uc5d0\uc11c\ub294": [1, 2, 4, 6, 8, 9, 10, 12, 13, 17, 18, 19, 20, 21, 22, 23, 26, 27, 28], "3": [1, 2, 6, 12, 19, 23, 24, 25, 28], "5": [1, 2, 8, 9, 10, 14, 20, 24, 27, 28], "\uc7a5\uc73c\ub85c": 1, "fine": [1, 4, 6, 10, 13, 16, 17, 19, 21, 25, 29], "tune": [1, 6, 10, 13, 21, 25, 29], "\uac00\ub2a5\ud558\ub2e4\uace0": [1, 17], "\uc81c\uc2dc\ub418\uc5b4\uc788\uc9c0\ub9cc": 1, "\uc0ac\uc9c4": [1, 5, 19, 20, 25], "\ub9ce\uc740": [1, 4, 6, 9, 15, 18, 19, 20, 21, 25], "\ud559\uc2b5\ud558\uba74": [1, 8, 23], "\uc131\ub2a5\uc774": [1, 2, 8, 11, 13, 15, 18, 22, 26, 27], "\uc88b\uc544\uc838\uc11c": 1, "15": [1, 3, 10], "20": [1, 3, 9, 11, 23], "\uc7a5\uc758": [1, 6], "\ud559\uc2b5\ud558\uc600\uc2b5\ub2c8\ub2e4": 1, "\ud559\uc2b5\ud55c": [1, 5, 6, 9, 11, 13, 16, 18, 23, 25, 26], "\uc774\ubbf8\uc9c0\ub4e4": [1, 15], "\uc2e4\ud5d8\ud558\uba74\uc11c": 1, "\ub300\ud45c\uc801\uc73c\ub85c": [1, 23, 27, 28], "\uadf8\ub9ac\uace0": [1, 2, 10, 18, 19, 23, 24, 26, 27, 28], "\ub9c8\uc9c0\ub9c9\uc73c\ub85c": [1, 10, 17, 23, 26, 27, 28], "\ubc18\uc601\ud558\ub294": 1, "\uc815\ub3c4\ub97c": [1, 7, 11], "\uc870\uc808\ud558\ub294": [1, 4, 7, 10, 18], "prior_loss_weight": [1, 23], "\ubc14\uafd4\uac00\uba74\uc11c": 1, "\ud559\uc2b5\ud574\ubcf4\uc558\uc2b5\ub2c8\ub2e4": 1, "\uc0ac\uc804\ud559\uc2b5\ub41c": [1, 18, 23], "\ubaa8\ub378\ub85c": [1, 5, 10, 17, 18, 21, 24, 26, 27], "\ucc98\uc74c\uc5d0\ub294": [1, 3, 13, 18], "hakurei": 1, "waifu": 1, "diffus": [1, 2, 4, 6, 7, 10, 13, 15, 24, 26], "\ubaa8\ub378\uc744": [1, 3, 5, 6, 7, 8, 9, 10, 11, 14, 15, 18, 19, 20, 23, 25, 26, 27], "\uc2dc\ub3c4\ud574\ubd24\uc9c0\ub9cc": 1, "\uacb0\uacfc\uac00": [1, 2, 8, 9, 18, 20, 22], "\ub9cc\uc871\uc2a4\ub7fd\uc9c0": 1, "\ubabb\ud574": 1, "runwayml": 1, "stabl": [1, 4, 10, 11, 13, 15, 18, 23, 26], "v1": [1, 10], "\uc791\uc5c5\uc744": [1, 19, 27], "\uc9c4\ud589\ud588\uc2b5\ub2c8\ub2e4": [1, 10, 24, 26, 27], "\uc81c\uc678\ud55c": 1, "\ub3d9\uc77c\ud55c": [1, 3, 10, 11, 15, 18, 20, 23, 26, 27], "configur": [1, 22, 24], "\uc73c\ub85c": [1, 2, 6, 10, 13, 19, 21, 23, 25, 26, 27], "\uacb0\uacfc\uc785\ub2c8\ub2e4": [1, 12, 18, 27], "model_nam": 1, "instance_prompt": 1, "A": [1, 2, 3, 4, 5, 6, 10, 13, 17, 19, 23, 25, 27], "sk": [1, 16, 19], "girl": 1, "class_prompt": 1, "python3": 1, "train_dreambooth": [1, 23], "py": [1, 23], "pretrained_model_name_or_path": [1, 23], "pretrained_vae_name_or_path": 1, "stabilityai": 1, "sd": [1, 15, 27], "vae": [1, 2, 5, 11, 23, 24], "ft": 1, "mse": [1, 8], "output_dir": 1, "revis": [1, 23], "fp16": 1, "with_prior_preserv": [1, 23], "1": [1, 2, 4, 6, 10, 12, 15, 17, 19, 20, 23, 24, 25, 28], "0": [1, 2, 3, 4, 5, 6, 7, 8, 12, 14, 15, 17, 18, 20, 21, 23, 24, 27, 28], "seed": 1, "1337": 1, "resolut": [1, 2, 9, 11, 12, 18, 22, 26], "512": [1, 15, 24], "train_batch_s": 1, "train_text_encod": [1, 23], "mixed_precis": 1, "use_8bit_adam": 1, "gradient_accumulation_step": [1, 23], "gradient_checkpoint": 1, "learning_r": 1, "1e": [1, 16], "6": [1, 2, 3, 5, 14, 15, 16, 20], "lr_schedul": [1, 23], "constant": [1, 11, 22], "lr_warmup_step": 1, "num_class_imag": 1, "200": [1, 2, 15, 25], "sample_batch_s": 1, "4": [1, 2, 6, 12, 17, 20, 24], "max_train_step": 1, "800": 1, "save_interv": 1, "100": [1, 11, 18, 20], "save_sample_prompt": 1, "concepts_list": 1, "json": 1, "w": [1, 2, 4, 5, 8, 12, 13, 17, 21, 25], "o": [1, 16, 26], "\uc544\ub798": [1, 2, 4, 6, 11, 17, 18, 20, 21, 23, 24, 25, 27, 28], "\uadf8\ub9bc\ucc98\ub7fc": [1, 6, 13, 24, 25], "infer": [1, 2, 8, 15, 27, 28], "\uc785\ub825\ud588\uc744": 1, "\ub54c": [1, 2, 3, 4, 5, 6, 7, 8, 10, 11, 12, 14, 15, 16, 17, 18, 19, 20, 22, 23, 24, 28], "\uc81c\uc678\ud568\uc73c\ub85c\uc368": 1, "input": [1, 2, 3, 4, 5, 6, 13, 17, 19, 20, 21, 23, 24, 26, 27], "\uac00\uae4c\uc6b4": [1, 3, 19, 20, 21], "\uc6f9\ud230": 1, "\uc788\uc5c8\uc2b5\ub2c8\ub2e4": [1, 4, 6, 10, 18, 26], "\ub610\ud55c": [1, 2, 3, 4, 5, 8, 9, 10, 12, 13, 15, 18, 20, 23, 26, 27], "\ud551\ud06c\uc0c9": 1, "\uba38\ub9ac\ub97c": 1, "\ud55c": [1, 2, 6, 8, 9, 10, 11, 13, 15, 17, 18, 20, 21, 22, 23, 24, 25, 26, 27, 28], "\uc774\ubbfc\uc9c0": 1, "\uce90\ub9ad\ud130\ub97c": 1, "\uc5b4\ub290": [1, 17, 18, 19], "\uc815\ub3c4": [1, 3, 7, 11, 13, 17, 18], "\uc0dd\uc131\ud558\ub294": [1, 3, 4, 6, 8, 9, 10, 12, 14, 17, 18, 20, 23, 24, 25, 26, 28], "\ubd80\ubd84\ub3c4": [1, 26], "\ud655\uc778\ud560": [1, 11, 14, 15, 18, 23, 26, 27], "pink": 1, "hair": [1, 16, 17], "With": 1, "without": [1, 13, 16, 17], "\ub3c4": [1, 2, 3, 6, 8, 10, 16, 23, 27, 28], "\uce90\ub9ad\ud130\uc758": [1, 23], "\ubd80\uc790\uc5f0\uc2a4\ub7ec\uc6b4": 1, "\ubd80\ubd84\uc774\ub098": 1, "\uc800\ud574\uc0c1\ub3c4": 1, "\uacbd\uc6b0\ub4e4\uc774": 1, "\uc885\uc885": [1, 20], "\ubc1c\uc0dd\ud588\ub294\ub370": 1, "\ud1b5\ud574": [1, 2, 3, 5, 7, 8, 9, 10, 13, 14, 15, 17, 18, 19, 20, 21, 22, 23, 24, 27, 28], "\ud004\ub9ac\ud2f0\uc758": [1, 11, 14, 16, 18], "ugli": 1, "disfigur": 1, "deform": 1, "low": [1, 6, 10, 11, 14, 21, 27], "\ub17c\ubb38\uc5d0\uc11c": [1, 2, 4, 9, 12, 17, 18, 20, 21, 23, 24, 28], "\uc81c\uc2dc\ud55c": [1, 5, 6, 9, 14, 21, 24, 25], "\uc678\uc5d0": 1, "style": [1, 2, 6, 10, 16, 19, 23], "\ub77c\ub294": [1, 2, 4, 6, 10, 18, 19, 20, 22, 25], "\ub85c": [1, 2, 3, 4, 6, 8, 9, 10, 11, 12, 13, 14, 15, 17, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29], "\ud559\uc2b5\uc744": [1, 5, 9, 12, 15, 17, 18, 20, 22, 26], "\uc2dc\ub3c4\ud574\ubcf4\uae30\ub3c4": 1, "\ud2b9\uc815": [1, 5, 6, 7, 8, 9, 10, 11, 13, 15, 16, 18, 19, 25, 27], "\uc5ec\uc790": 1, "\uce90\ub9ad\ud130\uc5d0": 1, "\uc815\ubcf4\ubfd0\ub9cc": 1, "\uc544\ub2c8\ub77c": [1, 2, 5, 6, 8, 9, 13, 15, 17, 20, 23], "\ud504\ub9ac\ub4dc\ub85c\uc6b0": 1, "\uadf8\ub9bc\uccb4": 1, "\uc790\uccb4\ub97c": [1, 2, 6, 11], "\ub2f4\uc544\ub0b4\uae30": 1, "\uc704\ud55c": [1, 2, 3, 6, 10, 11, 12, 13, 15, 18, 19, 20, 22, 28], "\ubaa9\uc801\uc774\uc600\uc2b5\ub2c8\ub2e4": 1, "differ": [1, 6, 13, 17], "\uc2dc": [1, 21, 22, 23, 24, 25, 26, 27], "\ud504\ub9ac\ub4dc\ub85c\uc6b0\uc758": 1, "\uadf8\ub9bc\uccb4\uac00": [1, 6], "\ubc18\uc601\ub41c": [1, 6], "\ub0a8\uc790\uac00": 1, "\uc0dd\uc131\ub418\ub3c4\ub85d": 1, "boi": 1, "\uc785\ub825\ud588\uc744\ub54c\uc758": 1, "\ud639\uc740": [1, 2, 5, 6, 12, 15, 23, 28], "\uc791\uac00\ub2d8\uc758": 1, "\uc7a5\uba74\ub4e4\ub85c": 1, "\uc804\uccb4\uc801\uc73c\ub85c": 1, "\ud559\uc2b5\ud558\uac8c": [1, 26, 27], "\ub41c\ub2e4\uba74": [1, 15], "\ub2e4\uc591\ud55c": [1, 3, 5, 6, 10, 11, 12, 15, 16, 18, 19, 20, 21, 22, 23, 25, 27], "\uac83": [1, 2, 6, 8, 18, 19, 20, 25], "num_inference_step": [1, 27], "24": [1, 18], "step": [1, 2, 5, 6, 7, 8, 9, 11, 16, 18, 22, 23, 24, 27], "\uc744": [1, 2, 3, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15, 17, 19, 20, 21, 22, 23, 24, 25, 27, 28], "\ub298\ub824\uac00\uba74\uc11c": 1, "\ucd94\ub860\ub41c": 1, "\ud004\ub9ac\ud2f0\uac00": [1, 3, 15, 27], "\uc0c1\uc2b9\ud558\ub294": 1, "\uc2e4\ud5d8\ub3c4": 1, "\uc9c4\ud589\ud588\ub294\ub370": 1, "\uc791\uc744\uc218\ub85d": [1, 18], "\uc640": [1, 2, 3, 5, 6, 7, 8, 9, 10, 11, 12, 13, 16, 17, 19, 21, 23, 24, 25, 26, 27, 28], "\ubb34\uad00\ud55c": [1, 26], "random": [1, 2, 5, 7, 8, 10, 13, 14, 15, 19, 23, 24, 26, 27], "\uc0dd\uc131\ud558\uac8c": [1, 18, 23, 25, 27, 28], "\ub429\ub2c8\ub2e4": [1, 4, 6, 10, 17, 18, 20, 23, 24, 26, 27, 28], "\ucd5c\uc885\uc801\uc73c\ub85c": [1, 17, 18, 27], "num_infer": 1, "\uac12\uc740": [1, 2, 11, 18, 22, 23], "\uac01\uac01": [1, 2, 3, 4, 5, 6, 9, 10, 19, 23, 24, 27, 28], "\uacfc": [1, 2, 3, 6, 7, 8, 11, 15, 16, 19, 21, 23, 25, 27], "\uc124\uc815\ud558\uc600\uc2b5\ub2c8\ub2e4": 1, "increas": [1, 6], "number": [1, 22, 27], "guidance_scal": [1, 27], "\uc81c\uc678\ud574\ubcf8": 1, "\uc0dd\uc131\ub41c": [1, 2, 6, 9, 10, 14, 15, 17, 18, 19, 20, 22, 23, 24, 25, 26, 27, 28], "\ub0a8\uc790\uc758": 1, "\uba38\ub9ac\uce74\ub77d\uc774": 1, "\uae38\uc5b4\uc9c0\uace0": 1, "\uc5ec\uc131\uc2a4\ub7ec\uc6b4": 1, "\uc0dd\uae40\uc0c8\ub97c": [1, 19], "\ub180\ub77c\uc6b4": [1, 6, 14, 18], "\uc0ac\uc2e4\ub3c4": 1, "\uadf8": [1, 2, 3, 6, 8, 10, 12, 14, 15, 17, 18, 19, 20, 27], "\uc678": [1, 14, 20], "\ub530\ub978": [1, 4, 6, 11, 18, 21, 23, 26, 28], "\uc7ac\ubbf8\uc788\ub294": 1, "\uc2e4\ud5d8\uacb0\uacfc\ub4e4\uc744": 1, "\uacf5\uc720\ud569\ub2c8\ub2e4": [1, 23, 27], "\uc544\uc9c1": [1, 6, 19, 22], "\uc190\uc758": 1, "\ubaa8\uc591\uc744": 1, "\uc0dd\uc131\ud558\uc9c0": 1, "\ubabb\ud558\ub294": [1, 10, 24], "\uc7ac\ucc28": 1, "climb": 1, "up": [1, 3, 8], "mountain": 1, "paint": [1, 23, 26], "2": [1, 2, 6, 10, 12, 17, 19, 20, 23, 24, 25], "hand": 1, "draw": [1, 16], "\ud558\ub2e8\uc758": 1, "\uc88c\uce21\uacfc": 1, "\uc6b0\uce21": 1, "\uc0ac\uc9c4\uc740": 1, "\uc774\ub77c\ub294": [1, 2, 22, 25], "\ub098\ube44\ub97c": 1, "\uc0dd\uc131\ud558\ub77c\ub294": 1, "\ucd94\ub860\ud574\ubcf8": 1, "\uc218\uc2dd\ud558\ub294": 1, "\uba85\uc0ac\uac00": 1, "\uc774\ub3c4\ub85d": 1, "\uc218\uc815\ud568\uc73c\ub85c\uc368": [1, 11], "butterfli": 1, "\uc0ac\uc9c4\uc744": [1, 20, 22], "\uc0dd\uc131\ud560\ub54c": 1, "\uc870\uae08\uc774\ub098\ub9c8": 1, "\uc6f9\ud230\uc758": 1, "\uadf8\ub9bc\uccb4\ub97c": 1, "\ubc18\uc601\ud560": 1, "\uc788\uc5c8\ub358": 1, "scale": [2, 3, 5, 6, 9, 13, 17, 22, 25, 27], "autoregress": 3, "multi": [2, 3, 5, 6, 19, 25, 27], "modal": [2, 3, 6, 19, 25], "refer": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28], "paper": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29], "http": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28], "scontent": 3, "gmp1": 3, "xx": 3, "fbcdn": 3, "net": [2, 3, 8, 12, 23, 25], "v": [2, 3, 5, 6, 8, 10, 12, 13, 16, 19, 21, 23, 24, 27], "t39": 3, "2365": 3, "358725877_789390529544546_1176484804732743296_n": 3, "pdf": [3, 6, 10, 14, 19, 21, 26], "_nc_cat": 3, "108": 3, "ccb": 3, "_nc_sid": 3, "3c67a6": 3, "_nc_ohc": 3, "plfu_ur_vyaax_nagu8": 3, "_nc_ht": 3, "oh": 3, "00_afdrhahxv1pcf0lqicjiynmorpvcgeq0emv5_ve2_tncvg": 3, "oe": 3, "652ff632": 3, "code": [2, 3, 4, 5, 8, 12, 13, 17, 19, 20, 21, 22, 23, 24, 27, 28, 29], "x": [2, 3, 4, 6, 7, 9, 10, 12, 13, 17, 20, 21, 22, 23, 24, 25, 27, 28], "jun": 3, "hyoung": 3, "lee": 3, "oct": [3, 9, 10, 14, 18, 27], "\ubcf5\uc7a1\ud558\uac8c": 3, "\uad6c\uc131\ub41c": [3, 27, 28], "\uac1d\uccb4": [3, 6, 20, 26], "\uc190": 3, "\uc0dd\uc131\ud55c\ub2e4": [3, 15], "\ud14d\uc2a4\ud2b8\uc640": [3, 6, 9, 10, 19], "\ub458": [3, 9], "\ub2e4": 3, "\ub2a5\ub825\uc744": [2, 3, 4, 6, 10, 18, 19], "\uac00\uc9c4": [3, 9, 11, 15, 16, 18, 19, 20, 21, 23, 27, 28], "\uac80\uc0c9": 3, "\uc99d\uac15": 3, "\ud1a0\ud070": 3, "\uae30\ubc18": [2, 3, 10, 14, 15, 18, 19, 21], "\ub514\ucf54\ub354": 3, "\uc804\uc6a9": [3, 25], "\uba40\ud2f0": 3, "\ubaa8\ub2ec": 3, "\ubaa8\ub378\uc774\ub2e4": [3, 9, 15], "cm3": 3, "\uc544\ud0a4\ud14d\ucc98\ub97c": [3, 15], "\uc0ac\uc6a9\ud558\uba70": 3, "\uad6c\uc870\uc801": 3, "\uc2a4\ud0c0\uc77c": [3, 20], "\ub370\uc774\ud130\uc5d0": [3, 15, 17, 18], "tun": 3, "\ud560": [2, 3, 4, 6, 8, 10, 15, 16, 17, 18, 19, 21, 22, 23, 25, 27], "\uc788\ub294": [2, 3, 4, 6, 9, 10, 11, 15, 17, 18, 19, 20, 22, 23, 26, 27], "\uac00\uc84c\ub2e4": 3, "\ubaa8\ub378\uc5d0": [3, 5, 6, 8, 9, 11, 15, 16, 18, 19, 23, 26, 27], "\ub9de\ub3c4\ub85d": [3, 15], "\ud559\uc2b5\ud588\ub2e4": [3, 15], "larg": [3, 5, 18, 27], "scale\uc758": 3, "\ub2e8\uacc4\ub97c": [3, 10, 15, 24], "\ud3ec\ud568\ud55c\ub2e4": [3, 15], "\ub370\uc774\ud130\ub294": 3, "\ub77c\uc774\uc13c\uc2a4\uac00": 3, "shutterstock\uc758": 3, "scale\ub85c": 3, "\ud559\uc2b5\ud55c\ub2e4": [3, 8, 15], "sft": 3, "\ub2e8\uacc4\ub85c": 3, "\uc9c4\ud589\ud588\ub2e4": 3, "\uc785\ub825\uacfc": [3, 20], "\ucd9c\ub825": [3, 20], "\ubaa8\ub450": [3, 5, 6, 7, 10, 11, 14, 15, 16, 17, 18, 19, 20, 22, 23, 26, 27], "\uc774\ubbf8\uc9c0\uc640": [3, 6, 7, 9, 10, 16, 19, 23, 26, 28], "\ud1a0\ud070\uc744": [3, 19], "\uc11e\uc744": 3, "\uc788\ub2e4": [2, 3, 7, 9, 11, 13, 14, 15, 16, 19, 20, 22, 25], "\ud504\ub86c\ud504\ud2b8\uc5d0": 3, "\ub9de\ub294": [3, 23], "\uc774\ubbf8\uc9c0\ub9cc": [3, 22], "\uc0dd\uc131\ud558\ub294\ub370": [3, 27], "cm3leon\uc740": 3, "\uace0\ud574\uc0c1\ub3c4": [3, 15, 18, 20], "output\uc744": [3, 4, 9, 15], "self": [3, 4, 7, 8, 10, 13, 17, 24, 25, 27, 28], "contain": 3, "\uc18c\uac1c\ud55c\ub2e4": [3, 9, 15], "iamg": 3, "\ubd80\ud130": [2, 3, 9, 21], "control": [3, 16, 17, 27], "segmentation\uae4c\uc9c0": 3, "\uac00\ub2a5\ud558\ub2e4": [3, 7, 8, 14, 16, 22], "3\uc5b5": 3, "\uac1c\uc758": [3, 10, 13, 15, 17, 19, 20, 21, 23], "\ud1a0\ud070\uc73c\ub85c": [3, 21], "\ud559\uc2b5\ud588\ub294\ub370": 3, "generation\ub3c4": 3, "\uc218\ud589\ud55c\ub2e4": 3, "\ud559\uc2b5": [2, 3, 4, 5, 6, 7, 8, 9, 11, 12, 13, 14, 15, 16, 18, 20, 21, 22, 23, 24, 25, 27], "\uc5f0\uc0b0\uc744": 3, "5\ubc30\ub85c": 3, "\uc904\uc600\ub2e4": 3, "zero": [3, 6, 9, 13, 18, 21, 25], "shot": [3, 6, 9, 16, 18, 21, 25], "coco\ub85c": [3, 25], "fid\ub97c": [3, 7], "\uce21\uc815\ud55c": 3, "88": 3, "\uc810\uc73c\ub85c": 3, "google\uc758": 3, "parti": 3, "\ubaa8\ub378\uc758": [2, 3, 4, 5, 6, 9, 10, 11, 13, 15, 16, 17, 18, 19, 22, 27], "\uc131\ub2a5\uacfc": [3, 14], "\uc218\uc900\uc744": 3, "\ub2ec\uc131\ud588\ub2e4": 3, "ra": 3, "cm3\ub97c": 3, "\uae30\ubc18\uc73c\ub85c": [3, 6, 10, 12, 14, 16, 20, 23, 28], "t2i": [3, 6, 10, 15], "\ub3c4\uba54\uc778\uc5d0\uc11c": 3, "\uc7a0\uc7ac\ub825\uc744": 3, "\uc5f0\uad6c\ud588\ub2e4": 3, "gafni\uc758": 3, "tokenizer\ub97c": [3, 19], "\uc0ac\uc6a9\ud588\ub2e4": [3, 15, 20], "tokenizer\ub294": 3, "256x256": [3, 9, 15, 18, 26], "8192\uac1c\uc758": 3, "vocabulary\uc5d0\uc11c": 3, "1024\uac1c\uc758": 3, "\uc778\ucf54\ub529\uc744": 3, "\uc9c4\ud589\ud55c\ub2e4": 3, "\ud14d\uc2a4\ud2b8\uc5d0\uc11c\ub294": 3, "zhang\uc758": 3, "\ucee4\uc2a4\ud140": 3, "56320": 3, "vocabulari": 3, "size": [3, 13, 15, 17, 20, 24, 25, 26, 27, 28], "\uc0c8\ub85c\uc6b4": [2, 3, 5, 10, 14, 15, 17, 18, 19, 22, 23, 24, 25, 26, 28], "\uc2a4\ud398\uc15c\ud55c": 3, "\ud1a0\ud070\uc778": 3, "break": 3, "figure_8_9": 3, "modality\uac04": 3, "transition\uc744": 3, "\ud558\uac8c": [3, 6, 15, 22, 24, 26, 27, 28], "\ud55c\ub2e4": [2, 3, 9, 12, 19, 22, 25], "\ubaa9\uc801": 3, "\uc785\ub825": [3, 18, 19, 20, 23, 26, 28], "sequence\uc5d0": 3, "\ub9de\ucdb0": [3, 9, 21], "\uad00\ub828\uc131\uc774": 3, "\ub192\uace0": 3, "\ubb38\uc11c": 3, "from": [3, 6, 8, 17, 24], "memori": [3, 13, 21, 25], "bank": 3, "\uac80\uc0c9\ud558\ub294": 3, "\uac83\uc774\ub2e4": [3, 9, 14], "dens": [3, 13], "strategy\uc744": 3, "\ud3ec\ud568\ud558\uace0": [3, 20], "\ucffc\ub9ac": 3, "q": [2, 3, 9, 12, 21, 22], "\uc608": 3, "sequenc": [3, 6, 13], "mathcal": [3, 4, 8, 9, 12, 13], "m": [3, 6, 7, 8, 12, 16], "\ub85c\ubd80\ud130": [2, 3, 4, 6, 9, 20, 23, 24, 27, 28], "\ud6c4\ubcf4": 3, "\uac00\uc9c0\uace0": [2, 3, 6, 9, 10, 15, 17, 20, 21, 24, 27, 28], "\uad00\ub828\uc131": 3, "\uc810\uc218": [3, 25], "r": [3, 8, 11, 12, 13, 16, 26], "return": [3, 4, 5, 7, 8, 13, 17, 24, 27, 28], "\ud574\uc900\ub2e4": [3, 22], "retriv": 3, "\ubc29\ubc95\uc740": [2, 3, 9, 10, 13, 19, 20, 23], "clip": [3, 5, 6, 10, 15, 19, 21, 23, 25, 26, 27], "\uae30\ubc18\uc778": 3, "bi": 3, "encod": [2, 3, 6, 9, 12, 21, 23, 24, 25, 26, 27, 28], "\uad6c\uc870\ub97c": [3, 4, 12, 17, 20, 21, 23, 25, 27], "\ub530\ub790\ub2e4": 3, "karpukhin": 3, "\ubb38\uc11c\ub97c": 3, "\ud30c\ud2b8\ub85c": 3, "\ubd84\ub9ac\ud558\uace0": 3, "\uc778\ucf54\ub354": 3, "vit": [3, 9, 15, 23], "b": [2, 3, 4, 8, 9, 10, 13, 17, 21, 25], "32": [3, 7, 8, 13, 17, 18, 21, 22, 27], "\ubb38\uc11c\uc758": 3, "vector": [3, 6, 12, 17, 19, 21, 23], "representation\ub85c\uc368": 3, "\ub450": [2, 3, 4, 6, 8, 10, 11, 15, 17, 18, 20, 23], "\uac1c\ub97c": 3, "\ud3c9\uade0\uc744": [2, 3], "\ub0b8\ub2e4": [3, 19], "\ucd5c\uc885": [3, 8, 15, 27], "\uac80\uc0c9\uc740": 3, "\uc810\uc218\uc5d0": [3, 25], "\uc815\ub82c\ub41c": 3, "\ubaa9\ub85d\uc744": 3, "\uc5bb\uae30": 3, "maximum": [2, 3], "inner": [3, 13], "product": [3, 6], "search\ub85c": 3, "generator\ub97c": [3, 17, 24], "\uc720\uc6a9\ud55c": 3, "\ucd94\ucd9c\ud558\uae30": 3, "\uc138": [2, 3, 6, 15, 17, 20, 23], "\uac00\uc9c0": [3, 4, 6, 10, 15, 17, 19, 20, 23], "\uc694\uc18c\ub97c": [3, 10, 18], "\uace0\ub824\ud588\ub2e4": 3, "relev": [3, 7], "\uac80\uc0c9\ub41c": 3, "\ubb38\uc11c\ub294": 3, "\uad00\ub828\uc788\uc5b4\uc57c": 3, "\uc810\uc218\ub97c": [3, 6, 9, 21, 25], "\uc0ac\uc6a9\ud55c\ub2e4": [3, 9, 15], "\ud14d\uc2a4\ud2b8\ub85c": [3, 4], "\ubb38\uc11c\ub85c": 3, "\ub610\ub294": [3, 9, 10, 14, 19, 20], "divers": [2, 3, 6, 10, 11, 22, 23], "\ub2e4\uc591\uc131\uc740": 3, "\ubb38\uc11c\uc5d0\uc11c": 3, "\uc911\ubcf5\uc131\uc744": 3, "\ud53c\ud558\uae30": 3, "\ud544\uc218\uc801\uc778": 3, "\uc808\ucc28\ub2e4": 3, "\ub2e8\uc21c\ud558\uac8c": 3, "\uae30\ubc18\ud574": [3, 13], "top": [3, 6, 15], "\ubb38\uc11c\ub9cc": 3, "\uac00\uc838\uc628\ub2e4\uba74": 3, "\uc911\ubcf5\uc774": 3, "\ubc1c\uc0dd\ud560": 3, "downstream": [3, 13], "\uc548\uc88b\uc740": 3, "\ub07c\uce60": 3, "\uc810\uc218\uac00": [3, 21, 25], "\uc774\ud558\ub85c": 3, "queri": [3, 5, 12, 13, 19], "dropout": 3, "\uac80\uc0c9\uc5d0": 3, "\uc0ac\uc6a9\ub41c": [2, 3, 20], "\ucffc\ub9ac\uc758": 3, "\uc0ad\uc81c": [3, 8], "\uc801\uc6a9\ud588\ub2e4": 3, "\ub2e4\uc591\uc131\uacfc": [3, 18], "\uc815\uaddc\ud654\ub97c": [3, 17], "\uc2dc\ucf30\ub2e4": [3, 15], "\ud14d\uc2a4\ud2b8\ub97c": [3, 6, 17, 18, 19], "\uac80\uc0c9\ud55c\ub2e4": 3, "\ud559\uc2b5\uc5d0\uc11c\ub294": 3, "\ub370\uc774\ud130\uc14b\uc758": [3, 6, 9, 18, 26], "\ubaa8\ub4e0": [2, 3, 5, 7, 10, 13, 14, 15, 16, 17, 19, 20, 22], "\ucea1\uc158": [3, 9], "\uc30d\uc5d0": 3, "\ub300\ud574": [3, 4, 5, 8, 9, 10, 11, 15, 18, 19, 20, 23, 24, 26, 27, 28], "\uc0d8\ud50c": [3, 6, 12, 17, 18], "3\uac1c\ub97c": 3, "\ubb34\uc791\uc704\ub85c": [3, 19], "\uc120\ud0dd\ud55c\ub2e4": 3, "\uc0ac\uc2e4\uc0c1": 3, "\uc0ac\uc804": [3, 6, 19, 23], "\ud559\uc2b5\uc5d0\uc11c": 3, "\uc0ac\uc6a9\ud560": [3, 4, 9, 13, 18], "\uc218\uc758": 3, "4\ubc30\uc774\ub2e4": 3, "chameleon": 3, "\ubcc0\ud615\uc2dc\ucf1c": 3, "mask": [3, 21, 26, 27], "infil": 3, "\ud45c\ud604\ud55c\ub2e4": 3, "\ucd94\uac00\ub418\uc5c8\uace0": 3, "\ub2e8\uc5b4\uc758": 3, "\uc7ac\ubc30\uce58\uac00": 3, "\uc9c4\ud589\ub410\ub2e4": 3, "\ud559\uc2b5\uc5d0\ub294": 3, "\ub2e4\uc74c": [2, 3, 17, 19, 20, 25, 27], "\uc608\uce21\ud558\ub294": [2, 3, 8, 9, 10], "\ub2e4\uc6a9\ub3c4": 3, "\uac00\uc838\uc654\ub2e4": [3, 15], "generation\uc5d0\uc11c\ub294": 3, "cm3\uac00": 3, "\ud504\ub86c\ud504\ud2b8\ub85c": [3, 18], "\uc0dd\uc131\ud558\uace0": [2, 3, 19, 21, 23], "cm3\ub294": 3, "\ud504\ub86c\ud504\ud2b8\ub97c": [3, 10, 15, 18], "\ud65c\uc6a9\ud55c\ub2e4": 3, "\ub514\ucf54\ub354\ub9cc": 3, "\uc0ac\uc6a9\ud558\ub294": [3, 4, 6, 8, 17, 20, 22, 23, 25, 26], "transform": [3, 8, 9, 11, 15, 18], "\uc544\ud0a4\ud14d\uccd0\ub97c": [3, 6], "zhang\uc5d0": 3, "bia": [3, 6, 8], "term": [2, 3, 11, 20, 28], "layer": [3, 4, 7, 8, 13, 17, 20, 24, 25, 27], "norm\uc758": 3, "\uac00\ub2a5\ud55c": [2, 3, 7, 8, 12, 16, 20, 27], "\ud30c\ub77c\ubbf8\ud130\ub97c": [3, 10, 13, 15, 18], "\uc81c\uac70\ud588\ub2e4": [3, 15], "length\ub97c": 3, "2048": 3, "4096\uae4c\uc9c0": 3, "\ud655\uc7a5\ud588\ub2e4": 3, "weight": [3, 5, 7, 10, 13, 18, 26, 27], "\ucd08\uae30\ud654": 3, "\ud3c9\uade0": [2, 3, 8, 23, 28], "\ud45c\uc900": [3, 20], "\ud3b8\ucc28": 3, "006": 3, "\uc778": [2, 3, 6, 12], "truncat": 3, "3\uc73c\ub85c": [3, 25], "\uc798\ub9b0": [3, 15], "normal": [3, 7, 8, 17, 20, 24, 25], "distribut": [2, 3, 6, 7, 8, 21, 22, 27, 28], "output": [2, 3, 4, 6, 8, 13, 20, 21, 23, 26], "0\uc73c\ub85c": [3, 4, 18, 20, 22, 26], "0\uc5d0": 3, "0002\ub85c": [3, 20], "posit": [3, 8, 9, 16], "embed": [3, 4, 5, 6, 8, 9, 13, 16, 21, 22, 23, 27], "\ucd08\uae30\ud654\ud55c\ub2e4": 3, "metaseq": 3, "\ud559\uc2b5\ub410\ub2e4": 3, "\uc0ac\uc774\uc988": 3, "350m": 3, "760m": 3, "7b": 3, "4t": 3, "trillion": 3, "9t": 3, "\uc8fc\uc694\ud55c": [3, 10], "\ud558\uc774\ud37c": 3, "\ud30c\ub77c\ubbf8\ud130\ub294": [3, 27], "learn": [3, 6, 13, 16, 17, 18, 19, 20, 25, 27], "rate": [3, 5, 7, 16, 27], "batch": [3, 5, 13, 20, 23, 24, 27], "size\ub85c": 3, "\uba40\ud2f0\ubaa8\ub2ec": 3, "\ub9de\uac8c": [3, 6, 9, 13, 15, 18], "\uc124\uc815\ud588\ub2e4": [3, 20], "\ucc38\uace0": [2, 3, 6, 18], "perplex": 3, "ppl": [3, 23], "\uc5b8\uc5b4": 3, "\ud3c9\uac00": [2, 3, 5, 14, 20, 23], "\ubc29\ubc95": [2, 3, 6, 22, 25, 29], "\ud558\ub098\uc774\ub2e4": 3, "\ud5f7\uac08\ub9ac\ub294": 3, "\uac12\uc774": [2, 3, 5, 7, 11, 16, 20, 21, 22, 25], "\ub0ae\uc744": [3, 4], "\uc218\ub85d": 3, "\uc88b\ub2e4": [3, 6, 13, 22], "\ubaa8\ub378\uc5d0\uc11c": [3, 15, 18, 19, 22, 25], "\uc54c\uace0\ub9ac\uc998\uc5d0": 3, "\uc0c1\ub2f9\ud55c": 3, "\uc5f0\uad6c\uac00": [3, 9, 10], "\uc9c4\ud589\ub418\uc5b4": 3, "\uc654\ub2e4": [3, 11], "dall": [3, 6, 9, 10, 23, 25, 26], "e\ub294": [3, 21], "\uc544\uc6c3\ud48b\uc758": 3, "\ud5a5\uc0c1\ub418\ub294": [3, 18], "e": [2, 3, 5, 6, 7, 8, 11, 12, 14, 16, 17, 23, 24, 25, 27, 28], "\ub294": [2, 3, 4, 6, 8, 9, 10, 11, 12, 13, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 27, 28], "\uc0d8\ud50c\ub9c1\uacfc": 3, "512\uac1c": [3, 21], "re": 3, "rank": 3, "\uc804\ub7b5\uc744": [3, 10], "\ucc44\ud0dd\ud588\ub2e4": 3, "make": [3, 13], "scene": [3, 6, 26], "\uae30\ubc18\uc758": [3, 10, 26], "guidance\ub85c": 3, "ranking\uc5d0": 3, "\uc624\uc9c1": [3, 20], "16": [2, 3, 8, 12, 13, 15, 17, 21, 22, 23, 27], "\uc0d8\ud50c\ub9cc": 3, "\ud544\uc694\ud558\uac8c": 3, "\ub428\uc73c\ub85c\uc368": [3, 26], "\ud6c4\ubcf4\uc758": 3, "\uc218\ub97c": [3, 15, 21], "\ud655\ub960\uc801": 3, "\uae30\uc220\ub85c": [3, 19], "\uc0ac\uc6a9\ub41c\ub2e4": [3, 8, 15], "\uc0d8\ud50c\ub9c1\uc5d0\uc11c": 3, "softmax\uc758": 3, "temperature\ub97c": 3, "\uc218\uc815\ud574": [3, 6], "\uc608\uce21": [3, 6, 7, 8, 10, 21], "\ubb34\uc791\uc704\uc131\uc744": 3, "\uc81c\uc5b4\ud55c\ub2e4": 3, "nucleu": 3, "\uc0d8\ud50c\ub9c1\uc73c\ub85c\ub3c4": 3, "\ubd88\ub9ac\uace0": 3, "\ubbf8\ub9ac": [3, 19], "\uc815\uc758\ud55c": [3, 11], "\uc784\uacc4\uac12\uc744": [3, 25], "\ucd08\uacfc\ud558\ub294": 3, "\ub204\uc801": 3, "\ud655\ub960\uc744": [3, 6, 24], "\uac00\uc7a5": [2, 3, 5, 6, 9, 10, 17, 18, 19, 20, 23], "\uc791\uc740": [2, 3, 5, 8, 10, 13, 15, 18, 19, 20], "\uc0c1\uc704": 3, "\uc138\ud2b8\uc5d0\uc11c": 3, "\uc0d8\ud50c\ub9c1\uc744": [3, 18, 24], "begin": [3, 8, 27], "align": [3, 5, 8, 18, 19, 25, 26, 27], "operatornam": 3, "logit": 3, "_": [3, 4, 10, 12, 13, 17, 23, 24, 27, 28], "cond": [3, 15], "t": [2, 3, 4, 5, 6, 7, 9, 11, 12, 21, 22, 23, 27], "left": [3, 8, 10, 12, 13, 15, 22, 25, 28], "t_y": 3, "mid": [3, 8, 13, 14], "t_x": 3, "right": [3, 8, 10, 12, 13, 22, 25, 28], "uncond": 3, "bf": [3, 22], "mathrm": [3, 8], "cf": 3, "alpha_c": 3, "cdot": [3, 12, 24, 27], "end": [2, 3, 8, 17], "cfg\ub294": 3, "uncondit": [3, 8, 9, 22], "\uc0d8\ud50c\uc744": [2, 3, 9, 10, 19, 20, 28], "condit": [2, 3, 5, 6, 8, 11, 14, 18, 22, 23, 26, 27], "\uc0d8\ud50c\uc5d0": [3, 15, 18], "\ud558\ub294": [2, 3, 4, 8, 9, 10, 11, 14, 15, 16, 17, 18, 19, 20, 22, 23, 25, 27, 28], "\uac83\uc744": [3, 4, 6, 8, 10, 11, 14, 15, 17, 18, 19, 20, 21, 22, 23, 25, 26, 27], "\uc758\ubbf8\ud55c\ub2e4": [3, 14], "text\ub97c": [3, 9, 21], "\ubaa9\ud45c\uc758": 3, "\ub9c8\uc2a4\ud06c": [3, 9], "\ub300\uccb4\ud55c\ub2e4": 3, "\ubaa9\ud45c\ub97c": 3, "\ud559\uc2b5\uc758": 3, "\ud575\uc2ec": [3, 10, 14, 18, 22], "\uc774\uc810": 3, "\ud558\ub098\uc774\uba70": 3, "finetun": [3, 5], "\uc5c6\uc774": [3, 5, 7, 8, 9, 10, 15, 16, 18, 19, 20, 21, 22, 25, 27], "\uc5c6\ub294": [3, 15, 16, 20, 22, 23, 25], "guidance\ub97c": [3, 9, 22, 25], "\uc218\ud589\ud560": [3, 6], "\ucd94\ub860\uc5d0\uc11c\ub294": 3, "stream\uc744": 3, "\ud14d\uc2a4\ud2b8\uc5d0": [3, 6], "\ub2ec\ub77c\uc9c0\ub294": [3, 23], "stream\uacfc": 3, "\ud1a0\ud070\uc5d0": 3, "condition\ub41c": 3, "stream": 3, "cfg\uc5d0\uc11c": 3, "logit\uc758": 3, "\ube84\uc148": 3, "\uc5f0\uc0b0\uc774": [3, 12], "\ud14d\uc2a4\ud2b8\uc5d0\uc11c": [3, 10], "\ubc29\ubc95\uc758": [3, 19], "log": [3, 8, 13, 18, 21, 22, 24, 28], "probability\ub97c": 3, "\ube84\uc148\ud558\ub294": 3, "\uc5f0\uc0b0\uacfc": 3, "\ube44\uc2b7\ud558\ub2e4": [3, 15], "ms": [3, 9, 21, 25], "coco": [3, 9, 21, 25, 27], "30k": 3, "fid": [2, 3, 6, 8, 9, 11, 12, 15, 17, 21, 22, 25, 27], "\uce21\uc815\ud588\ub2e4": 3, "onli": [3, 7, 15, 16, 21], "\ud6a8\uc728\uc131\uc774": 3, "\ucd94\ub860\uc5d0\uc11c": 3, "1\uac1c": [3, 21], "2\uac1c\ub85c": 3, "\uc608\uc81c\ub85c": 3, "\ub3d9\uc791\ud560": [3, 18], "\uc6b0\uc218\ud55c": [3, 7, 10, 15, 20, 21], "\uae30\ub85d\ud588\ub2e4": [3, 16], "\uace0\ud488\uc9c8": [3, 6, 10, 15], "\ud655\uc7a5\uc2dc\ud0a4\ub294": 3, "\uac80\uc0c9\uc758": 3, "\uc911\uc694\uc131\uc744": [3, 10, 20], "\ubcf4\uc5ec\uc900\ub2e4": [3, 7, 9, 15, 16, 19, 25], "figure5": 3, "llm\uc5d0\uc11c": 3, "\uc911\uc694\ud55c": [3, 6, 7, 11, 17, 19, 20], "\ub2e8\uacc4\uc774\ub2e4": 3, "\uba85\ub839\uc5b4": 3, "\uc774\ud574\ud558\ub294": 3, "\ub3c4\uc640\uc8fc\uba70": 3, "task\uc5d0\uc11c\ub3c4": 3, "\uc5bb\uc5c8\ub2e4": [3, 15], "\ud29c\ub2dd\uc774": 3, "task\uc5d0": [3, 4, 12, 13, 19, 20], "\ub208\uc5d0": 3, "\ub744\uac8c": 3, "\uc99d\ud3ed\uc2dc\ud0a4\ub294": 3, "\ubc1c\uacac\ud588\ub2e4": 3, "cm3leon\uc744": 3, "task\ub97c": [3, 13, 14, 19, 21], "\uc11e\uc5b4": 3, "\ub113\uc740": 3, "\ubc94\uc704\uc5d0\uc11c": 3, "\ud588\ub2e4": [3, 15, 20], "\uacfc\uc815\uc740": 3, "\ub530\ub974\uba70": 3, "instruction\uacfc": 3, "\ucd9c\ub825\uc744": 3, "\uacb0\ud569\ud574": 3, "objective\ub97c": [3, 13, 19], "figure6": 3, "\uae30\ubc18\ud55c": [2, 3, 7], "initi": [3, 13], "image\ub97c": [3, 5, 8, 9, 10, 12, 14, 18, 19, 20, 21], "\uc218\uc815\ud558\ub294": [3, 19], "task\uc774\ub2e4": 3, "instructpix2pix": 3, "\ud558\ub298\uc758": 3, "\uc0c9\uc744": 3, "\ud30c\ub780\uc0c9\uc73c\ub85c": 3, "\ubcc0\uacbd\ud574\uc918": 3, "\ud3b8\uc9d1\uc774": 3, "\uc774\uac83\uc740": [3, 6, 10, 18], "cm3leon\uc774": 3, "\ub3d9\uc2dc\uc5d0": [3, 5, 15, 16], "\uc774\ud574\ud558\uace0": 3, "\uc788\uc5b4\uc11c": 3, "feature\uacfc": [3, 15], "\uc0dd\uc0b0\ud558\ub294": 3, "controlnet": [3, 27], "\uc0dd\uc131\uc5d0": [3, 9, 10, 18, 19, 27], "\uacf5\uac04\uc801": 3, "\uc815\ubcf4": [3, 16], "\uc704\uce58": 3, "\ud1b5\ud569\uc2dc\ud0ac": [3, 6], "\uc788\ub3c4\ub85d": [3, 10, 15, 19, 20, 26], "figure16": 3, "flamingo": 3, "1000\uc5b5": 3, "openflamingo": 3, "400\uc5b5": 3, "30\uc5b5": 3, "\uc740": [2, 3, 5, 6, 8, 15, 17, 19, 20, 22, 23, 24, 25, 27, 28], "\ud1a0\ud070\uc784\uc5d0\ub3c4": 3, "\ubd88\uad6c\ud558\uace0": [3, 6, 9, 18, 20], "\ub3d9\ub4f1\ud55c": 3, "ad": [4, 27], "arxiv": [2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28], "org": [2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28], "ab": [2, 4, 5, 7, 8, 9, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 27, 28], "2302": [4, 27], "05543": 4, "lllyasviel": 4, "mai": [4, 12, 19, 22, 23], "28": [4, 28], "\uae30\uc874\uc758": [2, 4, 6, 14, 18, 27], "\ubaa8\ub378\ub4e4\uc740": [4, 5, 6, 22], "prompt\ub85c": [4, 16, 19], "\uc870\uc808\ud560": [4, 17, 18], "\ud558\uc9c0\ub9cc": [2, 4, 5, 6, 7, 9, 10, 11, 13, 14, 16, 18, 19, 20, 22, 23, 24, 28], "\uc774\ub7f0": [4, 6, 17, 18], "control\ub9cc\uc73c\ub85c": 4, "\uc870\uc808\ud558\ub294\ub370": 4, "\ud55c\uacc4\uac00": [4, 16, 19, 20, 25, 27], "condition\uc744": [4, 5, 15], "\ucd94\uac00\uc801\uc73c\ub85c": [4, 6, 9, 15, 18], "\uc918\uc11c": 4, "\uc0dd\uc131\ub418\ub294": [4, 14, 16, 17, 18, 22, 27], "controlnet\uc774\ub77c\ub294": 4, "\uc2e0\uacbd\ub9dd": [2, 4, 10], "\uc81c\uc548\ud569\ub2c8\ub2e4": [4, 6, 10], "\uadf8\ub9bc\uc740": [4, 6, 12, 15, 17, 18], "high": [4, 6, 10, 11, 12, 14, 15, 17, 18, 21, 24, 25, 27], "qualiti": [4, 7, 16, 21, 22, 24, 25, 28], "detail": [4, 6, 10, 16], "profession": 4, "prompt\uc640": [4, 5, 9], "\uc67c\ucabd": [4, 10, 15, 18], "\uc544\ub798\uc758": [2, 4, 6, 12, 15, 25], "canni": 4, "edge\ub97c": 4, "input\uc73c\ub85c": [4, 12, 15, 17], "\ubc1b\uc544\uc11c": [4, 6, 10, 17, 28], "\uc624\ub978\ucabd\uc758": 4, "\uc2dd\uc73c\ub85c": 4, "\ucd94\uac00\uc801\uc778": [4, 5, 8, 9, 12, 13, 14, 17, 19, 20, 27], "\uadf8\ub9bc\uc5d0\uc11c\ub294": 4, "edg": [2, 4, 20, 27], "\ubc1b\uc544": [4, 8, 18, 22], "\uac83\uc774": [2, 4, 6, 8, 9, 10, 13, 17, 18, 19, 20, 22, 23, 25, 26, 27, 28], "controlnet\uc774": 4, "\uc5ed\ud560\uc785\ub2c8\ub2e4": 4, "gener": [2, 4, 6, 9, 10, 11, 16, 17, 19, 21, 22, 23, 24, 25, 27], "conrolnet": 4, "\uadf8\ub7ec\uba74": [4, 17], "\uc5b4\ub5a4": [4, 6, 8, 9, 13, 15, 18, 19, 20, 28], "\uac00\ub2a5\ud558\uac8c": [4, 13, 17], "\ud588\uc744\uae4c\uc694": [4, 6], "\uc774\uc81c\ubd80\ud130": 4, "\uc54c\uc544\ubcf4\ub3c4\ub85d": [4, 17], "\ud558\uaca0\uc2b5\ub2c8\ub2e4": [4, 10, 17, 28], "controlnet\uc758": 4, "\uad6c\uc870\ub294": [4, 17, 18, 27], "\ub2e4\uc74c\uacfc": [2, 4, 6, 9, 10, 12, 17, 19, 20, 23, 24, 27, 28], "\uac00\uc9d1\ub2c8\ub2e4": [4, 12], "pretrain": [2, 4, 5, 9, 13, 14, 15, 16, 18, 19, 21], "lock": 4, "copy\uc640": 4, "trainabl": [4, 7, 10, 11, 13], "copy\ub97c": 4, "\uc0ac\uc6a9": [2, 4, 5, 8, 9, 11, 16, 20, 21, 22, 25], "\uc65c": [2, 4, 6, 11], "\uc774\ub807\uac8c": [2, 4, 10, 18, 19, 24], "\uc124\uacc4\ud588\ub294\uc9c0": 4, "\uc54c\uc544\ubd05\uc2dc\ub2e4": 4, "\uc6b0\uc120": [4, 6, 9, 15, 23], "\uc774\uc720\ub294": [4, 8, 15], "\uae30\uc874\uc5d0": [4, 5, 12, 13, 14, 17, 18], "\ubc29\ub300\ud55c": 4, "\uc591\uc758": [2, 4], "\ud559\uc2b5\uc2dc\ud0a8": [4, 9], "\uc720\uc9c0\ud558\uae30": 4, "\uc704\ud574\uc11c\uc785\ub2c8\ub2e4": 4, "\ub370\uc774\ud130\uac00": [4, 8, 15, 17, 18, 19, 20, 24, 28], "\uc591\uc774": [4, 18], "\uacbd\uc6b0\uc5d0": [4, 18, 20, 27], "\uc624\ubc84\ud53c\ud305\uc744": 4, "\ud53c\ud560": 4, "\ud6a8\uacfc\ub3c4": 4, "convolution\uc774\ub780": 4, "weight\ub791": 4, "bias\uac00": [4, 17], "\ucd08\uae30\ud654\ud55c": 4, "1x1": 4, "convolution\uc744": 4, "\ub9d0\ud569\ub2c8\ub2e4": [4, 20], "\ud6c8\ub828\uc774": 4, "\uc2dc\uc791\ub418\uae30": 4, "\uc804\uc5d0\ub294": 4, "input\uc5d0": [4, 21], "model\uacfc": [4, 9, 11, 12], "output\uc774": [4, 20], "\ub611\uac19\uc544\uc9d1\ub2c8\ub2e4": 4, "\ubaa8\ub378\uc774\ub791": 4, "\ub611\uac19\uc740": 4, "\uac00\uc9c0\uac8c\ub418\ubbc0\ub85c": 4, "\uc720\uc9c0\ud560": [4, 10, 14, 19], "\uc788\uc73c\uba70": [4, 10, 18, 20, 21, 25], "\uac83\uacfc": [2, 4, 9, 19, 20, 26], "\ube44\uc2b7\ud558\ubbc0\ub85c": 4, "scratch\ubd80\ud130": 4, "\ud559\uc2b5\ud558\ub294": [4, 8, 11, 14, 17, 20, 24], "\uac83\uc5d0": [4, 19, 20], "\ube60\ub974\uac8c": [4, 10, 11, 22, 25], "\ud6c8\ub828\uc2dc\ud0ac": 4, "\uc788\uac8c\ub429\ub2c8\ub2e4": 4, "convolution\uc740": 4, "\uc5b4\ub5bb\uac8c": [4, 6, 8, 18, 19], "\ud558\ub294\uc9c0": 4, "\uc790\uc138\ud788": [4, 6, 10, 17, 23], "\uba3c\uc800": [4, 10, 17, 18, 21, 24], "\uc704\uc758": [2, 4, 6, 7, 9, 11, 18], "\uadf8\ub9bc\uc5d0\uc11c": [4, 6, 17, 18, 20, 23], "\ud574\ub2f9\ud558\ub294": [4, 9, 26], "\ubd80\ubd84\uc744": [4, 9, 15, 18, 23, 26, 27], "\uc218\uc2dd\uc73c\ub85c": 4, "\ud45c\ud604\ud558\uaca0\uc2b5\ub2c8\ub2e4": 4, "mathbf": [4, 17], "y": [2, 4, 6, 7, 8, 13, 17, 20, 21, 22, 27], "f": [4, 7, 8, 9, 12, 13, 20, 23, 24], "theta": [2, 4, 8, 9, 10, 12, 13, 21, 22, 23, 24, 27, 28], "featur": [2, 4, 5, 6, 8, 15, 16, 17, 23, 27], "map": [2, 4, 6, 8, 16, 20, 27, 28], "neural": [4, 7, 11, 20], "network": [4, 7, 8, 10, 14, 20, 24], "paramet": [2, 4, 7, 8, 10, 13, 16, 21, 23, 25, 27], "\uc758\ubbf8\ud569\ub2c8\ub2e4": [4, 10, 18], "\uc704": [2, 4, 6, 8, 10, 11, 13, 16, 18, 19, 20], "\uadf8\ub9bc\uc758": [4, 6], "\ud45c\ud604\ud558\uae30\uc704\ud574": 4, "\ub9cc\ub4e4\uc5b4\uc11c": [4, 6, 19], "parameter\ub97c": [4, 5, 8, 18, 21, 22], "theta_": 4, "c": [4, 5, 8, 9, 10, 12, 16, 17, 23, 27], "\ub77c\uace0\ud558\uace0": 4, "\uace0\uc815\uc2dc\ucf1c\ub450\uaca0\uc2b5\ub2c8\ub2e4": 4, "z": [2, 4, 8, 12, 13, 16, 21, 22, 24, 28], "\ud45c\ud604\ud558\uace0": 4, "convolution\uc758": 4, "z1": 4, "z2": 4, "\ub450\uaca0\uc2b5\ub2c8\ub2e4": 4, "\ud45c\ud604\ud560": [2, 4, 10], "\uadf8\ub7f0\ub370": [4, 17], "weight\uc640": [4, 18], "bias\uc758": 4, "\ucd08\uae43\uac12\uc774": 4, "0\uc774\ubbc0\ub85c": 4, "\uc9c4\ud589\ub418\uc9c0": 4, "\uc54a\uc558\uc744": [4, 18], "\uc785\ub2c8\ub2e4": [4, 6, 10, 18, 20, 24, 28], "\uc2dc\uc791": [2, 4], "controlnet\uacfc": 4, "\ub0b4\ubbc0\ub85c": 4, "\ubcf4\uc874\ud560": 4, "\uc804\ubd80": 4, "\ucd08\uae30\ud654\ub418\uc5b4\uc788\uc73c\uba74": 4, "gradient\uac00": [4, 20], "0\uc774\ub77c\uc11c": 4, "\uc548": [4, 6, 20], "\ub418\ub294\uac70": 4, "\uc544\ub2d0\uae4c\uc694": 4, "\ud655\uc778\ud558\uae30": [4, 9], "\uac04\ub2e8\ud55c": [2, 4, 15, 19], "\uacbd\uc6b0\ub97c": [4, 10], "\uc0dd\uac01\ud574\ubcf4\uc8e0": 4, "wx": 4, "gradient\ub294": 4, "frac": [4, 8, 12, 17, 22, 24, 28], "partial": [4, 5, 8], "0\uc774\uace0": [4, 18], "neq0": 4, "\uc774\ub77c\uace0": [2, 4, 6, 19], "\ud558\uba74": [2, 4, 6, 18, 20], "\uccab": [2, 4, 15, 17, 20], "\ubc88\uc9f8": [2, 4, 15, 17, 18], "gradient": [4, 7, 8, 11, 13, 22, 24], "step\uc5d0\uc11c": [4, 8, 11], "weight\ub294": [4, 13], "0\uc774": [4, 7, 8], "\uac12\uc73c\ub85c": [4, 9, 11, 13, 28], "\uac00\uac8c\ub418\uace0": 4, "\ub418\ubbc0\ub85c": [2, 4], "\uc5ec\uae30\uc11c": [2, 4, 8, 10, 12, 17, 18, 19, 20, 28], "\ud575\uc2ec\uc801\uc778": [4, 9], "\uac00\uc815\uc774": 4, "\uc778\ub370": 4, "\ubd80\ubd84\uc740": [4, 9, 10, 12, 17, 18, 27], "\ud6c8\ub828\ub41c": [4, 10, 18, 19], "\uc0ac\uc6a9\ud558\uace0": [4, 5, 10, 13, 16, 18, 19, 23, 26], "\uc788\uae30": [4, 20], "\ub54c\ubb38\uc5d0": [4, 6, 8, 10, 13, 17, 18, 20, 23, 27, 28], "\uc704\ubc30\ub420": 4, "\uac00\ub2a5\uc131\uc774": [4, 11], "\uc9c0\uae08\uae4c\uc9c0": [4, 8], "\uc598\uae30\ud55c": 4, "diffusion\uc5d0": 4, "\uc801\uc6a9\ud55c": [4, 9, 11, 14, 15, 27], "\uadf8\ub9bc\uacfc": [2, 4, 17, 20, 21, 24, 25, 28], "overal": [4, 6, 25], "structur": [4, 12, 13, 17, 27], "loss\ub294": [4, 8], "diffusion\uc5d0\uc11c": 4, "\ucd94\uac00\ub41c": [4, 8], "\ud615\ud0dc\uc785\ub2c8\ub2e4": [4, 17], "loss": [2, 4, 7, 10, 11, 12, 14, 16, 21, 22, 23, 24, 28], "training\uc744": [4, 9], "50": [4, 8, 18, 19, 20], "\ud655\ub960\ub85c": 4, "empti": [4, 9], "string\uc73c\ub85c": 4, "\ubc14\uafd4\uc8fc\uc5c8\ub2e4\uace0": 4, "prompt\uac00": [4, 5], "\uc8fc\uc5b4\uc9c0\uc9c0\uc54a\uc744": 4, "semantics\ub97c": 4, "\ubc30\uc6b0\ub294": 4, "\uacbd\ud5a5\uc774": [4, 5, 6, 16], "\uc0dd\uc131\uc744": [4, 10, 19, 25], "\ud5a5\uc0c1\uc2dc\ucf1c\uc904": 4, "\uc788\ub2e4\uace0": [4, 6, 8, 9, 15, 18, 19, 21, 27, 28], "\uacb0\uacfc\ub294": [4, 9, 11, 18, 20], "training\uc774": 4, "\ubc29\ubc95\ubcf4\ub2e4": 4, "\ud6a8\uc728\uc801\uc774\ub77c\ub294": 4, "\ubcf4\uc5ec\uc90d\ub2c8\ub2e4": [4, 20, 23, 27], "effici": [4, 6, 7, 13, 25], "\uacb0\uacfc\ub4e4\uc740": 4, "\uacb0\uacfc\ub4e4\uc785\ub2c8\ub2e4": 4, "\ub17c\ubb38\uc5d0": [4, 8, 18, 28], "\uc788\uc73c\ub2c8": [2, 4], "\ucc38\uace0\ud558\uc2dc\uae30": 4, "\ubc14\ub78d\ub2c8\ub2e4": 4, "pose": [4, 16, 17, 23, 27], "limitation\uc774\ub77c\uace0": 4, "\uc774\ubbf8\uc9c0\uc785\ub2c8\ub2e4": [4, 6], "\uc8fc\uc5c8\uc74c\uc5d0\ub3c4": 4, "\uc6d0\ud558\ub294": [4, 5, 6, 14, 16, 17, 18, 20], "\uc0dd\uc131\ub418\uc9c0": 4, "\uc54a\ub294": [4, 6, 10, 18, 20, 23], "\ubc1c\uc0dd\ud588\uc2b5\ub2c8\ub2e4": 4, "limit": 4, "\ucf54\ub4dc\ub294": 4, "\uacf5\uc2dd": 4, "\uad6c\ud604": [4, 24, 28], "\uac00\uc838\uc654\uc2b5\ub2c8\ub2e4": 4, "\ucd08\uae30\ud654\ud558\ub294": 4, "\ucf54\ub4dc\ub85c": [4, 15], "\ub9cc\ub4e4": [2, 4, 6, 19, 21], "\uc0ac\uc6a9\ub429\ub2c8\ub2e4": [4, 10], "def": [4, 7, 8, 13, 17, 24, 27, 28], "zero_modul": 4, "modul": [4, 8, 13, 17, 24, 27, 28], "out": [4, 8, 27, 28], "p": [2, 4, 6, 11, 22, 23, 28], "detach": [4, 24], "zero_": 4, "\uae30\ubcf8\uc801\uc73c\ub85c": [4, 13, 16, 25, 27], "nn": [4, 8, 13, 17, 24, 27, 28], "sequential\uacfc": 4, "\uac19\uc740\ub370": 4, "time": [2, 4, 5, 7, 8, 9, 11, 12, 13, 15, 16, 21, 22, 25, 27], "step\uac19\uc740": 4, "input\uc744": 4, "\ubc1b\uc544\uc904": 4, "\uc788\uac8c": [4, 6, 15, 18, 25], "\ub9cc\ub4e0": [4, 17, 18, 20], "timestepembedsequenti": 4, "sequenti": [4, 8, 17, 24, 27], "timestepblock": 4, "pass": [4, 6], "timestep": [4, 5, 6, 23, 27], "children": 4, "support": 4, "an": [4, 6, 17, 19, 28], "extra": [4, 6], "forward": [2, 4, 9, 11, 13, 14, 17, 20, 23, 24, 27, 28], "emb": [4, 8], "context": [4, 6, 8, 13, 15, 16, 21, 23], "none": [2, 4, 8, 17, 27], "isinst": 4, "elif": [4, 8, 23], "spatialtransform": 4, "els": [4, 7, 8, 13, 17, 23, 27], "github\uc758": 4, "cldm": 4, "py\uc5d0": 4, "class\uc785\ub2c8\ub2e4": 4, "init": [4, 13], "\uae38\uc5b4\uc11c": 4, "\uc0dd\ub7b5\ud588\uc2b5\ub2c8\ub2e4": 4, "__init__": [4, 7, 8, 17, 24, 27, 28], "make_zero_conv": 4, "channel": [4, 8, 17, 22, 26, 27], "conv_nd": 4, "dim": [4, 8, 17, 23, 27], "pad": [4, 8, 27], "hint": [4, 5], "kwarg": 4, "t_emb": 4, "timestep_embed": 4, "model_channel": 4, "repeat_onli": 4, "fals": [4, 7, 8, 13, 24, 27], "time_emb": 4, "guided_hint": 4, "input_hint_block": 4, "h": [4, 8, 12, 13, 21, 22, 27], "type": [4, 23, 24], "dtype": [4, 8, 23], "zero_conv": 4, "zip": [4, 7, 8], "input_block": 4, "append": [4, 8, 17, 24, 27], "middle_block": 4, "middle_block_out": 4, "customizi": 5, "To": [5, 6, 7], "cvpr": [2, 5, 11, 12, 17, 23, 26], "2212": [5, 26], "04488": 5, "offici": [5, 7, 21, 22], "seunghwan": [5, 7, 11, 14, 16], "ji": [5, 7, 11, 14, 16], "aug": [5, 11, 16], "\ub6f0\uc5b4\ub09c": [5, 7, 11, 18, 21], "\ubcf4\uc774\ub294": [5, 7, 11, 14, 20], "\ucd94\uc138": 5, "user\uc758": 5, "private\ud55c": 5, "concept\uc744": [5, 19], "\uc0dd\uc131\ud558\uace0\uc790\ud558\ub294": 5, "\uc695\uad6c\ub294": 5, "\ud480\uc9c0": 5, "\ubabb\ud568": 5, "diffusion\uc740": 5, "partial\ud55c": 5, "\ubd80\ubd84\ub9cc\uc744": 5, "\ud559\uc2b5\uc2dc\ud0b4\uc73c\ub85c\uc368": 5, "\uae30\uc874\ubcf4\ub2e4": 5, "\ube60\ub978": [5, 10], "\ubc29\uc2dd\uc744": [5, 9, 14, 27], "\uc81c\uc548": [5, 7, 11, 18, 19, 20, 21], "\ubfd0": 5, "concept\uc5d0": [5, 19], "\ud559\uc2b5\uc774": [2, 5, 8, 16, 20, 27], "\uac00\ub2a5": [2, 5, 8, 9, 22, 27], "\ud558\ub098\uc758": [2, 5, 8, 13, 16, 17, 19, 20, 28], "compress\ud558\ub294": 5, "\ucd5c\uadfc": [5, 11, 14, 16, 20], "\ubaa8\ub378\ub4e4\uc774": [5, 6, 17, 18, 23, 27], "\ud65c\ubc1c\ud558\uac8c": 5, "\uc5f0\uad6c": [5, 11, 14], "\ub418\uc5b4\uc9d0": 5, "\uc785\ub825\ub9cc\uc73c\ub85c": 5, "\uc0dd\uc131\ud574\ub0b4\ub294": [2, 5, 6, 11, 14], "\uc218\uc900\uae4c\uc9c0": [5, 11], "\uc774\ub984": 5, "\uc774\ub7ec\ud55c": [2, 5, 6, 10, 11, 15, 16, 18, 19, 20, 23, 28], "general\ud55c": 5, "\uc0dd\uc131\ud558\uc9c0\ub9cc": [5, 10, 24], "user\uac00": 5, "specif": [5, 17, 23], "concept\uc758": [5, 19], "g": [5, 7, 9, 11, 16, 17, 20, 23, 24, 27], "\ud589\ubcf5\ud55c": 5, "\uc6b0\ub9ac": [5, 19, 20], "\uac00\uc871": 5, "\uc6b0\ub9ac\uc9d1": 5, "\uac15\uc544\uc9c0": 5, "\ubf40\uc090\uac00": 5, "\ud30c\ub9ac\ub85c": 5, "\uc5ec\ud589\uc744": 5, "\ub5a0\ub098\ub294": 5, "\ub4f1": [5, 7, 9, 10, 13, 16, 19, 23], "\uacfc\uc815\uc911\uc5d0": 5, "\ub370\uc774\ud130\ub97c": [5, 6, 8, 9, 11, 15, 16, 18, 20, 24, 28], "\ubcf4\uc9c0": [5, 20, 23], "\ubabb\ud588\uae30\ub54c\ubb38\uc5d0": 5, "model\uc5d0\uac8c\ub294": 5, "\ub2f9\uc5f0\ud55c": 5, "\uba87\uc7a5\uc758": 5, "\ud3ec\ud568\ud558\ub294": [5, 19, 20, 26], "\uc774\ubbf8\uc9c0\ub9cc\uc73c\ub85c": [5, 16], "finetuning\ud558\ub294": [5, 10], "\ubc29\uc2dd": 5, "In": 5, "person": [5, 6, 10, 19], "\ubaa9\ud45c": [5, 6, 19, 20], "\ud559\uc2b5\ud558\uace0\uc790\ud558\ub294": 5, "\uc0dd\uc131\ud574\ub0b4\uc57c\ud568": 5, "\ud559\uc2b5\ub418\uc5c8\ub358": 5, "finetuning\ud55c": 5, "\ud6c4\uc5d0\ub3c4": [5, 10], "customization\uc774": 5, "\uc5b4\ub824\uc6b4": [5, 19], "\uc774\uc720": [2, 5, 11, 20], "\uc9c4\ud589\ud558\ub2e4\ubcf4\uba74": 5, "\ud559\uc2b5\ud588\ub358": 5, "\uc78a\uc5b4\ubc84\ub9ac\uac70\ub098": 5, "\uc65c\uace1\ud574\ubc84\ub9bc": 5, "draft": 5, "overfit": [5, 23], "\ub418\uc5b4\uc11c": 5, "\uacb0\uacfc\ubb3c\uc758": [5, 15], "variation\uc774": [5, 17], "\ub0ae\uc544\uc9d0": 5, "\uc880\ub354": [5, 7, 11, 16], "\ub098\uc544\uac00": 5, "\uc5b4\ub824\uc6c0": [2, 5], "text\ub85c": 5, "\uacfc\uc815": [5, 8, 11, 19], "\uc131\ub2a5": [2, 5, 6, 13, 18, 20, 21, 22, 23, 24, 27], "\uc720\uc9c0\ub97c": 5, "real": [5, 14, 16, 24], "image\uc640": [5, 9, 20, 21], "caption\uc744": 5, "regular": [5, 8, 28], "data\ub85c": 5, "tuning\ub3d9\uc548": 5, "augment": [5, 16, 18], "\uc18c\uac1c": [2, 5, 6], "gan": [2, 5, 7, 9, 14, 16, 20, 21, 27], "\ubc29\uc2dd\uc758": [5, 7, 14, 15, 22], "model\ub4e4\uc774": [5, 7], "\ubcf4\uc5ec\uc8fc\uace0\uc788\uc74c": 5, "\uac8c\ub2e4\uac00": [5, 6, 9, 15], "control\ub3c4": 5, "\uac00\ub2a5\ud568": [5, 13, 19, 21, 25], "general\ud558\uc9c0": 5, "\uc54a\uc740": [5, 6, 10, 11, 15, 17, 18, 20], "\uc0dd\uc131\uc740": 5, "\ubd88\uac00\ub2a5\ud568": 5, "new": [5, 7, 10, 17, 19], "global\ud55c": 5, "distribution\uc744": [5, 16, 17, 21], "\uc774\ubbf8": [5, 20, 25], "\ud3ec\ud568\ud55c": [5, 14], "\uc18c\ub7c9\uc758": [5, 8], "\uae30\ubc95": [5, 13], "learning\uc740": 5, "\uc0dd\uac01\ubcf4\ub2e4": 5, "\ud6a8\uacfc\uc801\uc774\uace0": 5, "\uc720\uc6a9\ud568": 5, "\ub300\ubd80\ubd84": [5, 11, 16, 18], "\uc2dc\uc5d0\ub294": [5, 8, 11, 21, 27], "\uc804\uccb4\ub97c": [5, 26], "\ud559\uc2b5\ud558\uac70\ub098": 5, "\ucd94\uac00\ud574": [5, 8, 14, 15], "\uc7ac\ud559\uc2b5": [5, 7, 22], "\uc704\uc5d0\uc11c": [5, 9, 10, 14, 15], "customization\uc758": 5, "\ubb38\uc81c\ub97c": [5, 13, 15, 19, 25], "\uc77c\uc73c\ud0a4\uae30": 5, "\uc26c\uc6c0": 5, "etc": [5, 14], "\uc544\uc8fc": 5, "\uc77c\ubd80\ub9cc\uc744": 5, "\ub300\uc0c1\uc73c\ub85c": [5, 20], "\ucee8\uc149\uc73c\ub85c": 5, "finetuning\uc744": [5, 10], "\ud1b5\ud55c": [5, 14, 19, 22], "\uc5f0\uad6c\ub4e4\uc774": [5, 11, 16], "\uc788\uc74c": [2, 5, 8, 11, 13, 18, 19, 20, 21, 22, 25], "textual": [5, 10, 23], "invers": [5, 6, 14, 23], "vs": [5, 7, 9, 15, 20, 21, 22, 25, 26, 27], "\ubaa8\ub378\ub4e4\uc744": [5, 26], "compress\ud560": 5, "finetuning\ud568\uc73c\ub85c\uc368": 5, "resourse\ub97c": 5, "\uc808\uc57d\ud560": 5, "backbone\uc73c\ub85c": 5, "latent": [2, 5, 6, 10, 14, 15, 16, 17, 23, 26, 27, 28], "\ucc44\ud0dd": [5, 22], "l": [2, 5, 9, 10, 12, 27, 28], "dm\uc758": 5, "equat": [5, 6, 7, 9, 11, 14, 16, 22], "x_": [2, 5, 7, 8, 9, 16, 22, 23], "\uc2dc\uc810\uc5d0": [2, 5], "noise\uac00": [5, 8, 9, 11, 17], "\uc11e\uc778": 5, "text\ub098": 5, "\ubc14\ub85c": [5, 6, 8, 15, 17], "\uc0ac\uc6a9\ud558\uc9c0\uc54a\uace0": 5, "space\ub85c": [5, 14, 19], "embedding\ub41c": 5, "\uac12\uc744": [2, 5, 7, 11, 14, 16, 21, 22, 25], "us": [5, 6, 7, 19, 20, 21, 25, 29], "\u03b5": [5, 7], "nois": [2, 5, 7, 8, 10, 11, 14, 15, 17, 18, 22, 23, 24, 27, 28], "\u03b5_": 5, "\u03b8": 5, "\ub080": 5, "\u03b5\ub97c": 5, "\uc608\uce21\ud574\ub0b4\ub294": [5, 8], "\uc989": [2, 5, 6, 8, 9, 14, 19, 20, 21, 22, 25, 27], "ldm": [2, 5, 10, 12, 15, 16], "tuning\ud560\ub54c\ub294": 5, "layer\uc5d0\ub300\ud574": 5, "update\ud558\ub294\uac8c": 5, "\uae30\ubcf8": [5, 15, 18], "\ubc29\uc2dd\uc740": [5, 10, 13, 19, 27], "resource\uac00": 5, "\ube44\ud6a8\uc728\uc801\uc73c\ub85c": 5, "\ub9ce\uc774\ub4e4\uace0": 5, "\uc774\ubbf8\uc9c0\uc5d0": [5, 7, 11, 14, 16, 18, 19, 20, 21], "overfitting\ub418\uae30": 5, "\ubcc0\ud654\ub7c9\uc744": 5, "\uccb4\ud06c": 5, "delta": [2, 5, 10, 13], "while": 5, "\ubd80\ubd84\uc5d0\ube44\ud574": 5, "cross": [5, 10, 15, 16, 23, 25, 27], "attent": [2, 5, 8, 9, 10, 12, 13, 15, 16, 21, 22, 25, 27], "\uc5f0\uc0b0\uc758": [5, 12], "wegith": 5, "\ubcc0\ud654\ub7c9\uc774": [2, 5], "\ud07c": [2, 5], "fig": [5, 7, 11], "latent\uc5d0": 5, "\uc8fc\uc785\ud558\ub294": 5, "mechan": [2, 5], "kei": [5, 12, 13, 14, 16], "valu": [5, 7, 13], "parameter\uc5d0": 5, "\ub2e8": [5, 10, 14, 16, 23], "\ucc28\uc9c0": 5, "\uc758\ubbf8\ud558\ub294": [5, 18, 28], "\ud3ec\ud568\ub418\ub294": 5, "k": [5, 8, 12, 13, 14, 21, 24, 27], "\ub9cc": [2, 5, 6, 8, 13, 23], "\ub098\uba38\uc9c0\ub294": [5, 16, 20], "freez": [5, 10, 13, 23, 25], "\uc2e4\uc81c\ub85c\ub294": [5, 7], "\uc4f0\uc9c0\uc54a\ub294": 5, "\ub2e8\uc5b4\ub85c": 5, "\ud615\uc2dd\uc73c\ub85c": 5, "captioning\ud55c": 5, "\ud6c4\uc5d0": [5, 22], "\ub610": [5, 11, 14, 15, 16, 23, 26], "finetuning\uc911\uc5d0": 5, "\uc78a\uc5b4\ubc84\ub9ac\ub294": 5, "\ud604\uc0c1\uc774": [5, 17, 26], "\uc788\uc744\uc218\uc788\uc74c": 5, "moon": 5, "\uc0dd\uc131\ud558\uba74": [5, 20], "finetuning\ud588\ub358": 5, "moongat": 5, "\uc0dd\uc131\ud574\ubc84\ub9bc": 5, "\ubc29\uc9c0\ud558\uae30\uc704\ud574": 5, "world\uc758": 5, "image\uc5d0\uc11c": [5, 20, 21], "target": [5, 6, 8, 16, 19, 20, 23], "\uc720\uc0ac\ud55c": [2, 5, 10, 12, 19, 20, 21, 23, 28], "200\uc7a5\uc758": [5, 16], "regul": 5, "\uc720\uc0ac\ud558\ub2e4": 5, "clip\uc5d0\uc11c": [5, 18], "\ucd94\ucd9c\ud55c": 5, "space\uc0c1\uc758": 5, "vector\uac00": 5, "similar\ud558\ub2e4": 5, "joint": [5, 9, 21, 22], "trane": [5, 20], "\uac01\uac01\uc758": [2, 5, 16], "\uac16\ub294": [2, 5, 6, 9], "rare\ud55c": 5, "key\ub97c": 5, "\ubd80\uc5ec\ud574": [5, 27], "i": [2, 5, 8, 9, 12, 13, 14, 15, 17, 21, 23, 24, 25, 27, 28], "constrain": 5, "optim": [5, 7, 10, 16, 19, 21, 23, 28], "merg": [5, 13], "concept\uc73c\ub85c": 5, "\ud559\uc2b5\ub41c": [5, 18, 19, 20, 22, 25], "weight\ub97c": [5, 13, 18], "w_0": [2, 5, 13], "appendix": 5, "a\uc5d0\ub294": 5, "\ub77c\uace0": [2, 5, 6, 10, 13, 19, 25, 28], "\ub098\uc640\uc788\ub294\ub370": 5, "\uc624\ud0c8\uc790\uc77c": 5, "\uac00\ub2a5\uc131": 5, "c_": [2, 5, 15, 16, 23], "reg": 5, "caption\uc758": 5, "\ubf51\uc544": [5, 22], "concat": [5, 6, 15], "\uacf1\ud55c": 5, "\uac12\uacfc\uc758": 5, "norm\uc744": [5, 20], "\uacc4\uc0b0\ud588\uc744\ub54c": 5, "n\uac1c\uc758": [5, 21], "attention\uc774": 5, "\ub3d9\uc791\ud558\ub294": [5, 12], "\ucc3e\uc544": [5, 19], "\ud558\ub098\ub9cc": 5, "\uc0ac\uc6a9\ud558\uc790": 5, "250": 5, "two": [5, 6, 17, 19, 21, 23, 27], "500": [5, 18], "8": [2, 5, 8, 11, 12, 13, 14, 17, 21, 24, 27], "10": [2, 5, 8, 10, 11, 14, 15, 18, 22, 24, 27], "resiz": 5, "veri": 5, "small": [5, 19, 26, 27], "far": 5, "awai": 5, "zoom": 5, "techniqu": [5, 8, 14, 25], "qualit": [2, 5, 10, 21, 27], "evalu": [2, 5, 6, 7, 9, 26], "quant": [5, 19], "kid": [5, 14], "\uc5bc\ub9c8\ub098": [5, 6, 8, 14, 18, 19, 22], "\ub300\uc751\ub418\ub294": 5, "\uc0dd\uc131\ud574\ub0c8\ub294\uac00": 5, "image\uc758": [5, 9, 11, 12, 14, 16, 19], "\ud45c\ud604\ud574\ub0c8\ub294\uac00": 5, "tabl": [2, 5, 10, 11, 13, 15, 16, 18, 20, 22, 27], "\uc815\uc131\uc801": [5, 21], "\uc815\ub7c9\uc801": [5, 10, 21], "human": [5, 6, 9, 17, 19, 26], "prefer": [5, 25], "studi": [2, 5, 10, 16], "baselin": [5, 21], "customdiffus": [5, 10], "all": [5, 6, 13], "\uc120\ud638": 5, "inversion\uc740": [5, 19], "alignment\ub294": 5, "\uc120\ud638\ub3c4\uc640": 5, "\ube44\uc2b7\ud558\uc9c0\ub9cc": [5, 18, 20], "alignment\uc218\uce58\ub97c": 5, "diffusion\uc774": 5, "\ub9e4\uc6b0": [2, 5, 6, 7, 8, 13, 19, 20, 25], "\ub192\uc544": 5, "overfitting\ub41c": [5, 16], "ablat": [2, 5, 10, 16, 22], "\u314cgen": 5, "\ub300\uc2e0": [5, 8, 10, 11, 15, 16, 19, 20, 21], "generate\ub41c": 5, "\uc218\uce58\ub294": [5, 11, 22], "regulat": 5, "world": 5, "customizing\uc774": 5, "\uac00\ub2a5\ud558\uace0": 5, "resourse\uac00": 5, "Of": 5, "category\uc758": 5, "object\uc5d0": 5, "\ub300\ud574\uc11c\ub294": [5, 9, 11, 18], "\ub3d9\uc791\ud558\uc9c0": [5, 11], "\uc54a\uc74c": [5, 8, 19, 25], "hierarch": 6, "2022": [6, 9, 12, 20, 25], "2204": 6, "06125v1": 6, "seonhoon": [2, 6], "sep": [6, 25, 26], "18": [6, 23], "2022\ub144\uc5d0": 6, "\uacf5\uac1c\ub418\uc5b4": 6, "\uc138\uc0c1\uc744": 6, "\ub180\ub77c\uac8c": 6, "\ub2a5\ub825\ub3c4": 6, "\ub6f0\uc5b4\ub0ac\uace0": 6, "\uc0ac\uc6a9\uc790": 6, "\uc785\ub9db\uc5d0": 6, "\uc870\uc791\ud560": 6, "\ub418\uc5c8\uc8e0": 6, "\uc774\ub984\uc740": 6, "\uc77c\uae4c\uc694": 6, "\ucd08\ud604\uc2e4\uc8fc\uc758": 6, "\ud654\uac00": 6, "salvador": 6, "dali": 6, "wall": 6, "\ud569\uc131\uc5b4\uc785\ub2c8\ub2e4": 6, "\uc0dd\uc131\ud574\ub0b8": 6, "\uacb0\uacfc\ubb3c\uc774": [6, 15, 21], "\uacfc\uc5f0": 6, "\uc5b4\ub5bb\uae38\ub798": 6, "\uacb0\uacfc\ubb3c": [6, 21], "\uc0dd\uc804": 6, "\ubaa8\uc2b5": 6, "vibrant": 6, "portrait": [6, 16], "robot": 6, "half": 6, "face": [6, 10, 17, 24], "\uc2e4\uc81c": [6, 10, 13, 14, 18, 19, 20, 23, 24], "\ubaa8\uc2b5\uc774": [6, 20], "\ubcf4\uc774\ub124\uc694": 6, "\ucd08\ud604\uc2e4\uc8fc\uc758\uc801": 6, "\uac19\uae30\ub3c4": 6, "corgi": 6, "\uc5b4\ub5a4\uac00\uc694": 6, "s": [2, 6, 7, 16, 17, 19, 23, 24, 25, 26, 29], "head": [6, 8, 16, 22], "depict": 6, "explos": 6, "nebula": 6, "\ubaa8\uc2b5\uc744": [6, 19], "\uc131\uc6b4\uc758": 6, "\ud3ed\ubc1c\ub85c": 6, "\ubb18\uc0ac\ud574\ub2ec\ub77c\uace0": 6, "\ud588\uc744": [6, 23], "\uadf8\ub9bc\uc785\ub2c8\ub2e4": [6, 28], "nasa": 6, "\ucd2c\uc601\ud55c": 6, "\ucd08\uc2e0\uc131": 6, "\ud3ed\ubc1c\uc758": 6, "\uc794\ud574\uc785\ub2c8\ub2e4": 6, "\uc815\ub9d0": 6, "\uadf8\ub7f4\ub4ef\ud558\uc9c0": 6, "\uc54a\ub098\uc694": 6, "thi": [6, 7, 8, 13, 19, 23, 29], "mosaic": 6, "one": [2, 6, 16, 20], "largest": 6, "ever": 6, "taken": 6, "hubbl": 6, "space": [2, 6, 10, 15, 19, 21, 23, 27, 28], "telescop": 6, "crab": 6, "six": 6, "light": 6, "year": 6, "wide": 6, "expand": [6, 22, 27], "remnant": 6, "star": 6, "supernova": 6, "\uc8fc\uc758\uc0ac\ud56d": 6, "\ubcf8": [2, 6, 8, 10, 13, 18, 21, 22, 25], "\ub0b4\uc6a9\uc744": [2, 6, 10], "\ube44\uc120\ud615\uc801\uc73c\ub85c": 6, "\uc0b4\ud3b4\ubd05\ub2c8\ub2e4": 6, "\ub9c8\uce58": 6, "\uc624\ud508\uc6d4\ub4dc": 6, "\uac8c\uc784\ucc98\ub7fc": 6, "\ub9d0\uc774\uc8e0": 6, "\ud575\uc2ec\uc774": 6, "\ub418\ub294": [6, 21, 22, 23, 26, 27], "\uc9c8\ubb38\ub4e4\uc744": 6, "\ub358\uc9c0\uba70": 6, "\ud30c\ud5e4\uccd0": 6, "\uac81\ub2c8\ub2e4": 6, "\ud3ec\uc2a4\ud305\uc740": 6, "openai": 6, "blog": [6, 19], "assemblyai": 6, "youtub": [2, 6, 13, 21], "eden": 6, "meyer": 6, "\ucc38\uace0\ud588\uc2b5\ub2c8\ub2e4": 6, "\ubcf8\uaca9\uc801\uc73c\ub85c": 6, "\ud559\uc2b5\ud558\uae30": [2, 6], "\uc804\uc5d0": [6, 23], "\uc54c\uc544\uc57c\ud560": 6, "\uac83\uc740": [6, 8, 18, 19, 20, 27], "\ubaa8\ub378\uc785\ub2c8\ub2e4": [6, 12, 17], "The": [6, 17], "fundament": 6, "principl": 6, "ar": [6, 7, 8, 13, 15, 27], "quit": 6, "simpl": [6, 11, 22, 26, 27], "first": [6, 7, 13], "associ": 6, "caption": [6, 9, 21, 25], "through": [6, 9, 19], "respect": [6, 22, 25], "object": [2, 6, 7, 19, 21, 23, 26], "dimension": [6, 8], "Then": [6, 13], "cosin": [6, 19, 22, 23], "similar": [6, 21, 23], "each": [6, 17, 23], "pair": [6, 13, 14, 25, 27], "comput": [6, 7, 13, 20, 22, 23, 25, 26, 27], "simultan": 6, "maxim": [6, 21], "between": [2, 6, 9, 21, 25], "n": [2, 6, 8, 9, 12, 13, 21, 27], "correct": [6, 23], "minim": 6, "incorrect": [6, 10, 23], "\ud1b5\ud569\uc2dc\ucf30\uc2b5\ub2c8\ub2e4": 6, "\ucd5c\ucd08\ub294": 6, "\uc815\ub2f5\uc740": 6, "\uc544\ub2d9\ub2c8\ub2e4": [6, 17], "22\ub144": 6, "5\uc6d4": 6, "\uc0ac\uc6a9\ud558\uc9c0": [6, 10], "imagen": [6, 10, 23], "\uc5d0\uac8c": 6, "sota": [6, 16, 18, 21, 25, 27], "\ub0b4\uc8fc\uc5c8\uc2b5\ub2c8\ub2e4": 6, "\uc544\ud0a4\ud14d\uccd0": [6, 21, 22, 25], "\ucc0d\uba39\ud558\uae30": 6, "\ub0b4\uc758": [6, 15], "semant": [2, 6, 10, 15, 19, 20, 27], "\ud3ec\ucc29\ud574\ub0bc": 6, "\ud45c\ud604": [6, 8], "\ub04c\uc5b4\uc62c\ub9ac\uae30": 6, "\uc704\ud574\uc11c": [2, 6, 9, 17, 20], "\uc800\uc790\ub4e4\uc740": [6, 9, 15, 18, 19, 21, 22, 25], "\ud1b5\ud569\ud55c": 6, "stage": [2, 6, 27], "\uc774\uac83\uc774": 6, "\uc778\ub370\uc694": 6, "unclip": 6, "\ubd80\ub985\ub2c8\ub2e4": 6, "level": [6, 14, 15, 17, 27], "overview": [6, 28], "architectur": [6, 7, 10, 17, 19, 24, 25, 26, 28], "\ubcf5\uc7a1\ud574\ubcf4\uc774\ub2c8": 6, "assembl": 6, "ai": [6, 10, 20], "\ub2e8\uc21c\ud654\ub41c": 6, "\uadf8\ub9bc\uc744": [6, 15, 17, 18, 20], "\uc0b4\ud3b4\ubcfc\uac8c\uc694": 6, "www": [2, 6, 13, 21], "com": [2, 6, 13, 19, 20, 21, 23], "watch": [2, 6, 13, 21], "f1x4fhzf4mq": 6, "360": 6, "ab_channel": [2, 6], "decod": [6, 21, 23, 27, 28], "\ubaa8\ub378\uc778": [6, 14, 25], "\uac19\ub124\uc694": 6, "\ucea1\uc158\uc744": 6, "\uc0c1\uc751\ud558\ub294": 6, "\uc0dd\uc131\ud569\ub2c8\ub2e4": [6, 10, 20, 27, 28], "autogregress": 6, "\ube44\uad50\ud558\ub294": [6, 10], "\uc2e4\ud5d8": [2, 6, 10, 18, 19, 22, 24], "\uc218\ud589\ud588\uc2b5\ub2c8\ub2e4": 6, "computation": [6, 27], "\ud558\uace0": [6, 12, 15, 27], "\ud6c4\ubc18\ubd80\uc5d0\ub294": 6, "\uc2e4\ud5d8\ud569\ub2c8\ub2e4": 6, "\ubaa8\ub378\ub9cc": 6, "\uc774\ub791": [6, 21, 28], "\uc0ac\uc6a9\ud588\uc744\uae4c\uc694": 6, "represent": [2, 6, 12, 21], "\ud559\uc2b5\ud558\ub294\ub370": [6, 19, 23], "\ud070": [2, 6, 8, 10, 11, 13, 15, 17, 18, 19, 20, 27], "\uc131\uacf5\uc744": 6, "\uac70\ub450\uace0": 6, "shift": 6, "robust": [2, 6, 27], "capabl": 6, "\ub6f0\uc5b4\ub0ac\uc2b5\ub2c8\ub2e4": 6, "vision": [6, 14, 18, 26], "task": [2, 6, 13, 20, 21, 24, 25], "\ub418\uc5b4": [2, 6, 10, 17, 24, 27], "\ub2ec\uc131\ud574\ub0c8\uc2b5\ub2c8\ub2e4": 6, "video": [2, 6, 20], "tak": 6, "\uac31\uc2e0\ud558\ub294": 6, "\uc911\uc774\uc5c8\uc8e0": 6, "non": [2, 6, 11, 22, 27], "determinist": [6, 7, 22], "\ub355\ubd84\uc5d0": 6, "\uc874\uc7ac\ud558\uc9c0": [6, 20], "essenti": 6, "\ubcc0\uc8fc\ud558\uba74\uc11c": 6, "\uc720\uc9c0": [6, 27], "\uc788\uc8e0": 6, "variat": [6, 8, 28], "\uc67c\ucabd\uc758": 6, "\ub4e4\uc740": [2, 6, 25], "\ubcf4\uc874\ub429\ub2c8\ub2e4": 6, "\uadf8\ub4e4\uc774": 6, "\ud45c\ud604\ub418\ub294": 6, "\ubc29\uc2dd\uc774\ub098": 6, "\uc870\uae08\uc529": [2, 6, 18], "\ubc14\ub01d\ub2c8\ub2e4": 6, "\uadf8\ub7fc\uc5d0\ub3c4": [6, 20], "\ud2b9\uc720\uc758": 6, "\ud654\ud48d\uc740": 6, "\uc720\uc9c0\ub418\ub294": 6, "\ubcc0\uc8fc\uace1\ucc98\ub7fc": 6, "\ub9e4\ubc88": [6, 13, 21], "\uc0c8\ub86d\uac8c": [6, 19], "\uc5f0\uc8fc": 6, "\ud574\ub0bc": 6, "\uc788\ub294\uac81\ub2c8\ub2e4": 6, "\ud30c\ud5e4\uce58\uae30": 6, "\uc774\ubc88\uc5d0\ub294": 6, "\uc0b4\ud3b4\ubcf4\uc8e0": 6, "\uc790\uccb4\uc758": 6, "\uc124\uba85": [6, 22], "\uc0ac\uc2e4": [6, 11], "\uc870\uac74\uc73c\ub85c": 6, "\ubc1b\ub294": [6, 8, 17], "\uc790\uccb4\ub3c4": 6, "\ubc1b\uc2b5\ub2c8\ub2e4": 6, "\ubb3c\ub860": [6, 18], "\ubc1b\uaca0\uc8e0": 6, "\uc11c\ub85c": [2, 6, 17, 20], "1\ub3001": 6, "\ub300\uc751\ub418\uae30": 6, "duel": 6, "\ubb38\uc81c\ub420": 6, "\uc5c6\ub2e4\uace0": [6, 17], "\ubcc0\ub860\ud569\ub2c8\ub2e4": 6, "\ud004\ub9ac\ud2f0\ub97c": 6, "\ub192\uc774\uae30": [6, 11], "2\uac1c\uc758": [6, 27], "\uc8fc\uc5b4\uc9c4": [6, 9, 10, 19, 21], "\ub192\uc740": [2, 6, 8, 9, 10, 11, 13, 14, 15, 16, 18, 19, 21, 22, 25], "dot": 6, "\uc0ac\uc6a9\ud588\ub2e4\uace0": [6, 9, 18, 23], "modifi": 6, "glide": [6, 26], "project": [6, 12, 13, 22, 27], "\uc8fc\uc7a5\ud569\ub2c8\ub2e4": [6, 27], "\ud1b5\ud569\uc2dc\ud0a4\ub0d0\ud558\uba74": 6, "\ucd94\uac00\ud558\uace0": [6, 8, 10], "token": [6, 13, 19, 21, 23, 27], "\ud558\ub294\uac70\uc8e0": 6, "\ubc29\ubc95\uc73c\ub85c": [6, 18, 19, 22], "\uc6d0\ubcf8": [6, 14, 18, 19, 23, 27, 28], "\uc601\uc0c1\uc744": 6, "process": [2, 6, 11, 14, 19, 21, 23, 24, 27], "\uc0ac\uc6a9\ud568\uc73c\ub85c\uc368": [6, 22, 26], "\uc788\ub358": 6, "photorealist": [2, 6, 9, 18, 25], "\ud65c\uc6a9\ud560": [6, 19], "\uadf8\ub807\ub2e4\uba74": [2, 6], "\ud544\uc694\ud560\uae4c\uc694": 6, "obtain": 6, "full": [6, 8, 11, 16, 26], "we": [6, 19, 23, 27], "combin": [2, 6], "which": [6, 19, 25], "possibl": 6, "given": 6, "\ub531\ud788": [6, 20], "\uc640\ub2ff\uc9c0\ub294": 6, "\uc54a\uc2b5\ub2c8\ub2e4": [6, 20], "\uc2e4\ub9dd\ud558\uae34": 6, "\uc774\ub985\ub2c8\ub2e4": 6, "\uc720\ubb34\uc5d0": 6, "\ud488\uc9c8\uc744": [6, 15, 18, 19], "\uc2e4\ud5d8\uc744": [2, 6, 9, 12], "\uc218\ud589\ud588\ub2e4\uace0": [6, 9], "\ud55c\ubc88": [6, 8, 23, 28], "\uc0b4\ud3b4\ubcfc\uae4c\uc694": 6, "\uc218\ud589": [2, 6, 19, 21, 24], "\ubaa8\ub378\ucc98\ub7fc": 6, "\uc8fc\uc5b4": [6, 20], "\uac16\ucd94\uace0": 6, "\ud6cc\ub96d\ud588\uc2b5\ub2c8\ub2e4": 6, "\ud2b9\ud788": [6, 15, 19, 20, 25, 27], "3\uac00\uc9c0": [6, 10, 25, 26, 27], "\uacbd\uc6b0\uc758": [2, 6, 20], "\uc544\ud0a4\ud14d\uccd0\uc5d0": 6, "sampl": [2, 6, 12, 16, 22, 23, 24, 25, 27, 28], "signal": [6, 7], "same": [6, 13], "\uadf8\ub807\uc9c0\ub9cc": [6, 25], "\uc758\ubb38\uc774": [6, 18], "\ub9d0\ub054\ud788": 6, "\ud574\uc18c\ub418\uc9c0\ub294": 6, "\uc65c\ub0d0\ud558\uba74": [2, 6, 20], "95": 6, "\uc2dc\uac04": [2, 6, 8, 13, 20], "\ub3d9\uc548": [6, 15, 24], "\ubc29\uc2dd\uc73c\ub85c": [2, 6, 8, 10, 13, 16, 22], "\ubc29\uc2dd\uc5d0": [6, 19], "\uadf8\ub300\ub85c": [6, 9, 22, 27], "\uc801\uc6a9\ud574": [6, 15], "\uc2e4\ud5d8\ud588\uc2b5\ub2c8\ub2e4": 6, "\uacf5\uc815\ud55c": 6, "\uc2e4\ud5d8\uc774\ub77c\uace0": 6, "\ubcf4\uae34": 6, "\uc5b4\ub824\uc6b8": 6, "true": [6, 7, 8, 13, 24, 27], "\ud559\uc2b5\uc2dc\ucf30\uc744": 6, "\ub54c\uc758": [2, 6, 15, 18], "\ube44\uad50": [2, 6, 7, 11, 21, 22, 25], "\uc2e4\ud5d8\uc740": [2, 6], "\uc5c6\uc2b5\ub2c8\ub2e4": [6, 20, 27], "\uac1c\uc778\uc801\uc73c\ub85c": [6, 17, 18], "\uc800\ub294": [6, 17], "\ubcf4\uace0": [6, 8, 18], "\ubc18\ub4dc\uc2dc": [6, 14], "\uc368\uc57c\ud558\ub294": 6, "\uadfc\uac70\uc5d0": 6, "\uc124\ub4dd\ub825\uc774": 6, "\ub5a8\uc5b4\uc9c4\ub2e4\uace0": 6, "\uc0dd\uac01\ud588\uc2b5\ub2c8\ub2e4": 6, "\uc368\uc57c\ud560\uae4c\uc694": 6, "\uac1d\uccb4\ub97c": [6, 20], "\ubb18\uc0ac\ud55c": 6, "\uac1d\uccb4\uc758": 6, "\uc2dc\uac01\uc801": [6, 15, 19], "\ubc1c\ud604": 6, "\uc0ac\uc774\uc758": [2, 6, 8, 9, 20], "\uc758\ubbf8\ub860\uc801": 6, "\uad00\uacc4\ub97c": [6, 7, 11], "\ud559\uc2b5\ud588\uc2b5\ub2c8\ub2e4": 6, "\ub2a5\ub825\uc774": 6, "\uc911\uc694\ud558\ub2e4\uace0": [6, 20, 23], "manipul": [6, 14, 16], "diff": 6, "appli": 6, "interpol": [6, 11], "normalis": 6, "produc": 6, "descript": [6, 23], "\ud558\ub294\uc9c0\ub294": 6, "\uace7": [2, 6], "\uc0b4\ud3b4\ubcf4\uaca0\uc2b5\ub2c8\ub2e4": [6, 24, 27], "\uadf8\ub798\uc11c": [6, 9, 18, 20, 24], "\ubb50\uac00": 6, "\uc88b\uc740\uac00\uc694": 6, "\ud3c9\uac00\ud558\uae30": [2, 6, 18], "\uc0dd\uc131\ubb3c\uacfc": 6, "\uc0dd\uc131\ubb3c\uc744": 6, "\uc0ac\ub78c\ub4e4\uc5d0\uac8c": 6, "\uc81c\uc2dc\ud558\uace0": 6, "photor": [6, 9, 25], "\ub300\ud574\uc11c": [2, 6, 9, 10, 12, 17, 18, 19, 20, 23, 24], "\ub9e4\uae30\ub3c4\ub85d": 6, "when": [6, 17, 24, 25], "guidanc": [6, 18, 26, 27], "both": 6, "comparison": [2, 6, 9, 13, 14, 16, 22, 23, 25], "versu": 6, "\uacb0\ub860\uc740": 6, "compar": [6, 19, 20], "\ud6e8\uc52c": [6, 9, 10, 11, 18, 20], "\uac00\ub2a5\ud569\ub2c8\ub2e4": [6, 10, 23, 27], "bipartit": 6, "\uad6c\uc870": [6, 7, 17, 18, 20], "z_i": [6, 12], "x_t": [2, 6, 8, 9, 12, 22, 27], "\uc778\ucf54\ub529": [6, 12], "\ud65c\uc6a9\ud574\uc11c": [6, 22, 28], "ddim": [2, 6, 9, 18, 19, 27], "\ub41c": [6, 10, 13, 15, 18, 23, 27], "\uc5bb\uc73c\uba70": 6, "\ubcf5\uc6d0\ud558\ub294\ub370": 6, "\ud544\uc694\ud55c": [6, 19, 20], "\uc794\uc5ec": 6, "\uc815\ubcf4\ub4e4\uc744": [6, 27], "\uc9c0\ub2d9\ub2c8\ub2e4": 6, "\ubcc0\uc8fc\ud558\uae30": 6, "\u03b7": [6, 7], "\uc801\uc6a9\ud569\ub2c8\ub2e4": [6, 20], "\uc77c": [2, 6, 24], "\ud574\uc9c0\uace0": 6, "\ubcf5\uc6d0\ud574\ub0c5\ub2c8\ub2e4": 6, "\ucee4\uc9c8\uc218\ub85d": [6, 7, 11, 16], "\uc5d0\ub294": [2, 6, 8, 25], "stochast": [2, 6, 7, 9, 14, 24], "\uc0dd\uae30\uace0": 6, "\uadfc\ucc98\uc5d0\uc11c": 6, "perceptu": [6, 20], "centere": 6, "\ub9cc\ub4e4\uc5b4\ub0bc": [6, 9], "\ud0a4\uc6b0\uba74": 6, "\uc6b0\ub9ac\ub294": [6, 20, 28], "\uc874\uc7ac\ud558\uace0": 6, "\uc720\uc2e4\ub418\uc5c8\ub294\uc9c0": 6, "\ud0d0\uc0c9": 6, "\ud0d0\uc0c9\ud574\ub0bc": 6, "\uc788\ub294\uac70\uc8e0": 6, "\uac83\ub3c4": [6, 8, 18, 28], "\ud574\uc11c": [6, 19, 22, 24], "\uc900\ub2e4\uba74": 6, "\ucea1\uc158\uc774": 6, "\uc8fc\uc5b4\uc838\uc788\uc744": 6, "\uc6b0\ub9ac\uac00": [6, 8, 20], "method": [2, 6, 7, 17, 21, 22], "z_t0": 6, "current": [6, 7, 8], "\uc774\uace0": [2, 6, 24, 27, 28], "z_t": [6, 12, 27], "\uc774\ub77c\uba74": [2, 6], "embd": 6, "\uc870\uc791\ub429\ub2c8\ub2e4": 6, "typograph": 6, "attak": 6, "attack": 6, "\ub0b4": 6, "\uc0ac\ubb3c": 6, "\uc704\uc5d0": [6, 9], "\uae00\uc528\uac00": 6, "\uc4f0\uc5ec": 6, "\uacbd\uc6b0\uc785\ub2c8\ub2e4": [6, 17], "multimod": [6, 25], "\ub9ce\uc774": [6, 18, 19, 22, 24], "\ud65c\uc6a9\ud574": [6, 15, 20, 22, 27], "\uc0ac\ubb3c\uc744": 6, "\ud310\ub2e8\ud558\ub294": 6, "ipod": 6, "\uc885\uc774\uac00": 6, "\ubd99\uc740": [6, 20], "\uc0ac\uacfc\ub97c": 6, "\ubd84\ub958\ub97c": 6, "\uc218\ud589\ud574\ubcf4\uc558\uc2b5\ub2c8\ub2e4": 6, "\uc5ed\uc2dc": [6, 28], "granni": 6, "smith": 6, "\uac70\uc758": [6, 9, 11, 20], "\uac00\uae5d\ub2e4\uace0": 6, "\ud310\ub2e8\ud588\uc2b5\ub2c8\ub2e4": 6, "\uc0ac\uacfc\uc758": 6, "\uc0ac\uc9c4\uc73c\ub85c": [6, 20], "recov": 6, "\ud574\ub0c5\ub2c8\ub2e4": 6, "\uc774\ucc98\ub7fc": [6, 28], "\ub354\uc6b1": [6, 18, 21, 25], "\ub2e8\uc810\uc740": 6, "\uc5c6\ub098\uc694": 6, "cube": 6, "\uadf8\ub4e4\uc758": [6, 19], "\uc18d\uc131": [6, 20], "color": [6, 16, 23, 26, 27], "\ub9e4\uce6d\uc2dc\ud0a4\ub294": 6, "\ub5a8\uc5b4\uc9d1\ub2c8\ub2e4": 6, "red": [6, 21], "blue": [6, 21], "\ud30c\ub780": 6, "\ud050\ube0c": 6, "\ube68\uac04": [6, 18], "\ud050\ube0c\ub97c": 6, "\uadf8\ub824\ub2ec\ub77c\uace0": 6, "\ud050\ube0c\uc640": 6, "\ud050\ube0c\uc5d0": 6, "\uc0c9\uc0c1": 6, "attribut": [6, 17, 26], "\ubd80\uc5ec\ud574\uc57c\ud560\uc9c0": 6, "\ud5f7\uac08\ub824\ud569\ub2c8\ub2e4": 6, "\uc77c\uad00\uc131\uc788\uac8c": 6, "sign": 6, "sai": 6, "deep": [6, 11, 18, 25], "\ub9cc\uc758": 6, "\ubb38\uc81c\ub294": 6, "\uc5b4\ub824\uc6cc\ud558\ub294": 6, "\ubb38\uc81c\uc785\ub2c8\ub2e4": 6, "\ubcf5\uc7a1\ud55c": [2, 6, 9, 18], "\uc0c1\ud669\uc5d0\uc11c": [6, 8], "\ub514\ud14c\uc77c\uc744": [6, 19], "\ubb18\uc0ac\ud558\ub294": 6, "show": [6, 25], "some": 6, "complex": [6, 27], "\ub124\uc628": 6, "\uc0ac\uc778\ub4e4\uc758": 6, "\ub514\ud14c\uc77c\ub4e4\uc774": 6, "\ub5a8\uc5b4\uc9c0\ub294": [2, 6, 11, 18], "\ud655\uc778\ud558\uc2e4": 6, "\ub17c\ubb38\uc758": [6, 10, 14, 17, 18, 22, 28], "\uc5d0\uc11c\ub294": [2, 6, 9, 11, 23, 24, 25], "\uc218\ud559\uc801": 6, "justifi": 6, "\ub77c": [6, 27], "\ud569\uc2dc\ub2e4": [6, 17], "\uadf8\uc5d0": [6, 14, 28], "\uc800\uc790\uc758": 6, "\uc8fc\uc7a5": [6, 22, 25], "\uc0d8\ud50c\ub9c1\ud560": 6, "equal": 6, "hold": 6, "becaus": [2, 6], "function": [6, 19, 24, 28], "second": 6, "chain": [2, 6, 8, 22, 23], "rule": 6, "\ud3ec\uc2a4\ud305\uc744": 6, "\ubd80\uac00": 6, "\uc774\ubbc0\ub85c": [2, 6, 22], "\uc4f8": [2, 6, 24], "\uacf5\uc2dd\uc744": 6, "\ud480\uc5b4\uc11c": 6, "\ud574\uc124\ud574\ubcf4\uba74": 6, "\uc0ac\uc6a9\ud574": [6, 13, 15, 16, 18, 19, 25], "\uc0d8\ud50c\ub9c1\ud558\uace0": [6, 28], "\uc0d8\ud50c\ub9c1\ud568\uc73c\ub85c\uc368": 6, "\uc0d8\ud50c\ub9c1\uc774": 6, "\uac00\ub2a5\ud574\uc9c0\ub294": 6, "\uc5c6\ub294\uc9c0": 6, "\uad81\uae08\ud574\uc11c": 6, "\uacf5\ubd80\ud574\ubd24\uc2b5\ub2c8\ub2e4": 6, "\uc788\ub294\uc9c0": 6, "\ud574\uc18c\ud558\uae30": 6, "\ub178\ub825\uc744": 6, "\ud558\uace0\uc788\ub294\uc9c0": 6, "\ub300\uccb4": [2, 6, 13], "\uc815\ub7c9\uc801\uc73c\ub85c": 6, "\ud3c9\uac00\ud560": [6, 26], "\uc870\uc0ac\ud574\ubd24\uc2b5\ub2c8\ub2e4": 6, "\uacb0\uacfc\ubd80\ud130": 6, "\ub9d0\uc500\ub4dc\ub9ac\uba74": 6, "\ucc98\ub7fc": [2, 6, 17, 20, 21, 26, 27], "\uc6f9\ud06c\ub864\ub9c1": 6, "\uc874\uc7ac\ud55c\ub2e4\uace0": 6, "\ud558\uace0\uc788\ub294\uc9c0\ubd80\ud130": 6, "preview": 6, "\ud604\uc7ac": [6, 7, 16, 18], "safeti": 6, "\ub178\ub825": 6, "\ub370\uc774\ud130\uc5d0\uc11c": [6, 18], "violent": 6, "hate": 6, "adult": 6, "\uc81c\uac70\ud568\uc73c\ub85c\uc368": 6, "\ub178\ucd9c\ub418\ub294": 6, "\uc2dc\uac04\uc744": [2, 6], "\ucd5c\uc18c\ud654\ud588\ub2e4\uace0": 6, "polici": 6, "\uc704\ubc18\ud55c": 6, "\uc790\uc815\ud558\ub294": 6, "\uc2dc\uc2a4\ud15c\uc744": 6, "\ubcf4\uc720\ud558\uace0": 6, "\uc2e0\ub8b0\ud560": 6, "\uc804\ubb38\uac00\ub4e4\uacfc": 6, "\uac80\ud1a0\ub97c": 6, "\uc9c4\ud589\ud588\ub2e4\uace0": [6, 9, 26], "eval": [6, 7], "\uc0dd\uc131\ud615": [2, 6], "\ud3c9\uac00\ud558\ub294": [6, 18], "\uae30\ubc95\uc774": [6, 9], "2202": 6, "04053": 6, "j": [6, 8], "min": [6, 7, 13], "dallev": 6, "contribut": [6, 22], "\ucd94\ub860": [6, 21], "\ub2a5\ub825": [2, 6], "3\uac00\uc9c0\ub97c": 6, "\ub370\uc774\ud130\uc14b": [2, 6, 13, 21, 25, 26, 27], "\uc81c\uacf5\ud569\ub2c8\ub2e4": [6, 10], "\ucd5c\uadfc\uc758": [6, 20], "recognit": [6, 18], "skill": 6, "\uc0c1\ub300\uc801\uc73c\ub85c": [6, 10, 22], "\ub6f0\uc5b4\ub098\uc9c0\ub9cc": [6, 11], "count": [6, 26], "spaial": 6, "relat": [2, 6, 29], "\uc774\ud574": [2, 6], "\ub2a5\ub825\uc740": 6, "\ub5a8\uc5b4\uc9d0\uc744": 6, "\uc874\uc7ac\ud558\ub294": 6, "gender": 6, "skin": 6, "tone": 6, "bias": 6, "\uce21\uc815\ud558\ub294": [6, 23], "metric": [6, 7, 8, 14, 20, 23, 26], "\ubd84\uc11d": [6, 19, 22], "\ucd5c\ucd08\uc758": [2, 6], "\ub17c\ubb38": [6, 8, 13, 18, 20, 21, 23], "web": 6, "\ud559\uc2b5\ud588\uc74c\uc744": 6, "\ubcf4\uc5ec\uc8fc\uc5c8\uc2b5\ub2c8\ub2e4": [6, 12], "social": 6, "\uce21\uc815": [6, 19], "sec": 6, "\uc790\uc138\ud55c": [6, 18, 27], "diagnost": 6, "ex": [2, 6, 8, 20], "who": 6, "work": [2, 6], "nurs": 6, "\ucd1d": [6, 16, 17, 18, 19, 20], "252\uac1c\uc758": 6, "\uc81c\uacf5": [2, 6], "\uc774\ubbf8\uc9c0\ub85c\ubd80\ud130": 6, "\ud0d0\uc9c0\ud569\ub2c8\ub2e4": 6, "autom": 6, "detect": 6, "verifi": 6, "reliabl": 6, "blip": 6, "\uc8fc\uba74\uc11c": 6, "\uc601\uc0c1": 6, "\uc0ac\ub78c\uc758": [6, 17], "\uc131\ubcc4\uc744": 6, "\ub9de\ucd94\uac8c": 6, "\ub2f5\ubcc0\uc744": 6, "\uce21\uc815\ud569\ub2c8\ub2e4": 6, "\uc2e0\uacbd\ub9dd\uc73c\ub85c": 6, "facial": [6, 16], "landmark": 6, "\ucd94\ucd9c\ud558\uace0": [6, 15], "illumin": 6, "\ubcf5\uc7a5\uc744": 6, "\ud0d0\uc9c0\ub41c": 6, "unbias": 6, "uniform": [6, 7, 27], "\uc73c\ub85c\ubd80\ud130": 6, "skew": 6, "\ub418\uc5b4\uc788\ub294\uc9c0": 6, "result": [6, 7, 10, 11, 13, 15], "expert": 6, "per": 6, "profess": 6, "exampl": [6, 7, 17, 19, 23, 26], "averag": [6, 7], "\ud3c9\uac00\ud558\ub294\ub370\uc5d0": 6, "\uc131\uacf5\ud588\uc2b5\ub2c8\ub2e4": 6, "satbl": 6, "\uc6f9\ud06c\ub864\ub9c1\uc744": 6, "\uc874\uc7ac\ud588\uc2b5\ub2c8\ub2e4": 6, "\uce21\uc815\ud558\uae30": 6, "\ub178\ub825\uc774": 6, "\uc9c0\uc18d\ub418\uace0": 6, "\ubbf8\ub798\uc5d0\ub294": 6, "\uc548\uc804\ud558\uac8c": 6, "\ud65c\uc6a9\ub420": [2, 6], "\uc788\uae30\ub97c": 6, "\uae30\ub300\ud569\ub2c8\ub2e4": 6, "denois": [7, 10, 13, 14, 15, 18, 27], "implicit": [7, 24, 27], "iclr": [7, 13, 28], "2021": [7, 11, 13, 21, 22], "2010": 7, "02502": 7, "april": 7, "23": [7, 20], "ddpm\uc758": [7, 9, 11, 16, 22], "\ub2e8\uc810\uc778": 7, "markov": [2, 7, 8], "process\ub97c": [7, 8, 22], "process\ub85c": [7, 8, 11, 22], "\uc815\uc758\ud568\uc73c\ub85c\uc11c": 7, "deterministic\ud55c": 7, "sampling\uc774": [7, 22], "\ubd84\uc57c\uc5d0\uc11c": [2, 7, 10, 11, 16], "adversari": [7, 10, 14, 17, 24], "\ubcf4\uc5ec\uc8fc\uace0\uc788\ub2e4": 7, "gan\uc740": [7, 17, 20], "\uacfc\uc815\uc5d0\uc11c": [7, 8, 9, 10, 13, 14, 16, 18, 19, 20], "\ubd88\uc548\uc815\uc131\uc744": [7, 20], "\ub9ce\ub2e4": 7, "generator\uc640": 7, "discriminator\uc758": 7, "imbalanced\uc5d0": 7, "\uc758\ud55c": [7, 19], "mode": [7, 13, 27], "collaps": [7, 20], "\uadf8\ub7ec\ub358": 7, "ddpm\uacfc": [7, 9, 14], "ncsn\uac19\uc740": 7, "training\uad6c\uc870\uac00": 7, "\ub4f1\uc7a5\ud558\uc600\uace0": 7, "\uc131\uacf5\uc758": 7, "\ubcf4\uc5ec\uc8fc\uc5c8\ub2e4": [7, 16], "ddpm\uc740": [7, 22], "process\uc5d0\uc11c": [7, 11, 14, 22], "\uac70\uce58\ub294\ub370": 7, "\uc774\ub54c\ubb38\uc5d0": 7, "gan\uc5d0": 7, "\ub290\ub9b0": 7, "performance\ub97c": 7, "50k": 7, "less": 7, "than": 7, "about": [7, 29], "20h": 7, "256": [7, 15, 20, 21, 24, 25], "1000h": 7, "ddim\uc740": [7, 22], "chain\uc5d0": 7, "\ub300\uccb4\ud558\uc600\uace0": 7, "\uacb0\uad6d": [7, 9, 11, 21, 22], "\ube60\ub974\uace0": [7, 10], "\ube44\uad50\uc801": [7, 11, 27], "quality\uc758": [7, 9, 11, 14, 16], "\uc0dd\uc131\ud574\ub0b4\uace0": [7, 16], "accel": 7, "ddpm\uacfc\ub294": 7, "\ub2e4\ub974\uac8c": [7, 10, 15, 18, 23], "consistency\ud55c": 7, "\ubcf4\uc5ec\uc90c\uc73c\ub85c\uc368": 7, "latent\uac04\uc758": 7, "interpolation\uc774": 7, "consist": 7, "If": 7, "equival": 7, "process\ub294": [7, 9], "\ub3d9\uc791\ud55c\ub2e4": 7, "\ubbf8\ub798": 7, "\uc2dc\uc810\uc744": [2, 7], "\uc608\uce21\ud558\uae30\uc704\ud574": 7, "\uc2dc\uc810\uc758": [2, 7, 22], "\uc774\uc6a9\ud55c\ub2e4": [7, 9], "\uc2dc\uc810\uc740": 7, "\uacfc\uac70": 7, "\uac12\uc5d0\ub294": 7, "\ub3c5\ub9bd\uc801\uc778": 7, "\uac16\ub294\ub2e4": 7, "t\ub294": 7, "ddpm\uc5d0\uc11c": [7, 9, 11, 22], "\uc88c\uc9c0\uc6b0\uc9c0\ud558\ub294": 7, "hyper": [7, 11, 13, 16], "parameter\uc774\ub2e4": 7, "\ub300\ucda9": 7, "1000": [2, 7, 8, 15, 18], "\ubc88\uc758": 7, "\uacfc\uc815\uc744": [7, 8, 9, 10, 14, 15, 19, 28], "sequential\ud558\uac8c": 7, "\uac70\uccd0\uc57c\ud558\uace0": 7, "\ubcf4\ub2e4": [2, 7, 8, 11, 13, 18, 19, 20, 21, 22, 23, 25, 27], "\ud604\uc800\ud788": [7, 11], "\uc18d\ub3c4\ub97c": [7, 10], "\uc694\uc18c\uac00": 7, "\ub41c\ub2e4": [2, 7, 20, 25], "\uc815\uc758": [2, 7, 11], "\uad6c\ud558\uae30\uc704\ud574": 7, "\uac12\uacfc": 7, "\ucc38\uc870": [7, 10], "\uac12\ub9cc\uc744": 7, "\u03c3\ub294": 7, "process\uc758": [7, 11], "stochastic\ud55c": 7, "chap": 7, "And": 7, "unifi": 7, "revers": [2, 7, 9, 11, 14, 27], "\uc2dd\uc744": [7, 22], "\uc774\uc6a9\ud574": [7, 9, 14, 18, 19, 28], "\uc0d8\ud50c\ub9c1": [7, 18, 22, 25], "\uad00\uacc4": 7, "noise\ub97c": [7, 8, 11, 14, 17], "\uacc4\uc0b0": [7, 8, 19], "fix": [2, 7, 8, 10], "t\uc2dc\uc810\uc758": 7, "\uc608\uce21\ud55c": [7, 9, 10], "\u03c3": 7, "\u03c3\uac00": 7, "\uac00\uc9c8": 7, "\uc218\uc2dd\uacfc": 7, "\ub3d9\uc77c\ud558\ub2e4": 7, "explan": 7, "acceler": [2, 7, 22, 23], "deterministic\ud558\uae30\ub54c\ubb38\uc5d0": [7, 22], "\uacc4\uc0b0\ud560": [7, 22], "\ud544\uc694": [7, 22], "subset\uc758": [7, 22], "\uc2dc\uc810\ub9cc\uc73c\ub85c": [7, 22], "method\ub294": [7, 19, 22], "\uc57d\uac04\uc758": [7, 9, 22], "\uc800\ud558\uac00": [7, 10, 22], "\uc788\uc9c0\ub9cc": [7, 9, 20, 21, 22, 27], "efficiency\ub97c": [7, 22], "\ucda9\ubd84\ud788": [7, 10, 22], "\uc99d\uac00\uc2dc\ud0ac": [7, 22], "ddim\uc758": [7, 22], "od": 7, "encoding\uc774": 7, "\uc720\ub3c4\ud560": 7, "table1": 7, "euqat": 7, "simple\ud558\uac8c": 7, "control\ud558\uae30\uc704\ud55c": 7, "\ud69f\uc218": [7, 10], "\ub0ae\uc740": [7, 8, 10, 11, 15, 21, 23], "3\uc758": [7, 21], "\u03b7\uac00": 7, "step\uc5d0": [7, 11], "figur": [2, 7, 11, 15, 16, 18, 19, 20, 23, 25], "step\uacfc": 7, "time\uc774": 7, "linear\ud55c": 7, "step\uc5d0\uc11c\ub3c4": 7, "\uc5b4\ub290\uc815\ub3c4\uc758": 7, "object\ub97c": 7, "kera": 7, "io": [7, 8, 11, 19, 20, 21], "diffusionmodel": 7, "image_s": 7, "width": [7, 22], "block_depth": 7, "super": [2, 7, 8, 17, 18, 20, 24, 26, 27, 28], "get_network": 7, "unet": [7, 8, 15, 23, 27], "denorm": 7, "convert": [7, 23], "pixel": [7, 21, 27], "back": 7, "rang": [7, 21, 23, 24, 27], "mean": [7, 8, 9, 23], "varianc": [2, 7, 8, 9, 18], "tf": 7, "clip_by_valu": 7, "diffusion_schedul": 7, "diffusion_tim": 7, "angl": 7, "start_angl": 7, "aco": 7, "max_signal_r": 7, "end_angl": 7, "min_signal_r": 7, "diffusion_angl": 7, "signal_r": 7, "co": [7, 8], "noise_r": 7, "sin": [7, 8], "note": 7, "squar": [7, 23], "sum": [7, 13], "alwai": 7, "noisy_imag": 7, "exponenti": [7, 15], "move": [7, 15], "ema_network": 7, "predict": [7, 8, 10, 23, 25, 27], "compon": 7, "calcul": 7, "pred_nois": [7, 8], "pred_imag": 7, "train_step": 7, "have": 7, "standard": [2, 7, 17], "deviat": 7, "like": 7, "shape": [7, 8, 17, 19, 23, 24, 26], "batch_siz": [7, 17, 28], "minval": 7, "maxval": 7, "mix": [7, 18], "accordingli": 7, "gradienttap": 7, "tape": 7, "separ": [7, 17, 23], "noisi": [7, 27], "noise_loss": 7, "image_loss": 7, "trainable_weight": 7, "apply_gradi": 7, "noise_loss_track": 7, "update_st": 7, "image_loss_track": 7, "name": [7, 13], "reverse_diffus": 7, "initial_nois": 7, "diffusion_step": 7, "num_imag": 7, "step_siz": 7, "import": [7, 11, 13], "line": 7, "pure": 7, "its": 7, "assum": 7, "nonzero": 7, "next_noisy_imag": 7, "ones": 7, "remix": 7, "next": 7, "next_diffusion_tim": 7, "next_noise_r": 7, "next_signal_r": 7, "generated_imag": 7, "probabilist": [8, 13, 18], "neurip": [8, 22, 25], "2020": [8, 11], "2006": [8, 13], "11239": [8, 13], "pytorch": [8, 13, 17, 21, 24, 28], "implement": [8, 13, 16, 23, 24, 28], "review": [8, 13, 19, 29], "pr": [8, 13, 23], "409": [8, 13], "beomsoo": [8, 13], "park": [8, 9, 13], "apr": [8, 13, 17, 20, 24, 28], "19": [8, 13], "sourc": [2, 8, 16, 17, 19, 20, 21, 26], "velog": [8, 20, 21], "yetsyl0705": 8, "what": 8, "inference\ub85c": 8, "\ud559\uc2b5\uc2dc\ucf1c": [8, 13], "parameter": 8, "model\uc740": [8, 9, 12, 13, 19], "markov\uac00": 8, "distribution\uc758": 8, "\ud615\ud0dc\ub97c": 8, "\ub54c\uae4c\uc9c0": 8, "\ub354\ud574\uac00\ub294": 8, "\uc5ed\uc73c\ub85c": 8, "\uac70\uce58\uba70": 8, "\uad6c\uc131\ub428": 8, "\uc815\uc758\ud558\uae30": 8, "\uc27d\uace0": 8, "\ud559\uc2b5\uc2dc\ud0a4\ub294": [8, 9, 20], "\ud3b8\ub9ac\ud568": 8, "\ud488\uc9c8\uc758": [8, 9, 20], "\uc0dd\uc131\uc774": [8, 10, 16, 21, 22, 25], "\ubcc0\ubd84\ucd94\ub860": [8, 28], "\uc0ac\ud6c4\ud655\ub960": 8, "posterior": [8, 21, 28], "\ubd84\ud3ec": [8, 21], "\ub2e4\ub8e8\uae30": [8, 28], "\uc26c\uc6b4": [8, 20, 28], "\ud655\ub960\ubd84\ud3ec": 8, "\uadfc\uc0ac": 8, "approxim": [8, 28], "\ud45c\ud604\uc2dd\uc5d0": 8, "\ud45c\ud604\ud558\ub294": [8, 19, 23], "\ubcf4\ud1b5": [8, 10, 13, 17, 18, 19, 20], "parameter\uc758": [8, 9], "\uc2dd\uc758": 8, "\ucc28\uc218\ubcf4\ub2e4": 8, "\uc218\ub85c": 8, "\uc120\ud0dd": [8, 21], "3\ucc28": 8, "\ud45c\ud604\uc2dd": 8, "2\uac1c": 8, "\ud558\ubbc0\ub85c": [2, 8], "\ucc28\uc218\ub85c\uc758": 8, "\ud568\uc218": [8, 20, 21, 22], "3d": 8, "2d": 8, "\uc0c1\ud0dc\uc5d0\uc11c": [8, 10, 20], "\uc0c1\ud0dc\ub85c": [8, 10, 23, 27], "\ub118\uc5b4\uac08": 8, "\ub2e8\uacc4\uc758": [8, 19], "\uc0c1\ud0dc\uc5d0\ub9cc": 8, "\ud655\ub960": [2, 8, 14, 18, 28], "graphic": [8, 25], "_0": 8, "prod_": 8, "quad": 8, "sqrt": [2, 8, 9, 12, 27], "beta_t": 8, "chain\uc73c\ub85c": 8, "data\uc5d0": [8, 20], "\ucd94\uac00\ud560": 8, "schedul": [2, 8, 11, 22, 23, 27], "beta_1": 8, "\ub354\ud574\uc900\ub2e4": 8, "\uc774\uba74": [8, 14, 25], "mean\uc778": 8, "\uc774\uc804": [8, 9, 13, 17], "\uac16\uc9c0": 8, "\ub178\uc774\uc988\uac00": 8, "\uc99d\uac00\ud568": 8, "\ub2e8\uc21c\ud788": [2, 8, 14, 19, 20], "noise\ub9cc\uc744": 8, "\ub354\ud574\uc8fc\ub294\uac8c": 8, "scaling\ud558\ub294": 8, "variance\uac00": 8, "\ubc1c\uc0b0\ud558\ub294": 8, "\ub9c9\uae30": 8, "\uc704\ud568": [8, 20], "x_1": 8, "x_0": [2, 8, 9], "\ub9cc\ub4dc\ub294": [8, 9, 10, 28], "\uc644\uc804": 8, "destroy\ub41c": 8, "\uc0c1\ud0dc": 8, "p_": [2, 8, 9, 13, 21, 22, 24, 28], "boldsymbol": 8, "mu": [2, 8, 17, 22, 28], "sigma": [8, 17, 22, 28], "\uac00\uc6b0\uc2dc\uc548": 8, "\ub178\uc774\uc988\ub97c": [2, 8, 18], "1994\ub144": 8, "process\uac00": [8, 14], "\uac00\uc6b0\uc2dc\uc548\uc774\uba74": 8, "process\ub3c4": 8, "\uac00\uc6b0\uc2dc\uc548\uc73c\ub85c": 8, "\uc4f0\uba74": 8, "\ub41c\ub2e4\ub77c\ub294": 8, "\uc99d\uba85\uc774": 8, "\ud568": [2, 8, 13, 18, 19, 20, 21, 22, 25], "\ud574\uc57c": 8, "mu_": [2, 8, 9], "\ubd84\uc0b0": [2, 8, 18, 22, 28], "sigma_": [8, 22, 23], "hierarach": 8, "vae\uc5d0\uc11c\uc758": 8, "\uacfc\uc815\uacfc": 8, "\ube44\uc2b7\ud568": [8, 19], "\ubaa9\uc801\uc740": 8, "\uc81c\uac70\ud560": 8, "\uac83\uc778\uac00": 8, "\uc774\ub2e4": [2, 8, 13, 22], "\ub4e4\uc5b4\uc654\uc744": [8, 18], "\uc608\uce21\ud560": 8, "\uc608\uce21\uc774": 8, "\uac00\ub2a5\ud574\uc9d0": [8, 19], "mathbb": [2, 8, 12, 13, 23, 24, 27, 28], "leq": 8, "_q": [8, 12], "sum_": [8, 9, 13, 28], "geq": 8, "neg": [8, 9, 16], "likelihood\ub97c": 8, "\ucd5c\uc18c\ud654": 8, "\ubc29\ud5a5\uc73c\ub85c": [8, 14, 16, 24, 27], "\uc9c4\ud589": [2, 8, 16, 21, 22, 25], "\uc218\uc2dd\uc744": [8, 14, 20, 22], "elbo": [2, 8, 21], "evid": [8, 21], "lower": [8, 14, 15, 21, 25], "bound": 8, "\uc6b0\ud56d\uacfc": 8, "\uc815\ub9ac\ud558\uace0": 8, "\ud480\uc5b4\ub0b4\uba74": 8, "elbo\uc758": 8, "\uc5ed\ud560\uc740": 8, "\uad00\ucc30\ud55c": 8, "\ud798\ub4e0": 8, "\ubd84\ud3ec\ub97c": [8, 15, 18, 20, 24, 28], "\uc774\ub8e8\uace0": 8, "\uc870\uae08": 8, "\ubd84\ud3ec\uc778": 8, "\ud45c\ud604\ud558\ub824": 8, "\ucc28\uc774": [8, 14], "kl": [2, 8, 12, 24, 28], "diverg": 8, "\ud558\uae30": [2, 8, 10, 21, 23, 27, 28], "underbrac": 8, "d_": [2, 8, 10, 13, 24], "_1": 8, "\ub098\uc628\ub2e4": [2, 8, 22], "term\uc73c\ub85c": 8, "\ud559\uc2b5\uc2dc\ud0b4": 8, "reconstruct": [8, 15, 19, 23, 28], "\ub9e4": [2, 8, 27], "\ub2e8\uacc4\uc5d0\uc11c": [8, 10, 15, 19], "\uc9c0\uc6b0\ub294": 8, "\uc9c0\uc6c0": 8, "ddpm\uc5d0\uc11c\ub294": [8, 9, 11], "induct": 8, "bias\ub97c": [8, 17, 19], "\ub298\ub824": [8, 18], "stable\ud558\uace0": 8, "\uc131\ub2a5\ub3c4": [8, 18, 26], "\uac1c\uc120\ud560": [8, 11], "\uc788\uc5c8\uc74c": [8, 13, 19, 20], "\ub9cc\ub098\ubcf4\uc9c0": 8, "\ubabb\ud588\ub358": [8, 23], "\uc815\ud655\ud55c": [8, 19, 20], "\uc608\uce21\uc744": [8, 10], "\uac00\uc815": 8, "\ud480\ub824\ub294": 8, "\ubb38\uc81c\uc5d0": 8, "\uc801\uc6a9\ud558\ub294": [8, 13, 23, 27], "\uace0\uc815": [8, 11, 19], "\ud588\ub354\ub2c8": 8, "\uc798\ub428": 8, "02\ub85c": 8, "linear\ud558\uac8c": 8, "image\uc5d0": [8, 9, 19], "\uac00\uae4c\uc6b8\uc218\ub85d": 8, "\uc801\uac8c": [8, 21], "\uc8fc\ub294": [8, 9, 27], "\uc124\uc815": [2, 8, 13, 18, 21], "parameter\uac00": 8, "\uc5c6\uc5b4": [8, 20, 25], "\ub418\uae30": [8, 13, 20], "tild": [2, 8, 10, 11], "beta": [8, 10], "progress": 8, "posterior\ub97c": 8, "\ub354\ud574": 8, "\ub9cc\ub4e4\uc5c8\uc744\ub54c": 8, "\ubcf5\uc6d0": 8, "simplic": 8, "sjina0722": 8, "\ub9ac\ubdf0": [8, 13], "\uc0c1\uc218\ub85c": 8, "\uac00\uc815\ud588\uace0": 8, "\ubc1b\uae30": [8, 16], "\ud559\uc2b5\uc2dc\ud0a4\uc9c0": 8, "\uc54a\uc544\ub3c4": [8, 27], "\ub41c\ub2e4\uace0": 8, "\uc0dd\uac01\ud574": 8, "term\uc744": 8, "\uc81c\uac70": [8, 10, 15], "residu": [8, 9, 10, 20, 22, 23, 25, 27], "estim": [8, 24, 28], "\uad6c\ud558\uc9c0": [8, 24], "\uc54a\uace0": [8, 13, 24, 26, 28], "epsilon_": [2, 8, 12, 22, 27], "\uad6c\ud574": 8, "\uc815\ud655\ub3c4\ub97c": [8, 18], "\ub192\uc784": 8, "d": [2, 8, 12, 13, 17, 24], "int_": 8, "delta_": [2, 8], "sigma_1": 8, "arrai": 8, "ll": [8, 13, 23], "infti": 8, "255": 8, "case": [8, 26], "\uc0ac\uc774\ub85c": 8, "linearli": 8, "\ub2e8\uacc4\uc5d0\ub294": 8, "\ucd94\uac00\ud558\uc9c0": 8, "divergence\ub97c": 8, "\ub098\ud0c0\ub0c4": [2, 8, 18, 20], "\uc88c\ud45c": 8, "final": [8, 9], "\uc704\uc640": [8, 13, 20, 22], "\ub098\ud0c0\ub09c\ub2e4": 8, "ground": [2, 8, 20, 24], "truth": [2, 8, 20, 24], "output\uac04": 8, "\uc904\uc774\ub294": [8, 10], "\uacfc\uc815\uc774": 8, "denoising\uacfc": 8, "\ube44\uc2b7\ud574": 8, "ddpm\uc774\ub77c\ub294": 8, "\uc774\ub984\uc774": [8, 25], "\ubd99\uc74c": 8, "objective\uc744": 8, "\uc5d0\uc11c\ubfd0\ub9cc": 8, "t\uc5d0": 8, "\ub300\ud574\uc11c\ub3c4": [2, 8, 9, 18, 19, 20, 22], "\uac00\ub2a5\ud558\uae30": 8, "\ud6a8\uacfc\uc801": 8, "psuedo": 8, "algorithm": [2, 8], "\ub354\ud574\ub098\uac00\ub294": 8, "epsilon": [2, 8, 9, 10, 12, 22, 23, 27], "\uc5bc\ub9c8\ub9cc\ud07c": 8, "\ub354\ud574\uc84c\ub294\uc9c0\ub97c": 8, "step\uc758": [8, 9], "gaussian": [2, 8, 13, 17, 22, 23, 27, 28], "\ucd94\uac00\ub418\uc5c8\ub294\uc9c0\ub97c": 8, "\uc608\uce21\ud558\ub3c4\ub85d": [2, 8, 11], "\ud559\uc2b5\ub41c\ub2e4": [8, 19], "\ucf54\ub4dc\uc5d0\uc11c\ub294": [8, 13], "\ub79c\ub364": 8, "\ub178\uc774\uc988\uc640": 8, "\ub2e8\uacc4": [8, 24], "t\ub85c": [8, 9], "\uc5bb\uace0": 8, "p_loss": 8, "x_start": 8, "default": [8, 13], "lambda": [8, 20, 23], "torch": [8, 13, 23, 27, 28], "randn_lik": [8, 23], "q_sampl": 8, "do": [8, 17, 19, 27], "set": [8, 13, 20, 23, 25], "slow": 8, "down": [2, 8, 27], "25": [8, 13, 15, 18, 24], "seem": 8, "significantli": [8, 25], "x_self_cond": 8, "self_condit": 8, "no_grad": 8, "model_predict": 8, "pred_x_start": 8, "detach_": 8, "take": 8, "model_out": 8, "pred_x0": 8, "pred_v": 8, "predict_v": 8, "rais": [8, 23], "valueerror": [8, 23], "unknown": [8, 23], "loss_fn": 8, "reduct": [8, 23], "reduc": [8, 23], "extract": 8, "loss_weight": 8, "network\ub97c": [8, 17], "\ud559\uc2b5\ud558\uace0": [8, 18, 21], "\ub098\uba74": [8, 10], "noise\uc5d0\uc11c": 8, "\uc2dc\uc791\ud574\uc11c": [8, 17], "\uc21c\ucc28\uc801\uc73c\ub85c": [8, 21, 27], "markovian": [2, 8, 22], "p_sampl": 8, "int": [8, 24, 27, 28], "devic": [8, 23], "batched_tim": 8, "long": [8, 23], "model_mean": 8, "model_log_vari": 8, "p_mean_vari": 8, "clip_denois": 8, "pred_img": 8, "exp": 8, "backbon": [8, 15], "u": [2, 8, 12, 23, 25, 27], "\uac01": [2, 8, 10, 13, 15, 17, 18, 19, 20, 21, 22, 23, 26, 27], "upsampl": [8, 9, 22, 25, 27], "\ub2e8\uacc4\ub294": 8, "resnet": [8, 18, 22, 27], "convnext": 8, "\ube14\ub85d": 8, "groupnorm": [8, 22], "upsampling\uc73c\ub85c": 8, "block_klass": 8, "resnetblock": 8, "group": 8, "resnet_block_group": 8, "modulelist": [8, 27], "dim_in": 8, "time_emb_dim": 8, "time_dim": 8, "prenorm": 8, "linearattent": 8, "downsampl": [2, 8, 15, 22, 26, 27], "dim_out": 8, "is_last": 8, "conv2d": [8, 13, 27], "init_dim": 8, "out_dim": 8, "dim_mult": 8, "learned_vari": 8, "learned_sinusoidal_cond": 8, "random_fourier_featur": 8, "learned_sinusoidal_dim": 8, "determin": 8, "dimens": [8, 13, 27], "input_channel": 8, "init_conv": 8, "in_out": 8, "list": [8, 27], "random_or_learned_sinusoidal_cond": 8, "sinu_pos_emb": 8, "randomorlearnedsinusoidalposemb": 8, "fourier_dim": 8, "sinusoidalposemb": 8, "time_mlp": 8, "gelu": 8, "num_resolut": 8, "len": [8, 24, 27], "ind": 8, "enumer": [8, 23, 24, 27], "mid_dim": 8, "mid_block1": 8, "mid_attn": 8, "mid_block2": 8, "default_out_dim": 8, "final_res_block": 8, "final_conv": 8, "zeros_lik": 8, "clone": [8, 27], "block1": [8, 27], "block2": [8, 27], "attn": [8, 16], "pop": 8, "resolution\uc5d0": [8, 18, 20], "conv\uc5d0\uc11c": 8, "\ucc28\uc6d0\uc744": 8, "3\ubc30\ub85c": 8, "\ub298\ub9ac\uace0": 8, "v\ub85c": 8, "\ubd84\ud574": [8, 10], "dim_head": 8, "hidden_dim": 8, "to_qkv": 8, "to_out": 8, "qkv": 8, "chunk": [8, 23, 27], "rearrang": 8, "sim": [2, 8, 12, 24, 27], "einsum": 8, "softmax": [8, 12, 21], "layernorm": 8, "block\uc5d0": [8, 9, 22], "sinusoid": 8, "embedding\uc774": [8, 19], "\ucd94\uac00\ub3fc\uc11c": 8, "\uad6c\ubd84\ub428": 8, "half_dim": 8, "math": 8, "10000": 8, "arang": 8, "score": [8, 9, 20, 21, 25, 27], "is\ub85c": 8, "model\uc778\ub370\ub3c4": 8, "model\ubcf4\ub2e4": [8, 9, 14], "\uc6b0\uc6d4": 8, "codelength\uc5d0\uc11c": 8, "\ucc28\uc774\uac00": [8, 11, 18, 19], "\uc5c6\uae30": [8, 20], "overfitting\uc758": 8, "\uac00\ub2a5\uc131\ub3c4": 8, "\uc801\uc74c": 8, "incept": [8, 18, 21], "v3\uc73c\ub85c": 8, "\uacc4\uc0b0\ud55c": [8, 24], "dataset\uc5d0": [8, 9, 21], "\ud559\uc2b5\ub418\uba74": [8, 19], "label": [8, 20, 22, 25], "\ub4f1\uc758": [8, 10, 19, 20], "\uacc4\uc0b0\ud558\ub294": [8, 28], "\uc131\uc801\uc774": 8, "\uc88b\uace0": 8, "variance\ub97c": [8, 11], "\uc0ac\uc6a9\ud588\uc744": [8, 18, 19, 22], "\ub54c\uc5d0\ub3c4": 8, "\uac10\uc18c\ud558\uc9c0": 8, "icml": [9, 10, 21], "2307": 10, "06949": 10, "hyoungseo": 10, "cho": [10, 12], "\ub5a0\uc624\ub974\uace0": 10, "\uc8fc\uc81c\uc785\ub2c8\ub2e4": 10, "fidelity\uc640": 10, "identity\ub97c": 10, "\uc720\uc9c0\ud55c": [10, 23], "\ub9e5\ub77d\uacfc": 10, "\uc2a4\ud0c0\uc77c\uc744": [10, 16, 18, 20], "\ub17c\ubb38\uc740": [2, 10, 17, 18, 20, 23], "\uc9c4\ud589\ub418\uc5c8\uae30": 10, "\ub17c\ubb38\uc744": [9, 10, 18, 22], "\uc77d\uc5b4": 10, "\ubcf4\uc2dc\uae30\ub97c": 10, "\ucd94\ucc9c\ub4dc\ub9bd\ub2c8\ub2e4": 10, "contribution\uc740": [10, 17], "\ud06c\uac8c": [2, 10, 14, 18, 19, 20, 24, 26, 27, 28], "3\uac00\uc9c0\ub85c": 10, "lighweight": 10, "dreambooth\uc758": 10, "\uc720\uc9c0\ud558\uba74\uc11c": [10, 15, 17], "\ud06c\uae30\ub97c": [10, 15, 22, 27], "\uc904\uc774\uace0": 10, "\ub192\uc77c": [10, 19], "hyperdreambooth\ub97c": 10, "\uad6c\ud604\ud588\uc9c0\ub9cc": 10, "e2": [10, 23, 26], "\uc801\uc6a9\uc774": [10, 13, 17], "\uae30\uc220\ub4e4\uc740": 10, "fidelity\uac00": [10, 16, 19, 22, 25], "\ub5a8\uc5b4\uc9c0\uac70\ub098": 10, "\ubb38\ub9e5\uc744": 10, "\uc81c\uacf5\ud558\uc9c0": 10, "\ubb38\uc81c\uac00": [10, 18, 20], "hypernetwork\ub97c": 10, "\ub3c4\uc785\ud55c": [2, 10, 15], "\uc5f0\uad6c\ub97c": 10, "via": 10, "\ub2e4\uc74c\uc73c\ub85c": 10, "personalization\uc744": 10, "finetuning\uc5d0": 10, "svdiff": 10, "lora": 10, "styledrop": 10, "dreamartist": 10, "\uc608\uc2dc\uac00": 10, "\uc18d\ub3c4": [10, 13], "\uce21\uba74\uc5d0\uc11c": [10, 17], "\ub290\ub9ac\ub2e4\ub294": 10, "\ub2e8\uc810\uc744": [10, 24, 28], "\uad00\ub828": [10, 17], "\uc5f0\uad6c\ub4e4\uc744": 10, "hyperdreambooth\ub294": 10, "\uc18d\ub3c4\uc640": 10, "\ud6a8\uc728\uc131": 10, "\ubc1c\uc804\uc744": 10, "\uc774\ub8e8\uc5c8\ub2e4\uace0": 10, "\uc774\uc804\uc5d0": [10, 19, 22], "\ub098\uc628": [9, 10, 12, 21, 22, 25, 27, 28], "dreambooth\ub294": 10, "\uc8fc\uc81c\uc758": 10, "\uc0dd\uc131\ud558\uae30": [10, 19, 20], "\ub124\ud2b8\uc6cc\ud06c\ub97c": 10, "\ud65c\uc6a9\ud588\uc2b5\ub2c8\ub2e4": 10, "hyperdreambooth\uc758": 10, "\uc601\uac10\uc6d0": 10, "\ud558\ub098\ub85c": [10, 13, 19, 27, 28], "\ud65c\uc6a9\ub418\uc5c8\uc2b5\ub2c8\ub2e4": 10, "adapt": [10, 13, 17], "lora\ub294": [10, 13], "\uac00\uc911\uce58\ub97c": [10, 15, 18], "\ub7ad\ud06c\uc758": 10, "\ud589\ub82c\ub85c": 10, "\uadfc\uc0ac\ud654\ud558\uc5ec": 10, "\ud06c\uae30\uc640": [10, 11], "\ubcf5\uc7a1\uc131\uc744": 10, "\ubc29\ubc95\uc785\ub2c8\ub2e4": [10, 17], "\uae30\uc220\uc744": [10, 15, 20, 25], "\ud6a8\uc728\uc801\uc778": 10, "personalization\uc774": 10, "\uac00\ub2a5\ud558\ub3c4\ub85d": [9, 10, 28], "\uc0b4\ud3b4": 10, "contribution\uc758": 10, "\uc0b4\ud3b4\ubcf4\ub3c4\ub85d": [10, 28], "\uae30\uc220": [10, 15, 18], "\ud558\ub098\uc778": [10, 18], "\uc904\uc5ec\uc11c": 10, "lidb\uc5d0": 10, "\uc124\uba85\ub4dc\ub9ac\uaca0\uc2b5\ub2c8\ub2e4": 10, "lidb\ub294": 10, "residuals\uc758": 10, "\uac00\uc911\uce58": [10, 20, 25], "\uacf5\uac04\uc744": 10, "\uc138\ubd84\ud654\ud558\ub294": 10, "\uc544\uc774\ub514\uc5b4\uc785\ub2c8\ub2e4": 10, "\ub0b4\uc5d0\uc11c": [10, 15, 19, 26], "orthogon": 10, "basis\ub97c": 10, "decompos": 10, "\uc811\uadfc": [10, 19], "lora\uc758": 10, "a\uc640": 10, "\ud589\ub82c\uc744": 10, "\ubd84\ud574\ud558\ub294": 10, "\uac83\uc73c\ub85c\ub3c4": 10, "\uc774\ud574\ud560": 10, "\uad6c\uccb4\uc801\uc73c\ub85c": 10, "\uc0b4\ud3b4\ubcf4\uba74": [2, 10, 14], "\ud589\ub82c\uc740": 10, "a_": 10, "aux": [10, 16], "\ubd84\ud574\ub418\uba70": 10, "b_": [10, 11], "\ubd84\ud574\ud560": 10, "\ub808\uc774\uc5b4\ub294": 10, "\ud589\ubcc4\ub85c": 10, "\uc9c1\uad50\ud558\ub294": 10, "\ubca1\ud130\ub85c": [10, 19], "\ubb34\uc791\uc704": [10, 20], "\ucd08\uae30\ud654\ub418\uace0": 10, "\ud559\uc2b5\ub418\ub294": 10, "\uac00\uc911\uce58\uc785\ub2c8\ub2e4": 10, "\uc120\ud615": 10, "\ub808\uc774\uc5b4\uc758": 10, "residual\uc740": 10, "w_x": 10, "experiment": 10, "\ub418\uc5c8\uc73c\uba70": 10, "\uac1c\uc218\ub294": 10, "\uc57d": [9, 10, 19, 20, 21], "30k\uac1c": 10, "\uc0ac\uc774\uc988\ub294": 10, "120kb\ub85c": 10, "\uacbd\ub7c9\ud654": 10, "\ubcc0\uc218\ub9cc\uc73c\ub85c": 10, "fidel": [2, 10, 22, 23, 25], "edit": [2, 9, 10, 14, 19, 27], "\ub4f1\uc744": [10, 20], "\ud3ec\uc778\ud2b8\uc785\ub2c8\ub2e4": 10, "\ub2e4\uc74c\uc740": 10, "\uc0ac\uc804\uc5d0": [10, 23], "\ub098\ud0c0\ub0b4\uba70": 10, "\ub808\uc774\uc5b4\uc5d0": 10, "\uc544\uc774\ub514\uc5b4\ub294": 10, "x\ub97c": 10, "\uc785\ub825\uc73c\ub85c": [10, 18, 20], "\ubc1b\uace0": [10, 25], "lidb\uc758": 10, "residual\uc778": 10, "hat": [10, 12, 23], "h_": [10, 15], "eta": 10, "\ub3cc\uc785\ud558\ub294": 10, "hypernetwork\ub294": 10, "\ub3c4\uba54\uc778": [2, 10], "\ud2b9\ud654": [10, 21], "\ub370\uc774\ud130\uc14b\uc5d0\uc11c": [10, 19, 25], "\ud6c8\ub828\ub418\uba70": 10, "\ud655\uc0b0": 10, "\ub178\uc774\uc988": [2, 10, 15, 23], "\uc190\uc2e4\uacfc": [10, 20], "\uacf5\uac04": [10, 19, 20], "\uc190\uc2e4\uc744": 10, "alpha": [10, 13, 27], "\ubaa9\ud45c\ub294": [10, 19], "pre": [2, 10, 13, 14, 16, 19, 23, 27], "paramters\uc785\ub2c8\ub2e4": 10, "\uac00\uc911\uce58\ub294": 10, "\uad00\ub828\ub41c": [10, 19, 20], "\uc870\uc815\ub429\ub2c8\ub2e4": 10, "\ub098\ud0c0\ub0c5\ub2c8\ub2e4": 10, "supervisori": 10, "\uc870\uac74\uc774": 10, "\uc124\uc815\ub41c": 10, "\uac1c\uc778\ud654\uc5d0": 10, "\uc0c1\ub300\uc801\uc778": 10, "loss\uc758": [10, 11], "\uc81c\uc5b4\ud558\uae30": 10, "\ud56d\ubaa9\uc758": 10, "\ub370": [9, 10, 18, 20], "\uc9c0\uc6d0\ud558\uae30": 10, "\uc785\ub825\uc785\ub2c8\ub2e4": 10, "\ud504\ub86c\ud504\ud2b8\ub294": 10, "\uc9c0\uc2dc\uc0ac\ud56d": 10, "hyperdreambooth\uc5d0\uc11c\ub294": 10, "\uac1c\uc778\ud654\ub41c": [10, 19], "\ub4dc\ubb3c\uc9c0\ub9cc": 10, "\uc758\ubbf8": 10, "\uc218\uc815\uc744": 10, "\uc0bd\uc785\ud560": [10, 19], "\uc5ed\ud560\uc744": [10, 11, 27], "hyperdreambooth\uc5d0\uc11c": 10, "\uad6c\uc870\ub85c": 10, "\uad6c\uc131\ub418\uba70": [10, 15], "\uc608\uce21\ud569\ub2c8\ub2e4": [10, 20], "\uad6c\uc131": 10, "\uc694\uc18c": 10, "\ud558\ub098\uc785\ub2c8\ub2e4": 10, "\uc774\ud6c4": [2, 9, 10, 13, 19], "\uac00\uc911\uce58\uc5d0": 10, "\ub354\ud558\uc5ec": 10, "\uac1c\uc778\ud654\ub97c": 10, "\uc2e4\ud589\ud569\ub2c8\ub2e4": 10, "iter": 10, "\ubc18\ubcf5\uc801": 10, "\uc218\ud589\ud569\ub2c8\ub2e4": 10, "hypernetwork\uac00": 10, "\ucd08\uae30": 10, "\ubc18\ubcf5\uc801\uc778": 10, "\uac1c\uc120\ud558\ub824\uace0": 10, "\uc2dc\ub3c4\ud558\ub294": 10, "\uc608\uce21\uc740": 10, "\ubc29\ud5a5\uc131\uc774": 10, "\uc62c\ubc14\ub974\uace0": 10, "\uc5bc\uad74\uacfc": [10, 15], "\ubbf8\uc138\ub9cc": 10, "\uc138\ubd80": [10, 19], "\uc7a1\uc544\ub0b4\uc9c0": 10, "\ubabb\ud560": [10, 15], "tuning\ud558\uace0": 10, "\ub098\uc740": 10, "\ub54c\uc5d0": 10, "encoding\uc740": 10, "\ubc88\ub9cc": 10, "\uc218\ud589\ub418\uba70": 10, "\ucd94\ucd9c\ub41c": 10, "\ud2b9\uc9d5": [10, 14], "f\ub294": [10, 20], "\uc2e4\ud589\ud558\uace0": 10, "\uc18d\uc131\uacfc": 10, "\ubc29\ud5a5\uc131\uc5d0": 10, "\uc62c\ubc14\ub974\uac8c": 10, "\ub418\uc9c0\ub9cc": 10, "\uc138\ubd80\uc801\uc778": [10, 17, 20], "detail\uc740": 10, "\ubabb\ud569\ub2c8\ub2e4": 10, "dreambooth\ubcf4\ub2e4": 10, "\ube60\ub974\uc9c0\ub9cc": 10, "\uac15\ud55c": [10, 16], "subject": [10, 23], "diversity\ub97c": [10, 16], "\ub3d9\uc77c\ud558\uac8c": [10, 18, 26, 27], "\ucd08\uae30\ud654\ub41c": 10, "x\uc640": [9, 10, 20], "\uc9c0\uc2dc\uc5b4": 10, "c\uc5d0": 10, "\ucd5c\uc18c\ud654\ud558\ub3c4\ub85d": 10, "\uc870\uc815\ud569\ub2c8\ub2e4": 10, "\uc810\uc740": [10, 18, 19], "\uac1c\ub150\uc785\ub2c8\ub2e4": 10, "\uc8fc\ub85c": [9, 10], "\uc644\ud654\ud558\uc5ec": 10, "rank\ub85c": 10, "hypernetwork\uc758": 10, "\uc608\uce21\ub41c": [2, 10], "\uc8fc\uccb4\uc758": 10, "\uace0\uc8fc\ud30c\uc218": 10, "\uc0ac\ud56d\uc744": 10, "\uadfc\uc0ac\ud654\ud560": 10, "\uc774\ub85c": [2, 10, 25], "\uc778\ud574": [2, 10, 15, 20, 25], "\uc81c\ud55c\ub41c": 10, "\uc5c5\ub370\uc774\ud2b8\ubcf4\ub2e4": 10, "\uc8fc\uc81c": 10, "\ucda9\uc2e4\ub3c4\ub97c": 10, "\ub2ec\uc131\ud560": 10, "relaxed\uc758": 10, "\uac1c\ub150\uc740": 10, "\ubc29\uc2dd\ubcf4\ub2e4": 10, "\uc6b0\uc218\ud558\uac8c": [10, 17], "\uc694\uc778\uc785\ub2c8\ub2e4": 10, "\uc5ec\uae30\uc11c\ub3c4": 10, "\uc0ac\uc6a9\ud558\ub294\ub370": 10, "\uc9c0\uc6d0\ud558\uba70": 10, "\uc5bc\uad74\uc5d0": 10, "\ud2b9\uc131\uacfc": 10, "\ucea1\ucc98\ud558\ub294": 10, "\ub3c4\uc6c0\uc774": [10, 19, 21], "\uace0\ub824\ud560": 10, "40\ubc88\uc758": 10, "\ubc18\ubcf5\uc73c\ub85c": 10, "\uc644\ub8cc\ud560": 10, "dreambooth\uc640": 10, "\ube44\uad50\ud588\uc744": [10, 14, 18], "25\ubc30": 10, "\uc18d\ub3c4\ub77c\ub294": 10, "\uad6c\ud604\ud588\uc2b5\ub2c8\ub2e4": 10, "\ubaa8\ub378\uc5d0\uc11c\ub294": 10, "5\uc758": 10, "unet\uc758": [10, 15], "\ud65c\uc6a9\ud558\uae30": 10, "\uc778\ucf54\ub354\ub3c4": 10, "\uac1c\uc778\ud654\ud558\uae30": 10, "\uc2dc\uac01\ud654\uc5d0": 10, "\uc5bc\uad74": 10, "sfhq": 10, "synthet": 10, "headquart": 10, "\ub370\uc774\ud130\uc14b\uc744": [9, 10, 17], "\ud6c8\ub828\uc2dc\ud0a4\uae30": [9, 10], "celeba": 10, "hq": [2, 10], "000\uac1c\uc758": 10, "galleri": 10, "\uc624\ub978\ucabd": [10, 15, 18], "\uc544\ub798\ub85c": [10, 17, 27], "\uc778\uc2a4\ud0c0\uadf8\ub7a8": 10, "\uc140\uce74": 10, "pixar": 10, "\uce90\ub9ad\ud130": 10, "bark": 10, "skin\uc758": 10, "\ub85d": 10, "\uc2a4\ud0c0": 10, "\uc804\ubb38\uc801\uc778": 10, "\ucd2c\uc601": 10, "inversion\uc758": 10, "\ube44\uad50\ud55c": [9, 10, 26], "\ud45c\uc785\ub2c8\ub2e4": 10, "\ud3c9\uac00\ub97c": [10, 15, 20, 21, 24, 25], "dino\uc640": 10, "\uc9c0\ud45c\ub97c": [10, 18, 20], "\ud45c\ub294": 10, "\ubd80\ubd84\uc785\ub2c8\ub2e4": [10, 17, 18], "hyperparameter\ub97c": 10, "\uc870\uc815\ud558\uc5ec": 10, "\ube44\uad50\ud588\uc2b5\ub2c8\ub2e4": [10, 27], "\ud559\uc2b5\ub960\uc744": 10, "\uc99d\uac00\uc2dc\ud0a4\uace0": 10, "\ubc18\ubcf5": 10, "\uac10\uc18c\uc2dc\ud0a4\uba74": 10, "\uacb0\uacfc\uc758": [10, 19], "agg": 10, "1\uc740": [10, 20, 22], "400\ubc88\uc758": 10, "\ubc18\ubcf5\uc744": 10, "\uc2dc\ud589\ud558\uace0": 10, "2\ub294": [10, 20, 22], "1200\ubc88": 10, "\uc694\uc18c\ub85c": 10, "\ub098\ub204\uc5b4": 10, "\uc911\uc5d0\ub294": 10, "\ud558\uc774\ud37c\ub124\ud2b8\uc6cc\ud06c\ub97c": 10, "\ud558\uc774\ud37c\ub124\ud2b8\uc6cc\ud06c": 10, "\uc608\uce21\ub9cc": 10, "1\ubc88\ub9cc": 10, "\ube44\uad50\ud569\ub2c8\ub2e4": 10, "\uacb0\uacfc\uc801\uc73c\ub85c": [10, 20], "\ubc29\ubc95\uc774": [9, 10, 18, 20], "\uc2e0\ub8b0\uc131": 10, "\uc9c0\ud45c\uc5d0\uc11c": 10, "\ub2ec\uc131\ud55c\ub2e4\ub294": 10, "\ubcf4\uc5ec\uc8fc\uace0": [9, 10, 22, 23, 25], "user": [10, 16, 19], "\uc778\uc2dd": 10, "\uba54\ud2b8\ub9ad": 10, "\uc2dc\ub098\ub9ac\uc624\uc5d0\uc11c": 10, "\uc57d\ud558\ub2e4\uace0": 10, "\ub124\ud2b8\uc6cc\ud06c\uac00": 10, "\uc774\ubbf8\uc9c0\uc5d0\ub9cc": 10, "\ud6c8\ub828\ub418\uc5b4": 10, "\uc788\uace0": [9, 10, 14, 24, 26, 27, 28], "\uc2a4\ud0c0\uc77c\uc5d0\uc11c": 10, "\uc0ac\ub78c\uc744": [10, 20], "\uc778\uc2dd\ud558\ub3c4\ub85d": 10, "\uc788\uc9c0": [2, 10, 11, 20], "\uc54a\uae30": [10, 18], "\ub54c\ubb38\uc774\ub77c\uace0": 10, "\uc8fc\uc7a5\ud558\uba70": 10, "\ubcf4\uc644\ud558\uae30": 10, "study\ub97c": 10, "inversion\uc744": 10, "\ube44\uad50\ud558\uace0": 10, "\uc0ac\uc6a9\uc790\ub4e4\uc758": 10, "\ubc1b\uc558\uc2b5\ub2c8\ub2e4": 10, "ups\uac00": 10, "\uc874\uc7ac\ud569\ub2c8\ub2e4": [10, 20, 23], "direct": 10, "error": [10, 28], "\uc608\uce21\uc5d0\uc11c": 10, "\uc798\ubabb\ub41c": 10, "\uc2dc\ub9e8\ud2f1": [10, 20], "\ub098\uc62c": 10, "\uc5d0\ub7ec\uc785\ub2c8\ub2e4": 10, "\ub208": [10, 20], "\uc0c9\uae54\uc774\ub098": 10, "\ud5e4\uc5b4": 10, "\ud0c0\uc785": 10, "\uc131\ubcc4": [10, 19], "\ub4f1\uc774": [10, 18, 27], "captur": 10, "\uc624\ub958\uac00": 10, "underfit": 10, "identity\ub294": 10, "\uc9c0\ucf1c\uc9c0\ub354\ub77c\ub3c4": 10, "\uc720\uc0ac\ud558\uc9c0": 10, "\uc0d8\ud50c\uc774": [10, 20], "\uc0dd\uc131\ub420": 10, "hypernetwork\uc640": 10, "\uc2a4\ud0c0\uc77c\uc5d0": 10, "\ubb38\uc81c\uc810\uc740": 10, "\ube5b": 10, "\ud3ec\uc988": 10, "\ub4f1\uc73c\ub85c": 10, "ood\uc778": 10, "\uc0d8\ud50c\uc5d0\uc11c": 10, "\ub098\ud0c0\ub0a0": 10, "\uc5f0\uad6c\uc5d0\uc11c\ub294": [9, 10], "hyperdreambooth\ub77c\ub294": 10, "\uc18c\uac1c\ud588\uc2b5\ub2c8\ub2e4": 10, "\ubcc0\ud658\ud558\ub294": [10, 20], "\uac00\ubcbc\uc6b4": 10, "\uac1c\uc778\ud654\ud558\ub294": 10, "\ubaa9\ud45c\ub85c": [10, 19, 20, 23], "hypernetwork\ub77c\ub294": 10, "\ud30c\ub77c\ubbf8\ud130\uc778": 10, "\uc0dd\uc131\ud558\uba70": [10, 18], "\uc774\uc5b4\uc11c": 10, "\uae30\ud0c0": 10, "\ucd5c\uc801\ud654": [10, 15, 19], "\uac1c\uc778\ud654": 10, "\uc791\uc5c5\uc5d0": 10, "\uc0c1\ub2f9\ud788": [9, 10, 13, 19], "\uc904\uc774\uba74\uc11c": [10, 12], "\ubb34\uacb0\uc131\uc744": 10, "\uc2a4\ud0c0\uc77c\uacfc": [10, 19], "\uc758\ubbf8\uc801": [10, 19], "\uc218\uc815\uc774": [10, 19], "\uc801\uc6a9\ub41c": [10, 17, 18, 20], "\uc788\uc74c\uc744": [10, 11, 18, 19, 20], "\uc785\uc99d\ud558\uc600\uc2b5\ub2c8\ub2e4": 10, "2102": [11, 21], "09672": 11, "ddpm\uc744": 11, "\uc57d\uac04": 11, "quality\ub97c": [11, 19], "\uc720\uc9c0\ud558\uace0": [11, 17], "likelihood\uc218\uce58\ub3c4": 11, "\ud5a5\uc0c1\ub41c": [11, 15, 19], "sampling\uc2dc": 11, "base": [9, 11, 15, 16, 17, 19, 22, 25, 26, 27, 29], "step\uc73c\ub85c": [11, 15, 22], "\ub0bc": [11, 22], "scale\uacfc": [11, 17], "quailty\uc640": 11, "\uc218\uce58\uac04\uc758": 11, "ho": [9, 11], "et": [9, 11, 20], "al": [9, 11, 20], "quality\uc5d0": 11, "\ubc18\ud574": [11, 14], "\ubaa8\ub378\uc5d0\ube44\ud574": 11, "\ub5a8\uc5b4\uc84c\ub2e4": 11, "ddpm\uc774": 11, "diversity\uac00": [11, 22], "dataset": [2, 11, 19, 20, 21, 22, 24, 25], "cifar": [11, 18, 24], "lsun": 11, "\ub3d9\uc791\ud588\uc9c0\ub9cc": 11, "dataset\uc5d0\uc11c\uc758": 11, "\ub3d9\uc791\uc740": 11, "\uc99d\uba85\ub418\uc9c0": 11, "\ubabb\ud588\ub2e4": 11, "\uc218\uce58": 11, "\uac1c\uc120": [2, 11, 18, 22, 27], "imagenet\uac19\uc740": 11, "dataset\uc5d0\uc11c\ub3c4": 11, "\ub3d9\uc791": 11, "process\uc5d0\uc11c\uc758": 11, "\uc81c\uc548\ud558\uc600\ub2e4": 11, "\ub0b4\ub294": [11, 13, 19], "\ud655\uc778": [11, 16, 21, 25], "\uc5f0\uad6c\ub4e4\uc5d0\uc11c": 11, "loglikelihood": 11, "\uc218\uce58\uc640": 11, "sample\uc758": 11, "quality\uac04\uc758": 11, "\uc5f0\uad00\uc131\uc744": 11, "\ub9ce\uc558\ub2e4": 11, "distribution\uc5d0": [11, 20], "model\uc774": [9, 11], "\uc218\uce58\ud654\ud55c": 11, "\ub290\ub08c": 11, "\uc218\uce58\uac00": 11, "\uc88b\uc544\uc9c0\uba74": 11, "quality\ub3c4": 11, "\uc99d\uac00\ud558\ub294": 11, "\uacbd\ud5a5\uc744": [11, 16, 20], "\ubcf4\uc600\ub2e4": [11, 15, 16], "ddpm\uc5d0\uc11c\ub3c4": 11, "\uc218\uce58\ub97c": [11, 14, 22], "\uac1c\uc120\ud55c\ub2e4\uba74": 11, "\uc99d\uac00\ud560": 11, "\uc54a\uc744\uae4c": 11, "angeloyeo": 11, "github": [11, 13, 19, 20, 21, 23], "07": 11, "17": [11, 22], "mle": 11, "html": [11, 20], "\uc785\ud78c": [11, 16], "\ud615\ud0dc": 11, "denoising\uc5d0": 11, "parameter\ub85c": [9, 11], "noising\ud560": 11, "denoising\uc744": [11, 14], "\uc544\ub798\uc640\uac19\uc774": 11, "\uc0ac\uc6a9\ud574\ub3c4": [11, 27], "\ubcf4\uc5ec\uc11c": [11, 20], "\ubb38\uc7a5": 11, "\uc758\ubb38\uc810": 11, "\uc815": 11, "\ubc18\ub300\uc758": 11, "parameter\uc778\ub370": 11, "\ubcf4\uc600\uace0": 11, "fix\ub97c": 11, "\ud558\ub294\uac8c": 11, "\ub9de\uc744\uae4c": 11, "step\uac04": 11, "\ucc28\uc774\ub97c": [11, 23], "\ube44\uad50\ud574\ubcf4\uba74": 11, "step\uc774": [11, 22], "\ub450\uac1c\uc758": [11, 20], "\ub3d9\uc77c\ud574\uc9c4\ub2e4": 11, "2\ub97c": [11, 15, 18], "\uc131\ub2a5\uc740": [11, 18], "\ucd08\ubc18\uc5d0": [11, 24], "\uacb0\uc815\ub418\ub294\ub370": 11, "\ucd08\ubc18\uc5d0\ub294": 11, "\uac12\uc758": [2, 11], "\uacb0\uc815\ub418\ub294": 11, "\ubd80\ubd84": [11, 15, 20, 21], "\uae09\uaca9\ud558\uac8c": 11, "\ub450\uace0": [2, 11], "\ub450\ub294\uac83\uc740": 11, "\uc124\uacc4\uc758": 11, "miss": 11, "\ud559\uc2b5\ud558\uae30\uc5d0\ub294": 11, "\ubc94\uc704\uac00": 11, "\ub108\ubb34": [2, 9, 11, 15, 16, 20, 21], "\uc791\uc544\uc11c": 11, "predict\ud558\ub3c4\ub85d": 11, "\uc124\uacc4": 11, "hybrid": [11, 22], "l_": [11, 14, 22, 27], "hyprid": 11, "\u03bbl_": 11, "vlb": 11, "\uc774\ubbf8\uc9c0\uc5d0\ub300\ud574": 11, "\ub3d9\uc791\ud558\uc9c0\ub9cc": 11, "32x32": [11, 22], "64x64": [9, 11, 18, 25, 26, 27], "\uc54a\ub294\uac83\uc744": 11, "scheduling\uc5d0\uc11c": 11, "mode\uc758": 11, "limitation\uc774": 11, "\uc9c0\uc801": 11, "\uac70\ub4ed\ub0a0\uc218\ub85d": 11, "\uc0c1\ub2e8": [11, 15], "noisy\ud574\uc9d0": 11, "skip\ud574\ub3c4": 11, "\uc131\ub2a5\uc5d0": [11, 18], "\uc601\ud5a5\uc774": 11, "\uc5c6\uc74c\uc744": 11, "mode\ub97c": 11, "\uc774\ud6c4\uc758": 11, "noise\ub294": 11, "\uc758\ubbf8\uc788\ub294": 11, "\ubbf8\uce58\uc9c0": 11, "\ubabb\ud55c\ub2e4": 11, "equation\uc744": 11, "\uc0c8\ub85c": [11, 26], "\uc2dd\uc740": 11, "\uc911\uac04": [2, 11, 19, 22], "\ub2e8\uacc4\uc5d0\uc11c\ub294": [11, 15], "\uac15\ud558\uac8c": [11, 16], "\uc785\ud600\uc9c0\uc9c0\ub9cc": 11, "0\uacfc": 11, "\ubd80\uadfc\uc5d0\uc11c\ub294": 11, "\ub35c": [11, 22], "direct\ub85c": 11, "\ucd5c\uc801\ud654\ud558\ub3c4\ub85d": 11, "\uc124\uacc4\ud558\uba74": 11, "best": [11, 16, 21, 22], "\uc774\ubbf8\uc9c0\uc640\uac19\uc774": 11, "\uc790\uccb4\uac00": [11, 19], "unstable\ud574\uc11c": 11, "\ucd5c\uc801\ud654\uc5d0\ub294": 11, "\uc5b4\ub824\uc6c0\uc774": 11, "\uc904\uc774\uae30\uc704\ud574": 11, "\ub3c4\uc785": [2, 11], "2\uc5d0\uc11c": [11, 15], "\ub9d0\uae30\ub294": 11, "\ubcc0\ud654\uc5d0": 11, "\uc5c6\uc73c\ubbc0\ub85c": 11, "\ud655\ub960\uc801\uc73c\ub85c": [11, 17], "\ucd08\ubc18\uc758": 11, "sampling\ud574\uc11c": 11, "\ud559\uc2b5\ud558\ub3c4\ub85d": 11, "\uc801\uc6a9\ud574\ubcf8": 11, "\ubcf4\uc784": [11, 20, 21], "sampling\uc744": [9, 11, 21], "\uc801\uc6a9\ud558\uba74": 11, "\uc801\uc6a9": [2, 11, 14, 15, 21, 27], "\uc804\ubcf4\ub2e4": 11, "\uc88b\uc9c0": [11, 15, 16, 18, 27], "\ubcf4\uc778\ub2e4": [9, 11, 16], "\ub2e4\uc18c": [11, 21], "\ucde8\uc57d\ud588\ub358": 11, "imagenet": [9, 11, 15, 19], "64x64\uc640": 11, "cidar": 11, "\uae30\uc900": [11, 21, 22], "convolut": [11, 17, 20, 21, 27], "\ubaa8\ub378\uc774\ub098": 11, "\ubaa8\ub378\uc911\uc5d0\uc11c\ub294": 11, "fulli": [11, 20], "\ube44\ud574\uc11c\ub294": 11, "\ubd80\uc871\ud55c": [11, 23], "\uba74\uc774": 11, "speed\ub97c": 11, "\uba87\uba87": [2, 9, 11], "step\ub9cc": 11, "\uac00\ub3c4": 11, "fid\uac12\uc744": 11, "metric\uc73c\ub85c": 11, "biggan": [11, 22], "big": 11, "\ubaa8\ub378\ubcf4\ub2e4": [11, 18, 21, 26], "\ud0c0\uac9f\uc5d0": 11, "\uc218\uce58\ub098": 11, "recal": [9, 11], "metric\uc5d0\uc11c": 11, "capacity\ub97c": 11, "fid\uc640": [9, 11, 18, 22], "nll": [11, 20], "\ud559\uc2b5\ub7c9": 11, "\uc5b4\ub290\uc815\ub3c4": 11, "\ube44\ub840\ud568": 11, "synthesi": [2, 9, 12, 14, 17, 23], "2112": [9, 12], "10752": 12, "compvi": 12, "namkyeong": 12, "31": [12, 19, 23], "\uc624\ub298": [12, 17], "\uc54c\uc544\ubcfc": [12, 17, 26, 27], "model\uc785\ub2c8\ub2e4": 12, "\ub2e4\ub918\ub358": [12, 17], "\uc720\uc0ac\ud558\uac8c": [12, 15, 18, 26, 27], "\ucef4\ud4e8\ud130": 12, "\uc790\uc6d0\uc758": 12, "\uc18c\ubaa8\ub97c": 12, "\uc5bb\ub294\uac83\uc774": 12, "\ubaa9\ud45c\uc785\ub2c8\ub2e4": [12, 26], "\uc804\ubc18\uc801\uc73c\ub85c": [12, 18], "\uc8fc\uc5b4\uc84c\uc744\ub54c": 12, "\ud1b5\ud574\uc11c": [12, 17], "\ub514\ucf54\ub529\uc744": 12, "\ub418\ub3c4\ub85d": [12, 16], "\ud14c\uc2a4\ud2b8\ub97c": 12, "\uc9c4\ud589\ud558\uc600\ub2e4": [9, 12], "space\uc5d0\uc11c": [12, 19], "\ubd84\uc0b0\uc774": [2, 12], "\ucee4\uc9c0\uc9c0": 12, "\uc54a\ub3c4\ub85d": 12, "divergence\uc640": 12, "quantiz": [12, 21], "vq": 12, "\ud65c\uc6a9\ud558\uc600\ub2e4": 12, "\uc774\ubbf8\uc9c0\uc678": 12, "\ud14d\uc2a4\ud2b8\ub098": 12, "semat": 12, "map\uacfc": 12, "\uc815\ubcf4\ub294": 12, "tau_": 12, "\uc804\ub2ec\uc744": 12, "\ud558\uc600\uace0": [12, 18], "phi_i": 12, "_k": 12, "_v": 12, "\uc815\uc758\ub418\uace0": 12, "\uc911\uac04\uc758": 12, "matrix\uc774\ub2e4": 12, "attention\uc758": 12, "value\uc5d0": 12, "\ud574\ub2f9\ud558\uba70": 12, "qk": 12, "\uc9c4\ud589\ub41c\ub2e4": 12, "\ud568\uc218\ub294": 12, "\uac19\uc774\ud45c\ud604\ub41c\ub2e4": 12, "\uc8fc\ubaa9\ud560\ub9cc\ud55c": 12, "model\uc5d0\uc11c": 12, "dm": [12, 22], "function\uc73c\ub85c": [12, 21, 22], "\uc9c4\ud589\uc2dc\ud0a4\ub294\ub370": 12, "\ubc14\uafb8\uba74\uc11c": 12, "\uc591\uc744": [12, 15], "\uc904\uc600\ub2e4\ub294": 12, "\uc810\uc774\ub2e4": [12, 15, 22], "\uc9c4\ud589\ud558\uc600\ub294\ub370": 12, "\uadf8\uc911": 12, "\uc77c\ubd80\ub9cc": 12, "\uc18c\uac1c\ud558\ub3c4\ub85d": 12, "\ud558\uaca0\ub2e4": 12, "dataset\uc5d0\uc11c": [12, 16, 19, 21], "\ubf51\uc740": [12, 17], "\uc0d8\ud50c\uacfc": [12, 20], "sample\ub4e4\uc785\ub2c8\ub2e4": 12, "laion": [12, 16, 27], "\uc801\uc808\ud55c": [12, 14], "\uc810\uc218\uc640": [9, 12], "\ud6a8\uc728\uc131\uc744": 12, "layout\uc774": 12, "\uc8fc\uc5b4\uc84c\uc744": [2, 12], "layout": [2, 12], "peft": 13, "effeci": 13, "\ud558\ub098": [13, 21], "\uace0\uc815\ud55c": 13, "\ucc44\ub85c": 13, "\uba87": [13, 19, 20, 24], "fc": 13, "layer\ub9cc": 13, "task\uc758": [13, 20], "\uc5f0\uc0b0\ub7c9\uc744": 13, "\uc904\uc77c": [13, 19], "gpt": 13, "3\uc744": 13, "\uae30\uc900\uc73c\ub85c": [13, 18], "parameter\ub294": [13, 18], "10000\ubc30": 13, "gpu": [13, 27], "\uba54\ubaa8\ub9ac\ub294": 13, "3\ubc30\ub97c": 13, "latency\uac00": 13, "\uc5c6\uc74c": [2, 13], "\ud29c\ub2dd\ud558\ub294": 13, "\ud30c\ub77c\ubbf8\ud130\ub9cc\uc744": 13, "\ud29c\ub2dd\ud568\uc73c\ub85c\uc368": 13, "\uc790\uc6d0\uc73c\ub85c\ub3c4": 13, "\ub192\uac8c": 13, "\uc720\uc9c0\ud558\ub294": 13, "\ubc29\ubc95\ub860": 13, "\ud558\ub294\uac83": 13, "upstream": 13, "\ud559\uc2b5\uc2dc\ud0a4\ub294\uac83": 13, "\uc694\uccad\uc758": 13, "\uc2dc\uc791\ubd80\ud130": 13, "\uc644\ub8cc\uae4c\uc9c0": 13, "\uac78\ub9ac\ub294": 13, "llm\uc740": 13, "\uc2dc\ud0b4": [13, 18], "tuning\uc5d0\uc11c": 13, "\ud559\uc2b5\uc2dc\ud0a4\uba74": 13, "roberta": 13, "\ub2ec\uc774": 13, "\uac78\ub9bc": 13, "\uc5f0\uad6c\uc5d0\uc11c": 13, "over": [2, 13, 20, 25], "model\ub4e4\uc740": 13, "intrins": 13, "dimension\uc5d0": 13, "\uae30\ubc18\ud558\uace0": 13, "\uc0ac\uc2e4\uc5d0": 13, "\uc800\uc790\ub294": [13, 19], "\uacfc\uc815\uc5d0\uc11c\ub3c4": 13, "\uac16\uace0": 13, "\uac00\uc815\ud568": [13, 21], "\uace0\uc815\ud558\uace0": [13, 21], "decomposit": 13, "matrices\ub97c": 13, "\ucd5c\uc801\ud654\ud558\ub294": [13, 24], "\uc2dc\ud0a4\uae30\ub85c": 13, "decomposition\ub41c": 13, "\ub354\ud574\uc90c": 13, "\ud06c\uae30\ub294": [13, 15, 27], "\uc791\uc544": 13, "cost\ub97c": 13, "\ucd5c\ub300": [2, 13, 21], "3\ubc30\uae4c\uc9c0": 13, "\ubc14\uafd4\uc8fc\uba74": 13, "storag": [13, 27], "requir": [2, 13], "switch": 13, "overhead\ub97c": 13, "\uc678\uc5d0\ub3c4": 13, "\uc5c6\ub2e4": [2, 13, 14, 22], "\uae30\ubc95\ub4e4\uacfc": 13, "\uac00\ub2a5\ud558\ub2e4\ub294": 13, "\uc7a5\uc810\uc774": [13, 27], "transformer\uc758": [13, 18, 21], "w_q": [13, 27], "w_k": [13, 27], "w_v": [13, 27], "w_o": 13, "module\uc758": 13, "accumulated\ub41c": 13, "\uc5f0\uad6c\uc758": 13, "convention\uc744": 13, "optimizer\ub294": 13, "adam\uc744": 13, "\uc774\uc6a9": 13, "mlp": [13, 28], "feedforward": 13, "ffn": 13, "agnostic\ud558\uc9c0\ub9cc": 13, "model\uc5d0": [9, 13, 14, 19], "\uc9d1\uc911\ud568": 13, "agnost": [13, 21], "\uad6c\uc560\ubc1b\uc9c0": 13, "\ud574\uc11d\uc774": 13, "max": [2, 13], "phi": [13, 21, 22, 27, 28], "y_t": [2, 13], "y_": [13, 22], "parameterized\ub41c": 13, "x_i": [13, 28], "y_i": 13, "target\uc30d\uc73c\ub85c": 13, "phi_0": 13, "\ub418\uace0": [13, 16, 23, 25, 28], "maximize\ud558\uae30": 13, "\uc5c5\ub370\uc774\ud2b8\ub428": 13, "\ud06c\uae30\uc758": [13, 15], "\ud559\uc2b5\ud574": [13, 19], "\uc5c4\uccad\ub09c": 13, "cost\uac00": 13, "\ubc1c\uc0dd": [13, 23], "\ubc18\uba74": [2, 9, 13, 20, 22], "\uc804\uccb4\uac00": 13, "\uadf8\ubcf4\ub2e4": 13, "\ucc3e\uc544\ub0b4\ub294": 13, "\ubc14\ub00c\uae30": 13, "effecient\ud574\uc9d0": 13, "01": 13, "\uae4c\uc9c0": [2, 13, 26], "\uc791\uc544\uc9c8": 13, "\uae30\uc874\uc5d0\ub3c4": 13, "transfer": [2, 13, 16, 19, 23], "learning\uc5d0\uc11c": [13, 21], "effecient\ub97c": 13, "\uac00\uc9c0\uac00": 13, "perform": [13, 21, 25, 27], "\ucd94\uac00\ud558\ub294": [13, 18, 27], "hardwar": 13, "parellelism\uc774": 13, "\uc5c6\ub2e4\uba74": 13, "bottleneck": [13, 21, 27], "\ucd94\uac00\ud574\ub3c4": 13, "\uc99d\uac00\ud574": 13, "\uc0ac\uc6a9\ud558\uae30": [9, 13], "\uc5b4\ub824\uc6e0\uc74c": 13, "prefix": 13, "tuning\uc740": [13, 18, 19], "optimize\uac00": 13, "ba": 13, "\uacf1\ud574\uc9c4": 13, "vector\ub07c\ub9ac": 13, "coordin": 13, "wise\ud558\uac8c": 13, "\uc774\ub77c": 13, "scaling\ub428": 13, "rate\ucc98\ub7fc": 13, "tuning\ud574\uc11c": 13, "r\uacfc": 13, "\uc774\ub098": 13, "\uc0ac\uc6a9\ud55c\ub2e4\uace0": 13, "actual": 13, "defin": 13, "lora_a": 13, "new_zero": 13, "num_embed": 13, "lora_b": 13, "embedding_dim": 13, "lora_alpha": 13, "matrix": 13, "requires_grad": [13, 24], "reset_paramet": 13, "hasattr": 13, "wai": 13, "zeros_": 13, "normal_": [13, 28], "bool": 13, "merge_weight": 13, "sure": 13, "transpos": 13, "mark": 13, "tensor": [13, 24, 27], "after_a": 13, "padding_idx": 13, "max_norm": 13, "norm_typ": 13, "scale_grad_by_freq": 13, "spars": [13, 21, 27], "w_0x": 13, "bax": 13, "lora\ub97c": 13, "\uc774\uc6a9\ud558\uba74": [13, 25], "inference\uc2dc": 13, "\ud558\ub77d\uc774": 13, "\uacbd\uc6b0\uc5d4": 13, "\ucd94\uac00\ud558\uba74": [13, 18], "overhead\uac00": 13, "\ub0ae\uc74c": 13, "\ucd5c\uc18c\ud654\ud558\uae30": [13, 20], "weight\ub9cc": 13, "\uc801\uc6a9\ud558\uace0": 13, "module\uc740": 13, "\uace0\uc815\ud568": 13, "175b\ub97c": 13, "vram\uc740": 13, "2tb\uc5d0\uc11c": 13, "350gb": 13, "checkpoint": [13, 16], "size\ub294": 13, "350gb\uc5d0\uc11c": 13, "35mb\ub85c": 13, "\uc904\uc784": 13, "\ube68\ub77c\uc9d0": 13, "bert": 13, "\ub300\ubd80\ubd84\uc758": [2, 13], "\uacbd\uc6b0\uc5d0\uc11c": 13, "\uc88b\uc74c": [13, 21], "valid": [13, 18, 24], "accuraci": 13, "transformer\uc5d0\uc11c": [13, 21], "matrix\uc5d0": 13, "r\uc744": 13, "\uac83\ubcf4\ub2e4": [13, 25, 27], "matrices\uc5d0": 13, "\uc88b\uc558\uc74c": 13, "\ub274\ub7f4\ub124\ud2b8\uc6cc\ud06c\uc758": 13, "activation\uc744": 13, "\uc904\uc774\uae30\ub3c4\ud558\uace0": 13, "\ub298\ub9ac\uae30\ub3c4\ud558\ub294": 13, "\uc5b4\ub311\ud130\ub97c": 13, "\uc911\uac04\uc5d0": 13, "\uc0bd\uc785\ud558\ub294": 13, "lora\ubcf4\ub2e4": 13, "\uc0ac\uc6a9\ud558\uba74\uc11c": [13, 19], "\uc54c\ub824\uc838\uc788\uc73c\uba70": 13, "3\ub97c": 13, "\ud588\uc744\ub54c": 13, "\ubcf4\ub2e4\ub3c4": [13, 25], "\uc8fc\uc7a5\ud558\uace0": 13, "\ud559\uc2b5\uc2dc\uac04\ub3c4": 13, "\uc9e7\uc544": 13, "a100": [13, 23], "30\ubd84\ub9cc\uc5d0": 13, "\ud29c\ub2dd\ud560": 13, "loralib": 13, "\uc124\uce58": 13, "pip": 13, "instal": 13, "altern": [13, 24], "git": 13, "microsoft": 13, "befor": 13, "in_featur": 13, "out_featur": 13, "after": 13, "add": [13, 23], "parameter\ub9cc": 13, "bigmodel": 13, "string": 13, "lora_": 13, "mark_only_lora_as_train": 13, "loop": [13, 27], "dataload": [13, 24], "checkpoint\ub97c": [13, 16], "\uc800\uc7a5\ud560": 13, "\ub54c\uc5d4": 13, "state_dict": 13, "\uc800\uc7a5\ud558\uac8c": 13, "save": 13, "checkpoint_path": 13, "lora_state_dict": 13, "\ubd88\ub7ec\uc62c": 13, "load_state_dict": 13, "strict": 13, "load": [13, 15, 23], "ckpt_pretrain": 13, "pt": [13, 21], "ckpt_lora": 13, "llm": [13, 25], "\ud29c\ub2dd": 13, "gpu\ub85c": [13, 16], "\uac00\ub2a5\ud560\uae4c": [13, 18], "\uc18c\uac1c\ud569\ub2c8\ub2e4": [13, 18, 23, 26, 27], "da": 13, "nhctrrve": 13, "guid": [2, 14, 26], "differenti": [9, 14, 28], "2108": 14, "01073": 14, "03": [14, 27], "\ubd84\uc57c\uc5d0\uc11c\uc758": 14, "\uc9c4\ud654": 14, "\uc18d\ub3c4\uac00": 14, "\uacc4\uc18d": 14, "\ub418\uc5b4\uc624\uace0\uc788\ub2e4": 14, "\uc0ac\uc6a9\uc790\uac00": [14, 15, 19], "\uc774\ub04c\uc5b4\ub0b4\ub824\ub294": 14, "\ubd84\uc57c\ub3c4": 14, "\ud65c\ubc1c\ud788": [14, 16], "\uc9c4\ud589\ub418\uace0\uc788\ub2e4": 14, "\ubc29\uc2dd\uc73c\ub85c\uc758": 14, "editing\uc5d0\ub294": 14, "\uba87\uac00\uc9c0": [14, 20], "\ub2e8\uc810\uc774": [9, 14, 16], "sdedit\uc740": 14, "\ubb38\uc81c\uc810\uc744": [9, 14, 16, 21, 23], "\ud574\uacb0\ud574\ub098\uc544\uac14\ub2e4\ub294": 14, "\uc810\uc744": [14, 16], "contribution\uc73c\ub85c": 14, "\uc81c\uc2dc\ud558\uc600\ub2e4": 14, "abstract\uc5d0\uc11c": 14, "\ub9d0\ud55c": 14, "editing\uc774\ub780": 14, "\uc720\uc800\uac00": [14, 19], "\uc0dd\uc131\ud558\uace0\uc790": [14, 17, 28], "guide\ub97c": [9, 14], "\uc81c\uc2dc\ud558\uba74": 14, "\uc774\ub54c": [14, 21, 23, 24, 26, 27, 28], "\ub450\uac00\uc9c0\uc758": 14, "\ud3c9\uac00\uc694\uc18c\uac00": 14, "\uc788\ub294\ub370": [2, 14, 15, 16, 27], "faith": 14, "\uc720\uc800\uc758": 14, "\ub530\ub974\ub294\uc9c0": 14, "realist": [2, 14, 26], "real\ud55c\uc9c0": 14, "\uc5f0\uad6c\ubc29\uc2dd\uc740": 14, "\ub450\uac00\uc9c0\ub85c": 14, "\ub098\ub25c\ub2e4": 14, "sota\ub97c": [9, 14, 16, 18, 21, 22], "\uc774\ub8ec": 14, "\uc774\ubbf8\uc9c0\uc5d0\uc11c": [14, 18], "edit\ub41c": 14, "\ub2e8\uc810": 14, "dataset\uc774": 14, "\ud544\uc694\ud558\uace0": 14, "condition\ub9c8\ub2e4": 14, "\uc7ac\ud559\uc2b5\uc744": 14, "\uc694\uad6c": 14, "inversion\ud55c": 14, "vactor\ub97c": 14, "\uc870\uc791\ud574": 14, "function\uc774": [14, 20], "\uc815\uc758\ub418\uc5b4\uc57c\ud558\uace0": 14, "\ud544\uc694\ud558\uc9c0": [14, 20], "\uc54a\ub2e4": 14, "function\uacfc": [14, 20], "\uc7ac\ud559\uc2b5\uc774": 14, "\ud55c\uac1c\uc758": 14, "weight\ub85c": 14, "condition\uc758": 14, "idea": 14, "\uc774\ubbf8\uc9c0\ub4e4\uc740": 14, "\ubd84\ud3ec\uc5d0\uc11c": [14, 19], "\ubd84\ud3ec\uac00": [2, 14, 20, 28], "\ub192\uc740\uacf3\uc73c\ub85c": 14, "\ud574\ub098\uac00\uba74": 14, "\uc5bb\uc5b4\ub0bc": 14, "score\ub294": [14, 18], "\ubc00\ub3c4": 14, "\ud568\uc218\uc758": 14, "\uc21c\uac04": 14, "\uae30\uc6b8\uae30": 14, "\ubbf8\ubd84\uac12": 14, "\uc815\uc758\ud55c\ub2e4": 14, "\uc8fc\uc785\ud558\ub294\ub370": 14, "\uc8fc\uc785\ud55c\ub2e4": 14, "\ub610\ub2e4\ub978": [9, 14], "probabl": [2, 14], "ddpm\uacfc\uc758": 14, "\ucc28\uc774\ub294": [14, 25], "\uc815\uc758\ud558\ub294": 14, "equation\uc758": 14, "\uc815\ub3c4\uc774\ub2e4": 14, "1907": 14, "05600": 14, "setup": [2, 14], "level\uc744": 14, "\uc774\ubbf8\uc9c0\uc704\uc5d0": 14, "patch\ub97c": 14, "stroke\ub97c": 14, "coarse\ud55c": 14, "stroke\uc758": 14, "procedur": 14, "\ub2ec\ub9ac": [2, 9, 14, 18, 20, 24, 28], "sde\uc758": 14, "\uc644\uc804\ud788": [14, 20], "noise\ud654\ub41c": 14, "noise\ub85c\ubd80\ud130": 14, "\uc9c4\ud589\ud560": [14, 21], "\ud544\uc694\uac00": [14, 20, 23], "t_": [2, 14], "\uc9c0\uc815\ud55c": [14, 25], "\uc815\uc758\ud574\uc57c\ud558\ub294\ub370": 14, "realistic\ud558\uc9c0\ub9cc": 14, "\ud558\uc9c0\uc54a\uc740": 14, "faithful\ud558\uc9c0\ub9cc": 14, "artistic\ud55c": 14, "\uc5bb\uac8c\ub41c\ub2e4": 14, "sdedit\uc758": 14, "\uacfc\uc815\uc774\ub2e4": 14, "better": [14, 25], "\uc885\ud569\uc801\uc778": 14, "\uc9c0\ud45c\ub85c": [14, 18], "survey\ub97c": 14, "\ubc29\uc2dd\ub4e4\uacfc": 14, "stylegan": 14, "ada": 14, "sdedit\uc774": 14, "\uc790\uc5f0\uc2a4\ub7fd\uace0": [14, 16], "\ub530\ub974\ub294": [2, 14, 27], "origin": [14, 15], "blend": 14, "\uc804\ud1b5\uc801\uc778": 14, "\uae30\ubc95\uacfc": 14, "\ube44\uad50\ud574\ub3c4": 14, "sdxl\uc740": 15, "diffusion\uacfc": 15, "\ube44\uad50\ud558\uba74": 15, "\ubc30": [15, 21], "\uaddc\ubaa8\uc758": [15, 18, 19], "unet\uc744": 15, "\ube14\ub85d\uacfc": 15, "sdxl\uc5d0\uc11c": 15, "encoder\ub85c": 15, "\uc0ac\uc6a9\ub418\uba74\uc11c": 15, "\ud30c\ub77c\ubbf8\ud130\uac00": 15, "\uc99d\uac00\ud588\ub2e4": 15, "\ub2e4\uc218\uc758": 15, "\ubc29\ubc95\uacfc": [2, 9, 15, 18], "\ube44\uc728\uc5d0": 15, "sdxl\uc744": 15, "\ud559\uc2b5\ud560": [9, 15, 18, 20, 24], "\uc124\uacc4\ud588\ub2e4": 15, "sdxl\uc758": 15, "\uc0d8\ud50c\uc758": [15, 18], "\uc2dc\uac01\uc801\uc778": [15, 19], "fidelity\ub97c": 15, "\ud5a5\uc0c1\uc2dc\ud0a8": 15, "\ub300\ud3ed": 15, "\uc8fc\uc694": 15, "\uae30\ub2a5\uc774\ub77c": 15, "3\ubc30": 15, "\ud615\ud0dc\uc758": [15, 19, 23], "\uac10\ub3c5": 15, "supervis": [15, 20], "\uac04\ub2e8\ud558\uba74\uc11c\ub3c4": 15, "\ud6a8\uacfc\uc801\uc778": 15, "\ucd94\uac00\uc758": 15, "\ud5a5\uc0c1\ud558\ub294": 15, "latent\ub97c": 15, "\ubcc4\uac1c\uc758": 15, "img": [15, 24], "\uadf8\ub9bc": [2, 15, 19, 20], "1\uc5d0\uc11c": 15, "\ub192\uc778": 15, "sdxl\uc774": 15, "sd\ubcf4\ub2e4": 15, "\uc2dc\uac01\ud654\ud588\ub294\ub370": 15, "128x128": 15, "\ud65c\uc6a9\ud558\uace0": 15, "sdedit\uc744": 15, "\uc801\uc6a9\ud55c\ub2e4": 15, "sdxl\uacfc": 15, "autoencoder\ub97c": 15, "sd\uc640": 15, "\ube14\ub85d\uc758": 15, "heterogen": 15, "\uc0ac\uc6a9\ud588\ub2e4\ub294": [15, 22], "\ud14c\uc774\ube14": [15, 22], "1\uc744": 15, "\ucc38\uace0\ud558\uba74": [15, 22], "highest": 15, "level\uc5d0\uc11c": 15, "\ube14\ub7ed\uc744": 15, "level\uc5d0\uc11c\ub294": 15, "unet\uc5d0\uc11c": 15, "lowest": 15, "8x": 15, "conditioning\uc744": [9, 15], "encoder\ub97c": 15, "l\uacfc": 15, "openclip": 15, "bigg\ub97c": 15, "\ucc44\ub110": 15, "\ucd95\uc5d0": 15, "encoder\uc758": [15, 18, 19], "\uc8fc\uae30": [9, 15], "\ub808\uc774\uc5b4\ub97c": 15, "\uc0ac\uc6a9\ud588\uc73c\uba70": 15, "openclip\ub85c\ubd80\ud130": 15, "pool": 15, "embedding\uc744": [15, 19, 22], "condition\uc73c\ub85c": [9, 15, 22], "\ucd94\uac00\ud588\ub2e4": 15, "\ubcc0\ud654\ub294": 15, "\ud30c\ub77c\ubbf8\ud130": [15, 23, 28], "\uc0ac\uc774\uc988\uac00": 15, "6b\ub85c": 15, "encoder\ub294": [15, 18, 28], "817m": 15, "\ud53d\uc140": [2, 15, 25], "\uc774\ud558": [2, 15, 20], "\uc2dc\ud0a4\uac70\ub098": 15, "upscale\ud558\uc5ec": 15, "\ucd5c\uc18c": 15, "\ud06c\uae30\uac00": 15, "\uc815\ud574\uc9c0\ub294": 15, "\ubb38\uc81c\uc810\uc774": 15, "\ubc1c\uc0dd\ud55c\ub2e4": 15, "\uc800\ud558\uc2dc\ud0a4\uac70\ub098": 15, "\uc77c\ubc18\ud654\ub97c": 15, "\uc14b\uc758": 15, "\uc2dc\uac01\ud654\ud574\uc8fc\ub294": 15, "\uadf8\ub9bc\uc774\ub2e4": 15, "\uc81c\uc548\ub41c": 15, "conditiong": 15, "\ud06c\uae30": 15, "\ubbf8\ub9cc\uc758": 15, "39": 15, "\ub098": [15, 23, 25], "\ub2ec\ud55c\ub2e4": 15, "upscal": 15, "blur": 15, "\uac00\uc838\uc640": 15, "\uc544\ud2f0\ud329\ud2b8\uac00": 15, "\uc0dd\uae34\ub2e4": [15, 25], "\uc6d0\ub798\uc758": 15, "\ud574\uc0c1\ub3c4\uc5d0\uc11c": 15, "\uc8fc\uc5c8\ub2e4": [15, 19], "\uc5b4\ub5a0\ud55c": [15, 17, 23], "rescal": [15, 22], "\ud06c\uae30\uc778": 15, "w_": [2, 15, 23], "\uc81c\uacf5\ud574": 15, "\uc904": [2, 9, 15, 20, 22, 25], "\ucd94\uac00\ub41c\ub2e4": 15, "\ud574\uc0c1\ub3c4\ub97c": [15, 17, 26], "\uc815\ud560": 15, "\ud574\uc0c1\ub3c4\uc5d0": 15, "\uc758\uc874\uc801\uc778": 15, "\uc5f0\uad00\uc2dc\ud0a4\ub3c4\ub85d": 15, "imagenet\uc73c\ub85c": 15, "\uc9c4\ud589\ud574": 15, "conditiong\uc5d0": 15, "\uc6b0\uc218\uc131\uc744": 15, "\uc785\uc99d\ud588\ub2e4": 15, "cin": 15, "\uc2dc\ucf30\uace0": 15, "70k": 15, "\uc7a5": 15, "nocond": 15, "\ud45c": [15, 20], "\ubcf4\ub2e4\uc2dc\ud53c": 15, "IS": [9, 15, 21], "4\uc5d0\uc11c": [15, 20], "\uace0\uc591\uc774": [15, 20], "\uba38\ub9ac\uac00": [15, 17], "\uc798\ub824\uc9c4": 15, "cropping\uc73c\ub85c": 15, "\uc0dd\uc131\ub418\uc5c8\uae30": 15, "\ub54c\ubb38\uc774\ub2e4": 15, "\uc81c\uc548\ud55c\ub2e4": [15, 16, 19, 22], "\uade0\ub4f1\ud558\uac8c": 15, "\ub192\uc774": 15, "\ub108\ube44": 15, "\ucd95\uc744": 15, "\ubaa8\uc11c\ub9ac\uc5d0\uc11c": 15, "\ud53d\uc140\uc758": 15, "\uc9c0\uc815\ud558\ub294": 15, "\uc815\uc218": [2, 15], "\uc0d8\ud50c\ub9c1\ud55c\ub2e4": [15, 19], "fourier": 15, "\uc784\ubca0\ub529\uc744": 15, "\ud30c\ub77c\ubbf8\ud130\ub85c\uc368": 15, "\uc785\ub825\ud55c\ub2e4": 15, "conditioning\uacfc": 15, "\uc784\ubca0\ub529": [15, 19], "\ud30c\ub77c\ubbf8\ud130\ub85c": 15, "\ubfd0\ub9cc": [15, 20], "dm\uc5d0\uc11c\ub3c4": 15, "\uc0ac\uc6a9\ub420": [15, 18, 19, 20], "\uac15\uc870\ud55c\ub2e4": 15, "conditioning\uc740": 15, "\uc27d\uac8c": [2, 15, 17, 19], "\uacb0\ud569\ub420": 15, "\ud0c0\uc784\uc2a4\ud15d": 15, "\uc784\ubca0\ub529\uc5d0": 15, "\ucd94\uac00\ud55c\ub2e4": 15, "512x512": [15, 27], "1024x1024": [15, 17, 18, 26], "\ud604\uc2e4": 15, "\uc138\uacc4\uc5d0\uc11c": 15, "\ubd80\uc790\uc5f0\uc2a4\ub7fd\ub2e4": 15, "\uc138\uacc4\uc5d0\uc11c\ub294": 15, "\ube44\uc728\uc744": 15, "\ub9ce\uace0": [15, 18], "\ud48d\uacbd": 15, "\ube44\uc728\uc758": 15, "\uc9c0\ub2c8\uace0": [15, 24, 26], "\ub2e4\ub8f0\uc218": 15, "\ud30c\uc778\ud29c\ub2dd\ud588\ub2e4": 15, "\ud53d\uc140\uc218\ub97c": 15, "\ub9cc\ud07c": [2, 15, 21], "64\uc758": 15, "\ubc30\uc218\ub97c": 15, "\uc9c0\ub2c8\ub3c4\ub85d": 15, "ratio": 15, "\ubc30\uce58\ub294": 15, "\ubc84\ud0b7": 15, "\uc2a4\ud15d\ub9c8\ub2e4": [2, 15], "\ubc88\uac08\uc544": [15, 24], "\uac00\uba70": 15, "\ud0c0\uac9f": 15, "conditioning\uc73c\ub85c": 15, "\uc8fc\uc5c8\uc73c\uba70": 15, "\uacf5\uac04\uc5d0": 15, "\uc784\ubca0\ub529\ub418\ub294": 15, "tgt": [15, 16], "\ud615\ud0dc\ub85c": [15, 22, 27], "\ud45c\ud604\ub41c\ub2e4": 15, "\uace0\uc815\ub41c": [15, 19, 24], "\ube44\uc728\ubc0f": 15, "\ud574\uc0c1\ub3c4\uc758": 15, "pretraining\uc774": 15, "\ub9c8\uce5c": 15, "\ud30c\uc778\ud29c\ub2dd": [15, 18], "\ud559\uc2b5\ud588\uace0": 15, "\ucd95\uc73c\ub85c": 15, "2\uc808\uc5d0\uc11c": 15, "\uc18c\uac1c\ud55c": 15, "\uae30\uc220\uacfc": 15, "\uacb0\ud569\ud588\ub2e4": 15, "16\uc5d0\uc11c": 15, "sd\ub294": 15, "\ud558\ub098\uc774\uace0": 15, "autoencoder\uc758": 15, "space\ub97c": [15, 19], "composition\uc740": 15, "ldm\uc73c\ub85c\ubd80\ud130": 15, "\ud45c\ud604\ub418\uc9c0\ub9cc": 15, "local": [15, 16], "frequenc": [15, 21], "\ub514\ud14c\uc77c\ud55c": 15, "\ud5a5\uc0c1\ud558\uace0\uc790": 15, "\ud5a5\uc0c1\ud588\ub2e4": 15, "\ub05d\uc73c\ub85c": 15, "sd\ub97c": 15, "\uc544\ud0a4\ud14d\ucc98\uc5d0\uc11c": 15, "\ubc30\uce58\uc0ac\uc774\uc988": 15, "average\ub97c": 15, "\uba54\ud2b8\ub9ad\uc5d0": 15, "\uc815\ub9ac\ud574\uc8fc\ub294": 15, "\uc808\uc785\ub2c8\ub2e4": 15, "step\uc740": [15, 18], "step\uc744": [15, 19], "model\ub97c": [15, 19], "\ub0b4\ubd80": 15, "\uc14b\uc73c\ub85c": 15, "2\uc5d0": 15, "\ub098\uc640\uc788\ub294": 15, "\ubd84\ud3ec\uc5d0": [15, 19], "600": 15, "000": [2, 15], "\uc0ac\uc774\uc988\ub85c": 15, "2048\ub85c": 15, "\ud559\uc2b5\uc2dc\ucf30\uace0": 15, "\ub9c8\uce68\ub0b4": 15, "offset": 15, "11": [2, 15, 25], "\uc218\uc900\uacfc": 15, "\ub2e4\uc911": 15, "\ube44\uc728": 15, "\uc601\uc5ed\uc758": 15, "\ube44\uc728\ub85c": 15, "\uacbd\ud5d8\uc801\uc73c\ub85c": 15, "6\ucc98\ub7fc": 15, "\ucc3e\uc558\ub2e4": 15, "\uadf8\ub9bc\uc774": [15, 18], "stage\ub97c": 15, "\ud2b9\ud654\ub41c": 15, "\ubcc4\ub3c4\uc758": [9, 15], "ldm\uc744": [15, 19], "sdedit\uc5d0\uc11c": 15, "ediff": 15, "\ub530\ub790\uc73c\uba70": 15, "\uc2a4\ucf00\uc77c\uc5d0": 15, "inference\uc5d0\uc11c": 15, "diffuse\uc640": 15, "denoise\ub97c": 15, "\ub123\uc5c8\ub2e4": 15, "\uc2a4\ud15d\uc740": 15, "\uc120\ud0dd\uc774\uc9c0\ub9cc": 15, "\ubc30\uacbd": [15, 19], "\uc0ac\ub78c": [15, 20, 25], "\ub514\ud14c\uc77c\uc5d0\uc11c": 15, "13": [2, 15, 20, 25], "\uc788\uc5c8\ub2e4": [9, 15, 16], "your": 16, "One": [16, 19], "2303": 16, "03231": 16, "sty": 16, "lize": 16, "ne": 16, "\ud55c\uc7a5\uc758": 16, "\uc785\ud788\uace0\uc790\ud558\ub294": 16, "\uc9c4\ud589\uc911\uc774\ub2e4": 16, "\uc774\uc804\uae4c\uc9c0\uc758": 16, "\uc5f0\uad6c\ub4e4\uc740": 16, "\ud55c\uc7a5\uc529\uc744": 16, "\ud65c\uc6a9\ud558\ub824\ub294": 16, "\uc2dd\uc774": 16, "\uc8fc\ub97c": 16, "\uc774\ub8e8\uc5c8\ub2e4": 16, "\ubc29\uc2dd\uc5d0\ub294": 16, "face\ub97c": 16, "\uc758\uc874\ub3c4\uac00": 16, "\ucee4\uc11c": [16, 20], "style\uc744": [16, 19], "\uc785\ud788\uae30": 16, "\ud798\ub4e4\ub2e4": 16, "space\uc548\uc5d0\uc11c": 16, "content": [16, 26], "\uc815\ubcf4\uc640": 16, "entangl": [16, 17, 23], "\ub418\uc5b4\uc788\ub2e4": 16, "styo\ub294": 16, "\ud3ec\uc6a9\ud558\ub294": 16, "base\ubaa8\ub378\ub85c": 16, "\ucc44\uc6a9\ud55c\ub2e4": 16, "stage\ub85c": 16, "\uad6c\uc131\ub418\ub294\ub370": 16, "disentangl": 16, "learner": 16, "idl": 16, "\ubd84\ub9ac": 16, "grain": 16, "fcc": 16, "idl\ub85c\ubd80\ud130": 16, "\ubd84\ub9ac\ub41c": 16, "content\uc640": 16, "\uc6d0\ud558\ub294\ub300\ub85c": 16, "\uc7ac\uc870\ud569": 16, "src": 16, "detail\ud55c": 16, "\uc720\uc9c0\ud558\uae30\uc704\ud574": 16, "map\uc744": 16, "\uc7ac\uc0ac\uc6a9\ud558\ub294": 16, "trick\uc744": 16, "\uc81c\uc548\ud588\ub2e4": 16, "gan\uc774": [16, 19], "\ubd84\uc57c\ub97c": 16, "\uc7a5\uc545\ud558\ub358": 16, "\ub4f1\uc7a5\uc73c\ub85c": [16, 18], "\uc8fc\ubaa9\uc744": [16, 25], "\uc2dc\uc791\ud588\ub2e4": 16, "prompt\ub97c": [9, 16, 19], "\uac00\ub2a5\ud574\uc84c\uc9c0\ub9cc": 16, "\ubd80\ubd84\uae4c\uc9c0": 16, "control\ud558\uae30\uc5d0\ub294": 16, "fine\ud55c": 16, "\uc815\ubcf4\uae4c\uc9c0": 16, "model\uc774\ub2e4": 16, "\ubcf4\uc774\uba74\uc11c": 16, "stylegan\uc744": 16, "\ubca0\uc774\uc2a4\ub85c": 16, "dataset\uc744": [16, 25], "\uc758\uc874\uc131\uc774": 16, "\ucee4": 16, "artist": 16, "\uc785\ud788\ub294\ub370": 16, "\ud55c\uacc4\ub97c": [2, 16, 20], "\uac1c\uc120\ud55c": 16, "\uac04\uc758": [2, 9, 16, 20], "transfer\ub97c": 16, "disentagl": 16, "\ubd84\ub9ac\ud558\ub294": 16, "s_": 16, "\ubc18\ub300": [16, 20], "\uc548\uc5d0": [16, 25], "a\uc758": [16, 17], "conext": 16, "\ubc30\uc81c\ud568\uacfc": 16, "\ud3ec\ud568\ud558\uae30\uc704\ud574": 16, "\uc55e\uc5d0": [16, 20, 22], "negat": 16, "\ubd80\uc815\uc758": 16, "\uc758\ubbf8\ub97c": [9, 16, 18], "\ub2e8\uc5b4": [16, 19], "except": 16, "auxiliari": [16, 25], "\uc14b\uc744": [16, 19], "\uad6c\uc131\ud574": 16, "ffhq": [16, 17], "\uc784\uc758\ub85c": 16, "\ud6a8\uacfc": [16, 20, 25], "\ud559\uc2b5\ud568\uc73c\ub85c\uc368": [16, 19, 24], "prompt\uac04": 16, "disentanglement\ub97c": 16, "\ud5a5\uc0c1": [16, 18, 21], "\uc774\ubbf8\uc9c0\uc5d0\ub294": 16, "\uc774\ubbf8\uc9c0\ub9cc\uc758": 16, "\uc8fc\uc785": [2, 16], "style\uacfc": [16, 17], "\uad6c\ubcc4\ud558\ub294\ub370": 16, "\ub3c4\uc6c0\uc744": 16, "\uc90c": 16, "idl\uc758": 16, "\ud559\uc2b5\ub9cc\uc73c\ub85c": 16, "transfer\uac00": 16, "\uc774\ubbf8\uc9c0\ucc98\ub7fc": 16, "\uc783\uc5b4\ubc84\ub9ac\ub294": 16, "\uac1c\uc120\ud558\uae30\uc704\ud574": 16, "\ub3c4\uc785\ud558\uc600\ub2e4": 16, "idl\ub85c": 16, "\uc870\ud569": 16, "recombin": 16, "\uc720\uc9c0\ud558\ub3c4\ub85d": 16, "trick": 16, "ldm\uc740": [16, 19], "\uc8fc\uc785\ud558\uae30\uc704\ud574": 16, "mechanism\uc744": 16, "promt": 16, "paper\uc5d0\uc11c": 16, "m\uc758": 16, "layout\uc5d0": 16, "\ubbf8\uce5c\ub2e4": 16, "mask\ub97c": 16, "\uacfc\uc815\uc5d0": 16, "\uc8fc\uc785\ud569\uc73c\ub85c\uc368": 16, "\uc720\ub3c4": [16, 22], "map\uc758": 16, "replace\ud558\uc9c0\uc54a\uace0": 16, "content\uc5d0": 16, "index\ub9cc": 16, "\uc120\ud0dd\uc801\uc73c\ub85c": 16, "replac": 16, "index": [16, 19], "time\uc5d0\uc11c": 16, "n\ubc88": 16, "\uc0ac\uc6a9\ud568\uc73c\ub85c\uc11c": 16, "n_": 16, "\uc2e4\ud5d8\uc0c1": 16, "\uc774\ud558\uc758": [16, 23], "\ucd94\ucc9c": 16, "5b": 16, "ak47": 16, "m4a1": 16, "adam": [16, 18, 27], "400": 16, "ldm\uacfc": 16, "\ub3d9\uc77c": [16, 20], "styo\uac00": 16, "identity\uc640": 16, "\uc720\uc9c0\ud568\uacfc": 16, "\uc790\uc5f0\uc2a4\ub7fd\uac8c": [9, 16], "\uacb0\uacfc\ubb3c\uc744": 16, "\uc0dd\uc131\ud574\ub0b8\ub2e4": [16, 19], "study\ub3c4": 16, "\ubaa8\ub378\ub4e4\uc5d0": [16, 18], "effect": [16, 17, 26, 27], "contrast": [9, 16, 21, 25], "templat": 16, "\ub123\uace0": 16, "\ud559\uc2b5\ud560\uacbd\uc6b0": 16, "overfitting\uc774": 16, "\uc2ec\ud558\uace0": 16, "\uc815\ubcf4\uc758": 16, "\ubd84\ub9ac\uc5d0": 16, "\uc5b4\ub824\uc6c0\uc744": [9, 16, 21], "detail\uc744": [16, 19], "set\uc758": 16, "trick\ub3c4": 16, "\uc801\uc6a9\ud558\ub294\uac83\uc774": 16, "\uc0dd\uc131\ud574\ub0c8\ub2e4": 16, "inference\ud560": 16, "\ubcf4\uc774\uc9c0\ub9cc": 16, "fcc\ub97c": 16, "\ud3ec\ud568\ud560": 16, "\ub192\uc544\uc838": 16, "significant\ud55c": 16, "\uc0dd\uc131\ub418\ub294\uac83\uc744": 16, "photorealistic\uc5d0\uc11c": 16, "artistic\ud558\uac8c": 16, "\ubc14\ub00c\uace0": 16, "\ub9c8\ucc2c\uac00\uc9c0\ub85c": [16, 18, 19], "\ub098\uc624\ub294": [16, 18, 21, 22, 23], "idl\uacfc": 16, "gan\uc744": [16, 17, 20], "\ubaa8\ub378\ub4e4\ubcf4\ub2e4": [16, 27], "\uc0dd\uc131\ud574\ub0bc": 16, "singl": [16, 19, 27], "10\ubd84\uc774": 16, "\uac78\ub9ac\ubbc0\ub85c": 16, "efficiency\uac00": 16, "\ubabb\ud558\ub2e4\ub294": 16, "2019": 17, "1812": 17, "04948": 17, "huangzh13": 17, "12": [2, 17, 20, 24, 28], "stylegan\uc785\ub2c8\ub2e4": 17, "gan\uacfc": 17, "\ubcc0\uacbd\ud568\uc73c\ub85c\uc368": 17, "\uc62c\ub9ac\uace0": 17, "feature\uc758": 17, "control\uc774": 17, "loss\ub098": 17, "discrimin": [17, 20, 24], "\uac1c\uc120\uc5d0": 17, "\ubcf4\ub3c4\ub85d": 17, "\ud558\uc8e0": 17, "\uc81c\uc548\ud558\uc5ec": 17, "\ub192\uc774\uba74\uc11c": 17, "\uac00\ub2a5\ud574\uc84c\uc2b5\ub2c8\ub2e4": 17, "\uc81c\uc548\ud588\uc2b5\ub2c8\ub2e4": 17, "\uc911\uc5d0\uc11c": [17, 21], "contribution\uc744": [17, 22], "abstract\uc5d0\ub294": 17, "\ubb38\uc7a5\uc774": 17, "lead": 17, "automat": [17, 26], "unsupervis": [17, 20], "ident": [17, 20, 23], "freckl": 17, "enabl": [17, 19], "intuit": 17, "\uc81c\uc548\ud55c": [17, 18], "\uad6c\uc870\uac00": 17, "\uc77c\uc744": 17, "\uc124\uba85\ud558\ub294": [17, 18, 19, 20], "\ubcf4\uc2dc\uba74": 17, "attribute\uc758": 17, "separation\uc774": 17, "\uc598\uae30\ud558\uace0": 17, "\ubd80\ubd84\uc774": [17, 18], "stylegan\uc758": 17, "\ud2b9\uc9d5\uc774\ub77c\uace0": 17, "\uc0ac\uc6a9\uc790\ub294": [17, 19], "\ubaa9\uc801\uc744": 17, "\uc790\uc2e0\uc774": 17, "\ub9cc\ub4e4\uace0\uc790": 17, "\ud488\uc9c8\uc774": 17, "\uc88b\ub354\ub77c\ub3c4": 17, "\uc0ac\uc6a9\uc790\uc758": 17, "\uc758\ub3c4\uc640": 17, "\uc0c1\uad00\uc5c6\ub294": 17, "\ub79c\ub364\ud55c": [17, 18], "\ub0b4\ubc49\uc5b4\uc900\ub2e4\uba74": 17, "\uc2e4\uc6a9\uc131\uc774": 17, "\uc88b\ub2e4\uace0": [17, 18, 27], "\uc5c6\uc744": [17, 25, 26], "\uadfc\ub798\uc5d0": 17, "\uc778\uae30\ub97c": 17, "\uc5bb\uc5c8\ub358": 17, "\uc774\uc720\ub3c4": 17, "\ub204\uad6c\ub098": 17, "\uc810\ub3c4": 17, "\ud55c\ubaab\ud588\ub2e4\uace0": 17, "stylegan\uc740": 17, "controllability\ub97c": 17, "\ubaa8\ub378\uc774\ub77c\ub294": 17, "\uc758\ubbf8\uc788\ub2e4\uace0": 17, "network\ub294": 17, "4x4\uc5d0\uc11c": 17, "1024x1024\uae4c\uc9c0": 17, "\ub192\uc5ec\uc90d\ub2c8\ub2e4": 17, "\uac16\uac8c\ub429\ub2c8\ub2e4": 17, "gan\ud558\uace0": 17, "\ube44\uad50\ud574\uc11c": [17, 21], "\ud2b9\uc774\ud55c": 17, "\uc810\uc774": [17, 18], "z\ub97c": 17, "noise\uc640": 17, "\uc0dd\uac01\ud574\ubcf4\uba74": 17, "\uac70\uccd0\uc11c": 17, "\uad6c\uc870\uc785\ub2c8\ub2e4": 17, "z\ub294": 17, "distribution\uc5d0\uc11c": [17, 22], "\uc0d8\ud50c\ub9c1\uc73c\ub85c": 17, "\uc5bb\uc2b5\ub2c8\ub2e4": 17, "distribution\uc73c\ub85c": 17, "\ubcf4\ub0b4\ub294": 17, "\ubc30\uc6b0\uac8c": 17, "\ub420": [2, 17, 18, 19], "\uac83\uc774\uace0": 17, "\ubd84\ud3ec\ub294": 17, "\uc0dd\uae30\uac8c": 17, "\uc8fc\uc5b4\uc838\uc11c": 17, "\uc5c6\uac70\ub098": 17, "\uc801\uc744": 17, "\uc608\ub97c": [17, 20, 26], "\ub4e4\uc5b4": [17, 20], "\ud53c\ubd80\uac00": 17, "\ud76c\uba74\uc11c": 17, "\uae34": 17, "\uc0d8\ud50c\ub4e4\uc774": 17, "\ud574\ubd05\uc2dc\ub2e4": 17, "\ud53c\ubd80\uc0c9\uacfc": 17, "\uba38\ub9ac": 17, "\uae38\uc774\ub77c\ub294": 17, "feature\ub294": 17, "\uc5bd\ud788\uac8c": 17, "\ud558\ub098\ub97c": [17, 25], "\ubc14\uafc0": [17, 20], "\ud558\ub098\ub3c4": [17, 19], "\ubc14\ub00c\ub294": 17, "\uc77c\uc5b4\ub098\uac8c": 17, "\uc644\ud654\ud558\uae30": 17, "gaussian\uc5d0\uc11c": 17, "learnabl": [9, 17, 23, 27], "w\ub97c": 17, "\uc0ac\uc6a9\ud569\ub2c8\ub2e4": [17, 23], "instanc": [17, 20, 23], "normalization\uc740": 17, "\ucc44\ub110\ub9c8\ub2e4": 17, "\ucde8\ud574\uc8fc\ub294": 17, "normalization\uc5d0": 17, "scale\uc744": [17, 22], "\uacf1\ud574\uc8fc\uace0": 17, "\ub354\ud574\uc8fc\ub294": 17, "vector\uc758": 17, "transformation\uc73c\ub85c": 17, "\uc8fc\uc5b4\uc9c0\ub294": 17, "w\ub294": 17, "\ubcf4\ub0b4\uc9c0\uac8c": 17, "adain\uc758": 17, "\uc218\uc2dd\uc740": 17, "adain\uc740": 17, "\ube14\ub85d\ub9c8\ub2e4": 17, "\uac1c\uc529": 17, "\ub4e4\uc5b4\uac00\uc11c": [17, 19], "style\uc740": 17, "\uc5f4\uc5ec\ub35f": 17, "\ubc88": 17, "adain\uc744": 17, "generator\uc5d0": [17, 19], "\ub4e4\uc5b4\uac00\uac8c": [17, 26], "localization\uc774\ub77c\ub294": 17, "\ud2b9\uc9d5\uacfc\ub3c4": 17, "\uc5f0\uad00\uc774": 17, "\ub9d0\ud558\ub294": 17, "localization\uc774\ub780": 17, "\uc77c\ubd80\ub97c": 17, "\ubc14\uafc8\uc73c\ub85c\uc368": 17, "\ud2b9\uc9d5\ub4e4\uc744": 17, "\uc758\ubbf8\uc785\ub2c8\ub2e4": 17, "\ub2e4\uc74c\uc5d0": 17, "map\ub4e4\uc740": 17, "normalization\ub418\uace0": 17, "style\uc5d0": 17, "\uc758\ud574": [2, 17, 20, 23], "statistics\ub97c": 17, "\uac00\uc9c0\uac8c": 17, "convolution\uc5d0": 17, "\uc801\uc6a9\ub418\uace0": 17, "convolution\uc5d0\uc11c": 17, "normalization\uc774": 17, "\uc218\ud589\ub418\uae30": 17, "layer\uc5d0": 17, "style\uc774": 17, "\ubd84\ub9ac\ub418\uac8c": 17, "\ud559\uc2b5\ub420": [17, 18], "\ucf54\ub4dc": 17, "stylemod": 17, "latent_s": 17, "use_wscal": 17, "lin": 17, "equalizedlinear": 17, "gain": 17, "n_channel": 17, "view": [17, 23, 24, 28], "layerepilogu": 17, "thing": 17, "dlatent_s": 17, "use_nois": 17, "use_pixel_norm": 17, "use_instance_norm": 17, "use_styl": 17, "activation_lay": 17, "noiselay": 17, "activ": 17, "pixel_norm": 17, "pixelnormlay": 17, "instance_norm": 17, "instancenorm2d": 17, "top_epi": 17, "ordereddict": 17, "style_mod": 17, "dlatents_in_slic": 17, "assert": 17, "b\uc758": 17, "style\ub85c": 17, "\ubcc0\uacbd\ud574\uc11c": 17, "\uc774\ubbf8\uc9c0\ub4e4\uc785\ub2c8\ub2e4": 17, "18\uacf3\uc5d0\uc11c": 17, "\uc0ac\uc6a9\ub418\ub294\ub370": 17, "\ucc98\uc74c": [9, 17], "4\uacf3": 17, "coars": 17, "\uadf8\ub2e4\uc74c": 17, "middl": [2, 17, 26, 27], "10\uacf3": 17, "64": [17, 18, 25, 27], "1024": [17, 21, 24, 25], "\uc815\uc758\ud558\uc600\uc2b5\ub2c8\ub2e4": 17, "\uc717": [17, 20], "\ubd80\ubd84\uc5d0\uc11c\ub294": 17, "\ud3ec\uc988\ub098": 17, "\uc2a4\ud0c0\uc77c\uac19\uc774": 17, "\uac08\uc218\ub85d": 17, "\ud2c0\uc744": 17, "\ubd80\ubd84\ub4e4\uc744": 17, "b\uc5d0\uc11c": [17, 25], "\uac00\uc838\uc654\uc74c\uc744": 17, "\uc548\uc5d0\ub294": 17, "\ubc14\ub014": 17, "\uc8fc\uadfc\uae68": 17, "\uba38\ub9bf\uacb0": 17, "\ud53c\ubd80": 17, "\ubaa8\ub378\ub9c1\ud558\uae30": 17, "\ub354\ud574\uc9d1\ub2c8\ub2e4": 17, "\uc548\uc5d0\uc11c\ub3c4": 17, "\ub514\ud14c\uc77c\ub4e4\uc740": 17, "\ub2ec\ub77c\uc9c8": 17, "deviation\uc744": 17, "\uad6c\ud574\ubd24\uc744": 17, "\uc5bc\uad74\ud615\uacfc": 17, "attribute\ub294": 17, "\ubcc0\ud558\uc9c0\uc54a\uc9c0\ub9cc": 17, "noise\uc5d0": 17, "\uc758\ud574\uc11c": [2, 17], "\uba38\ub9ac\uce74\ub77d\uacfc": 17, "\uc0dd\uae40\uc744": 17, "\uc900": [9, 17, 22], "\uc8fc\uc9c0": 17, "\uc5d0\ub9cc": [17, 27], "\uba38\ub9ac\uce74\ub77d\uac19\uc740": 17, "\ub514\ud14c\uc77c\uc774": 17, "\uc81c\ub300\ub85c": 17, "\uc0b4\uc544\uc788\uc9c0": 17, "layers\uc5d0": 17, "\ub4e4\uc5b4\uac04": 17, "\uba38\ub9ac\uce74\ub77d\uc758": 17, "\uc138\ubc00\ud55c": [17, 27], "\ubd80\ubd84\uc5d0": [17, 18, 28], "\ub07c\uce5c\ub2e4\ub294": 17, "localization\uc774": 17, "\ub418\uac8c\ud558\uae30": 17, "mixing\uc774\ub77c\ub294": 17, "\uc55e": 17, "\ucabd": 17, "layer\uc5d0\ub294": 17, "\ub4a4": 17, "generator\uac00": [17, 20], "\uc778\uc811\ud55c": 17, "style\ub07c\ub9ac": 17, "correlated\ub418\uc5b4\uc788\ub2e4\uace0": 17, "\ub9c9\uc544\uc11c": 17, "localization\uc744": 17, "\ub418\uac8c": 17, "\ubaa9\uc801\uc785\ub2c8\ub2e4": [17, 28], "\uc800\uc790\ub4e4\uc774": [17, 18, 25], "\ubc29\ubc95\ub4e4\uc774": [9, 17], "\ud6a8\uacfc\uac00": [17, 27], "\uc788\uc5c8\ub294\uc9c0": 17, "\ud655\uc778\ud574\ubd05\uc2dc\ub2e4": 17, "\ud45c\uc640": 17, "\uc2e4\ud5d8\uc801\uc73c\ub85c": [2, 17, 25], "\ubcf4\uc558\uc744": [17, 20], "\ubc29\ubc95\ub4e4\uc744": 17, "fid\uac00": [17, 18, 22], "variou": [17, 22, 27, 29], "design": [17, 24, 28], "2304": 18, "08466": 18, "jeonghwa": 18, "yoo": 18, "\uc774\ubc88\uc5d0": 18, "\ub9ac\ubdf0\ud560": 18, "\uad6c\uae00": [18, 25], "\ub9ac\uc11c\uce58": 18, "\uadf8\ub8f9\uc5d0\uc11c": 18, "tmlr": 18, "transact": 18, "machin": 18, "research": [18, 26], "2023\uc5d0": 18, "\uc81c\ucd9c\ud55c": 18, "\ub17c\ubb38\uc778": 18, "\uc18d\ub3c4\ub85c": 18, "\ubc1c\uc804\ud558\uace0": 18, "\uc788\ub294\ub370\uc694": [18, 27], "\uc218\uc900\uc774": 18, "\uc5bc\ub9cc\ud07c": 18, "\uc654\ub294\uc9c0": 18, "\ub370\uc774\ud130\uc778": 18, "\ucda9\ubd84\ud55c": 18, "\uc815\ub3c4\uac00": 18, "\ub418\uc5c8\ub294\uc9c0": 18, "augment\ub41c": 18, "\uc815\ub3c4\uae4c\uc9c0": 18, "\uc654\ub294\uc9c0\uc5d0": 18, "\uc2e4\ud5d8\uacfc": 18, "\ub2f5\uc744": 18, "\uc81c\uc2dc\ud569\ub2c8\ub2e4": [18, 20, 26], "\uae00\uc758": 18, "\ubaa9\ucc28\ub294": 18, "\ub0b4\uc6a9\uacfc": 18, "\uad6c\uc131\ud558\uc600\uc2b5\ub2c8\ub2e4": 18, "\uc694\uc57d": 18, "task\uc5d0\uc11c": [18, 20], "augmentation\uc73c\ub85c": 18, "\ubd84\ub958": [18, 20], "tuning\ud558\uc5ec": 18, "\ub2ec\uc131": [18, 19, 21], "imagenet\uc5d0": 18, "tuning\ub41c": 18, "\uc0ac\uc6a9\ud568": [18, 19, 20, 21, 25], "\ud569\uc131": 18, "\uc0ac\uc6a9\ud558\uc600\uc744": 18, "\ub428": [2, 18, 21], "\uae30\uc220\uc801\uc73c\ub85c": 18, "\uc5c4\uccad": 18, "\ub0b4\uc6a9\uc740": 18, "\uc5c6\ub294\ub370\uc694": 18, "\ub2e4\ub9cc": 18, "\uc0ac\uc6a9\ud558\ub358": 18, "\ubc29\ubc95\ub4e4\uacfc\ub294": 18, "imagen\uc744": 18, "\ud588\ub2e4\ub294": 18, "\uc0c8\ub86d\uc2b5\ub2c8\ub2e4": 18, "\uae30\uc220\uc774": [9, 18], "\ubc1c\uc804\ub418\uc5c8\uc2b5\ub2c8\ub2e4": 18, "\ub9cc\ud07c\uc758": [2, 18], "\uc790\uc5f0\uc2a4\ub7ec\uc6b4": 18, "\uc9c8\ubb38\uc774": 18, "\ub2f9\uc5f0\ud558\uace0": 18, "\ucc3e\uace0\uc790": 18, "\uc9c8\ubb38\uc5d0": 18, "\uc774\uc57c\uae30": 18, "imagen\uc774": [18, 25], "ca": 18, "\ud558\uc600\ub2e4": 18, "\ub370\uc774\ud130\uc640": [18, 23, 24, 28], "\uacb0\ud569\ud558\uc5ec": 18, "\ub370\uc774\ud130\uc758": [18, 20], "\uc2dc\uac04\uc774": [18, 22], "\uae38\uc218\ub85d": 18, "\ud5a5\uc0c1\ub418\uc5c8\ub2e4": 18, "\uc788\ub4ef\uc774": [18, 20], "\ub370\uc774\ud130\ub85c\ub9cc": 18, "\uc815\ud655\ub3c4\uc640": 18, "\uc801\ub2e4\ub294": 18, "\uc54c": [9, 18, 20, 25], "\ub354\ud574\uc11c": 18, "\ud559\uc2b5\ud588\uc744": 18, "\ubaa8\ub378\uacfc": 18, "\ubaa8\ub378\ub4e4\uc5d0\uc11c": 18, "\ub54c\ubcf4\ub2e4": 18, "\ud5a5\uc0c1\uc774": 18, "augmentation\uc744": 18, "\ud558\ub824\uace0": 18, "\ud588\ub358": 18, "\ubc29\ubc95\ub4e4\uc5d0": 18, "\uc9e7\uac8c": 18, "\ud590\ub824\uace0": 18, "\ucd5c\uadfc\uc5d0\ub294": 18, "\ubcf4\uac15\ud558\ub294\ub370": 18, "\uc0ac\uc6a9\ub418\uae30": 18, "\uc2dc\uc791\ud588\uc2b5\ub2c8\ub2e4": 18, "\uc608\ub85c": 18, "Is": 18, "readi": 18, "\ub17c\ubb38\uc774": 18, "glide\ub85c": 18, "shot\uacfc": 18, "few": [18, 21], "\uc2dc\ucf30\uc73c\uba70": 18, "glide\ub97c": 18, "\uc138\ud2b8\uac00": [18, 20], "100\uc758": 18, "\uc2dc\ucf30\ub2e4\uace0": 18, "\ud3ec\ud568\ud574\uc11c": 18, "\ub17c\ubb38\ub4e4\uc740": 18, "\uc774\uc6a9\ud574\uc11c": [18, 19], "\ud558\uc5ec\ub3c4": 18, "\uc2dc\ud0a4\uc9c0": 18, "\ubabb\ud588\uc2b5\ub2c8\ub2e4": 18, "\ud558\uc9c0": [18, 19], "\uc54a\uc558\uc2b5\ub2c8\ub2e4": 18, "\ub17c\ubb38\ub4e4\uacfc\ub294": 18, "\ub3d9\uc791\ud558\uace0": 18, "\uc2dc\ud0ac": 18, "\uc6cc\ub099": 18, "\uc4f0\uc5ec\uc11c": 18, "\uc124\uba85\uc740": 18, "\uc0dd\ub7b5\ud558\uace0": 18, "cas\uc5d0": 18, "\uc368\uc838": 18, "\ub0b4\uc6a9\uc73c\ub85c": 18, "\uc18c\uac1c\ud558\uaca0\uc2b5\ub2c8\ub2e4": 18, "cas\ub294": 18, "score\uc640": [9, 18], "\ub9cc\ub4e4\uc5b4\ub0b8": 18, "\uc9c0\ud45c\uc785\ub2c8\ub2e4": 18, "\ub85c\ub9cc": 18, "\ub9cc\ub4e4\uc5b4\ub0c5\ub2c8\ub2e4": 18, "\ub370\uc774\ud130\ub9cc\uc744": 18, "\uc774\uc6a9\ud558\uc5ec": [9, 18, 21], "50\uc744": 18, "\uc2dc\ud0a4\uace0": 18, "cas\uac00": 18, "\ub9cc\uc57d": [2, 18], "imagenet\uacfc": 18, "\ube44\uc2b7\ud558\ub2e4\uba74": 18, "\ubcf4\uc77c": 18, "\uac00\uc815\uc744": [18, 22, 28], "\uc9c0\ud45c\ub77c\uace0": 18, "\uc774\ud574\ud558\uba74": 18, "\uc800\uc790\uc5d0": 18, "\uc758\ud558\uba74": 18, "\uadf8\ub3d9\uc548": 18, "\uc0dd\uc131\ubaa8\ub378\uc758": [9, 18], "\uc54a\uc558\ub2e4\uace0": 18, "\uc0d8\ud50c\ub85c\ub9cc": 18, "\ub5a8\uc5b4\uc84c\uace0": 18, "\ub2f9\uc5f0\ud574\ubcf4\uc785\ub2c8\ub2e4": 18, "\ub5a8\uc5b4\uc84c\ub2e4\uace0": 18, "\uc544\ub9c8\ub3c4": 18, "\ud488\uc9c8": 18, "\ub2e4\uc591\uc131": 18, "\uac83\uc774\ub77c\uace0": 18, "\uc5ec\uae30\uc11c\ub294": [18, 19], "\ud558\uc600\ub294\uc9c0\uc5d0": 18, "\uc124\uba85\uc744": [18, 19], "\ubaa8\ub378\ub85c\ub294": 18, "\uc0ac\uc6a9\ud558\uc600\uc2b5\ub2c8\ub2e4": 18, "\ud074\ub798\uc2a4\uc640": 18, "\uc9c0\uc5d0": 18, "\uace0\ubbfc\uc774": 18, "\ud544\uc694\ud588\ub2e4\uace0": 18, "\uc9e7\uc740": 18, "\ud558\uc600\ub294\ub370": 18, "imagen\uc5d0\uc11c": 18, "\ub2e4\uc591\uc131\uc774": 18, "\uc800\ud558": 18, "\ub418\uba74\uc11c": 18, "\ud604\uc0c1\uc77c": 18, "\ub450\ub2e8\uc5b4": 18, "\ud074\ub798\uc2a4": 18, "\uc774\ub984\uc73c\ub85c": 18, "\uc218\uc815\ud558\uace0": 18, "\ud588\ub2e4\uace0": [9, 18, 22], "tuning\uc774": 18, "\uc774\ubbf8\uc9c0\uace0": 18, "\uc624\ub978\ucabd\uc774": 18, "\uc801\uc6a9\ub418\uc9c0": 18, "imagen\uc785\ub2c8\ub2e4": 18, "\uc544\ub798\uc5d0\uc11c": [18, 20], "\ud074\ub798\uc2a4\uc778": 18, "schipperke\ub97c": 18, "\uc2a4\ud0a4\ud37c\ud0a4\ub77c\ub294": 18, "\uac1c": [18, 20], "\ud488\uc885\uc744": 18, "\uc758\ubbf8\ud558\ub294\ub370": 18, "imagen\uc758": 18, "\uacbd\uc6b0\ub294": [18, 20], "\uaf43\uacfc": 18, "\uc804\ud600": [18, 20], "\uc5c9\ub6b1\ud55c": 18, "\ub9cc\ub4e4\uace0": [18, 19], "\ud588\ub294\uc9c0\ub97c": 18, "\uad6c\uc870\uc5d0\uc11c": 18, "\uc6d0\uc73c\ub85c": 18, "\ud45c\uc2dc\ub41c": 18, "\ub300\ud574\uc11c\ub9cc": 18, "frozen": [18, 25], "\uc6d0\ub798": [18, 20, 22], "imagen\uc5d0\uc11c\ub3c4": 18, "\ubd80\ubd84\uc774\ub77c": 18, "\uc54a\uc558\uace0": 18, "\ucd9c\ub825\uc73c\ub85c": 18, "\uace0\ud574\uc0c1\ub3c4\uc758": 18, "\uc801\uc5b4\uc11c": 18, "210k": 18, "\ud559\uc2b5\ud558\uc600\uace0": 18, "optimizer\uc758": 18, "\uc0ac\uc6a9\ud558\uc600\ub358": 18, "adafactor": 18, "optimizer\ub97c": 18, "\uc0ac\uc6a9\ud558\uc600\ub2e4\uace0": [9, 18], "490k": 18, "\ucd5c\uc801\uc758": 18, "\uc120\ud0dd\uc758": 18, "\uae30\uc900\uc73c\ub85c\ub294": 18, "sampler\uc640": 18, "1k": 18, "10k\uac1c\uc758": 18, "\uc0d8\ud50c\ub4e4\uc5d0": 18, "score\ub97c": [9, 18], "\uacc4\uc0b0\ud588\uc744": 18, "\uc120\ud0dd\ud588\ub2e4\uace0": 18, "\uc815\ud588\ub294\uc9c0\ub97c": 18, "\uc0d8\ud50c\ub9c1\uc758": 18, "\uc18d\ub3c4\ub294": 18, "\ub514\ud4e8\uc804": 18, "\uc2a4\ud15d": 18, "free": [18, 26, 27], "coeffici": 18, "\ub4f1\uc5d0": 18, "\ubc1b\ub294\ub2e4\uace0": 18, "\uac04\ub2e8\ud558\uac8c": [18, 20], "\uc124\uba85\ud558\uba74": 18, "\ud655\ub960\uc801\uc778": 18, "\ub3c4\uc785\ud558\uc5ec": 18, "\ub2e4\uc591\uc131\uc744": 18, "\uc99d\uac00\uc2dc\ud0a4\ub294": 18, "\uc77c\ubc18\uc801\uc73c\ub85c": [18, 19], "\uc7a0\uc7ac": 18, "\uacf5\uac04\uc758": 18, "\ubcf4\uc774\uac8c": 18, "\ub9cc\ub4e4\uba70": 18, "understand": [18, 21, 25], "\ucc38\uace0\ud574\uc8fc\uc138\uc694": 18, "\ubd84\ub958\uae30\ub098": 18, "\uc9c0\ud45c": [18, 22], "\uc678\ubd80": 18, "\uc0ac\uc6a9\ud55c\ub2e4\ub294": 18, "\ubc18\uc601\ud560\uc9c0\ub97c": 18, "\uc758\ubbf8\ud560": 18, "\uc870\uc808\ud558\uc5ec": 18, "\ud2b9\uc131\uc774\ub098": 18, "\uc0dd\uc131\ud558\ub3c4\ub85d": [9, 18, 19], "\ubd84\ud3ec\uc758": 18, "\ubcc0\ub3d9\uc131\uc744": 18, "\uacc4\uc218\ub97c": 18, "\ud3c9\uade0\uacfc": [18, 25], "\ubd84\uc0b0\uc744": [2, 18, 25], "\uc870\uc808\ud568\uc73c\ub85c\uc368": 18, "\ub85c\uadf8": 18, "\ud63c\ud569": 18, "\uacc4\uc218\ub294": 18, "\uc0ac\uc6a9\ub418\uba70": 18, "\uc758\ubbf8\ud558\uace0": 18, "\uc758\ubbf8\ud568": 18, "\uc0dd\uc131\uc758": 18, "\uc124\uc815\ubc95\uc5d0": 18, "\uc124\uba85\ud558\uaca0\uc2b5\ub2c8\ub2e4": [18, 22], "\uc804\ubc18\uc801\uc778": [18, 19, 23], "\ud2b9\uc9d5\uacfc": 18, "\ub2e4\uc591\uc131\uc758": 18, "\uc8fc\uac8c": [18, 20], "1\ucc28": [2, 18], "sweep\uc73c\ub85c": 18, "ddpm": [2, 9, 18, 25], "\uc0d8\ud50c\ub7ec\ub97c": 18, "50k\uc5d0": 18, "\ud558\uc774\ud37c\ud30c\ub77c\ubbf8\ud130\ub97c": 18, "\ucc3e\uc2b5\ub2c8\ub2e4": 18, "sweep\uc758": 18, "\ud558\uc774\ud37c\ud30c\ub77c\ubbf8\ud130\uc758": 18, "\ubc94\uc704\ub294": 18, "75": 18, "128": [18, 20, 24], "sweep": 18, "fid\ub294": 18, "variance\ub294": 18, "1000\uc774\uc5c8\uc744": 18, "\ub54c\ub77c\uace0": 18, "sweep\uc774": 18, "\ub05d\ub09c": 18, "\ud6c4\uc5d0\ub294": 18, "weight\uc5d0": 18, "sweep\uc744": 18, "\ub54c\uc5d0\ub294": [18, 21], "2m": 18, "guidacn": 18, "cas\ub97c": 18, "\uce21\uc815\ud588\ub2e4\uace0": 18, "\ud558\uc774\ud37c\ud30c\ub77c\ubbf8\ud130\uc5d0": 18, "sweep\uc5d0": 18, "\uacb0\uacfc\uace0": 18, "\uac00\uc6b4\ub370\uc640": 18, "2\ucc28": 18, "\uacb0\uacfc\ub85c": 18, "\ub098\ud0c0\ub0b8": 18, "\uc774\uc81c": 18, "\ub2e4\uc74c\uc73c\ub85c\ub294": 18, "\uc120\ud0dd\ud558\ub294": [18, 21], "range\ub294": 18, "30": [18, 26], "denos": 18, "129": 18, "\uadf8\ub798\ud504\ub294": 18, "\uc124\uc815\ud558\uace0": [18, 23], "\ubcc0\uacbd\ud588\uc744": 18, "cas\uc758": 18, "\uadf8\ub798\ud504\ub97c": 18, "\uadf8\ub798\ud504\uc785\ub2c8\ub2e4": 18, "logvar": [18, 28], "coeff\uac00": 18, "3\uc77c": 18, "\ubcf4\uc600\uc73c\uba70": 18, "\uacbd\uc6b0\ub3c4": [18, 20], "\ubcf4\uc778": [18, 21], "\ubd84\uc11d\ud574\ubcf4\uc790\uba74": 18, "\uc0c1\uad00\uad00\uacc4\uac00": 18, "weight\uac00": 18, "\ub192\uc544\uc9c0\uc9c0\ub9cc": 18, "score\uc5d0\ub294": 18, "\ubd80\uc815\uc801\uc778": 18, "\uc8fc\uba70": [18, 25], "augmentation\uc774": 18, "0\uc77c": 18, "\ud558\uc774\ud37c\ud30c\ub77c\ubbf8\ud130": 18, "\uc124\uc815\ud55c": 18, "\uac19\ub2e4\uace0": 18, "\ubca0\uc774\uc2a4": 18, "\ub098\uba38\uc9c0": [18, 19, 26], "sampler": 18, "\ud569\uc131\uc740": 18, "\ud504\ub85c\ud1a0\ucf5c\uc744": 18, "\ub530\ub790\ub294\uc9c0\uc5d0": 18, "balance\ub97c": 18, "\uc720\uc9c0\ud558\uba70": 18, "\ud569\uc131\ud588\uc73c\uba70": 18, "\ud569\uc131\ub41c": 18, "\uaddc\ubaa8\ub294": 18, "1\ubc30\uc778": 18, "10\ubc30\uc778": 18, "12m": 18, "\ubc94\uc704\ub97c": 18, "\uac00\uc9c0\ub3c4\ub85d": 18, "\ud569\uc131\ud588\ub2e4\uace0": 18, "\ud0dc\uc2a4\ud06c\uc5d0\uc11c": 18, "\uc9c0\ud45c\uc778": 18, "is\uc758": 18, "\uad00\uc810\uc73c\ub85c": 18, "\ubd05\ub2c8\ub2e4": 18, "\ud45c\uc5d0\uc11c": 18, "\ud30c\uc778": 18, "\ud29c\ub2dd\ub41c": 18, "\ubca0\uc774\uc2a4\ubaa8\ub378\ub4e4": 18, "is\uac00": 18, "resolution\uacfc": 18, "resolution\uc5d0\uc11c": 18, "\ud574\ub2f9\ub418\uc5c8\uc2b5\ub2c8\ub2e4": 18, "\ud655\uc778\ud558\ub294": 18, "5\uc5d0\uc11c": [18, 19, 20], "\ud30c\ub780\uc0c9": [2, 18], "\uc131\ub2a5\uc774\uace0": 18, "\ube68\uac04\uc0c9": 18, "\uc131\ub2a5\uc785\ub2c8\ub2e4": 18, "\ubca0\uc774\uc2a4\ub77c\uc778": 18, "cdm": 18, "\uadf8\ub9bc\uc774\uba70": 18, "\uac00\uc6b4\ub370\ub294": 18, "\uc624\ub978\ucabd\uc740": 18, "\ubd80\ubd84\ubcf4\ub2e4": 18, "\uc704\ucabd\uc5d0": 18, "\uc704\uce58\ud558\uba74": 18, "\ud574\uc11d\ud560": 18, "\ubca0\uc774\uc2a4\ub77c\uc778\ubcf4\ub2e4": 18, "\ubcf4\uc778\ub2e4\ub294": 18, "2\uc5d0\uc11c\ub3c4": 18, "\uc8fc\ubaa9\ud560": 18, "\ub9cc\ud55c": 18, "resnet50\uc774": 18, "256x256\uc73c\ub85c": [9, 18], "\ub2e4\uc6b4\uc0d8\ud50c\ub9c1": 18, "\ud568\uc5d0\ub3c4": 18, "\uc88b\ub2e4\ub294": [18, 26], "our": [18, 19, 21, 29], "resolution\ubcf4\ub2e4": 18, "resolution\uc758": 18, "\uc6d4\ub4f1\ud788": [18, 26], "\ub192\uc74c": [18, 25], "\uc885\ub958\uc758": 18, "\uc2dc\ucf30\uc744": 18, "cas\uc640": 18, "cas\uc5d0\uc11c\ub294": 18, "resnet50": 18, "\ud655\uc778\ud588\uc9c0\ub9cc": [18, 27], "\uc774\uc678\uc5d0": 18, "\ubaa8\ub378\ub85c\ub3c4": 18, "\ubcf8\ub2e4\ub294": 18, "\ucc28\uc774\uc810\uc774": 18, "\uc0b4\ud3b4\ubcf8": 18, "\ub0ae\uc558\uc9c0\ub9cc": 18, "\ud569\uccd0\uc11c": 18, "\ub370\uc774\ud130\ub9cc": 18, "\uc99d\uac00\ud55c": [2, 18], "onvnet\uae30\ubc18": 18, "\uc591\uc0c1\uc744": 18, "\ubcf4\uc600\uc2b5\ub2c8\ub2e4": 18, "\uaddc\ubaa8\uc5d0": 18, "50\uc758": 18, "\ubd84\uc11d\ud55c": 18, "\uc99d\uac00\ud568\uc5d0": 18, "\uc9c0\uc18d\uc801\uc73c\ub85c": 18, "8m": 18, "\uaddc\ubaa8\uac00": [9, 18], "\ub54c\uae4c\uc9c0\ub294": 18, "\uc88b\uc558\uc73c\ub098": 18, "\uc774\uc0c1\uc758": 18, "\ub418\uc5c8\uc744": 18, "\uc624\ud788\ub824": 18, "\uacb0\ub860": 18, "\ubcf4\uc790\uba74": 18, "sclae": 18, "\ud30c\uc778\ud29c\ub2dd\ud558\uc5ec": 18, "\uc9c0\ud45c\uc5d0": 18, "\ub2ec\uc131\ud588\uc2b5\ub2c8\ub2e4": 18, "76": 18, "239": 18, "96": 18, "69": 18, "\uadf8\ub807\uac8c": [18, 19], "resnet\uacfc": 18, "accuracy\ub97c": 18, "\uc2dc\ucf30\uc2b5\ub2c8\ub2e4": 18, "\uacb0\uacfc\uc5d0": [18, 20], "\uc0dd\uac01\ud574\ubcfc\ub9cc\ud55c": 18, "\uac70\ub9ac\ub4e4\uc774": 18, "\uc788\uc5c8\ub294\ub370": 18, "\ud558\ub098\ub294": 18, "\uce21\uc815\ud560": 18, "\uc785\ub825\uc744": 18, "256x256\ubcf4\ub2e4": 18, "1024x1024\uc758": 18, "\ub2e4\uc6b4\uc0d8\ud50c\ub9c1\uc744": 18, "\ud558\ub354\ub77c\ub3c4": 18, "resolution\uc774": 18, "\ud074": 18, "\ub2f4\ub294\ub2e4\ub294": 18, "\uac83\uc77c": 18, "\uc815\ud655\ub3c4\uac00": 18, "\uc99d\uac00\ud588\uc9c0\ub9cc": 18, "\ub370\uc774\ud130\uc5d0\uc11c\ub294": 18, "\uadf8\ub807\uc9c0": 18, "\uc54a\uc558\ub358": 18, "\uace0\ud574\uc0c1\ub3c4\uc5d0": 18, "\uc815\uad50\ud55c": 18, "\ud544\uc694\ud560": [18, 20], "\uc2dc\uc0ac\ud558\uace0": 18, "\ub9ac\ubdf0\ub97c": 18, "\ub9c8\uce58\uaca0\uc2b5\ub2c8\ub2e4": 18, "\ub290\ub080": 18, "\uc0b0\uc5c5\uc5d0\uc11c\ub294": 18, "shortage\ub098": 18, "imbal": 18, "\ubc1c\uc0dd\ud558\ub294\ub370": 18, "\ud574\uacb0\ubc95": 18, "\ud558\ub098\uac00": [18, 22], "\uac19\ub2e4\ub294": 18, "\ub4e4\uc5c8\uc2b5\ub2c8\ub2e4": 18, "\ud30c\uc778\ud29c\ub2dd\uc774": 18, "\ub418\uc9c0": 18, "\uc0b0\uc5c5\uc5d0\uc11c\ub9cc": 18, "\ud14d\uc2a4\ud2b8\uac00": 18, "\uc788\uc744\uae4c": 18, "\ud569\uc131\ud558\uace0\uc790": 18, "\ub370\uc774\ud130\uc14b\uc5d0": [2, 18, 20], "\ud30c\uc778\ud29c\ub2dd\uc744": 18, "\ud574\uc57c\ud558\ub294": 18, "\uaf64\ub098": 18, "\ubd88\ud3b8\ud560": 18, "\uac19\uc544\uc11c": 18, "\uac16\ub294\uc9c0": 18, "\uc788\uc5c8\uc73c\uba74": 18, "\uc88b\uc558\uc744": 18, "\uac1c\uc778\uc801\uc778": 18, "\uc720\ucd94\ud574\ubcfc": 18, "\uc21c": 18, "\uc788\uc9c0\ub9cc\uc694": 18, "worth": 19, "2208": [19, 23], "01618": 19, "devocean": 19, "techboarddetail": 19, "page": 19, "id": 19, "164320": 19, "boardtyp": 19, "writer": 19, "searchdata": 19, "sam56903": 19, "subindex": 19, "idlist": 19, "pnwriterid": 19, "kwang": 19, "su": 19, "mun": [19, 20], "5\uc7a5\uc73c\ub85c": 19, "\uac1c\ub150": [19, 23], "\ucf58\uc149\ud2b8": 19, "concept": [19, 25], "\ubf51\uc544\ub0b4\ub294": 19, "\uc790\uc5f0\uc5b4\ub97c": 19, "creation\uc5d0": 19, "\uc804\ub840\uc5c6\ub294": 19, "\uc790\uc720\ub3c4\ub97c": 19, "contept\ub97c": 19, "\uadf8\uac83\uc758": 19, "\ubc14\uafb8\uac70\ub098": 19, "\uc5ed\ud560\uc774": 19, "\uc8fc\uc5b4\uc9c0\uac70\ub098": 19, "\ucc38\uc2e0\ud55c": 19, "\uc7a5\uba74\uc774": 19, "\uadf8\ub824\uc9c0\ub294\uac74": 19, "\ubd88\ubd84\uba85\ud558\ub2e4": 19, "\uc774\uac83\uc744": 19, "\uadf8\ub824\uc918": 19, "\ub9d0\ud560": 19, "\uc774\uac83": 19, "\uac83\uc774\ub0d0\ub294": 19, "\ubb3c\uc74c\uc5d0\ub294": 19, "\uac19\ub2e4": [9, 19, 22, 25], "5\uac1c\ub9cc\uc73c\ub85c": 19, "\uc0ac\ubb3c\uc774\ub098": 19, "\uc790\uc5f0\uc5b4": 19, "\ubb38\uc7a5\uc5d0": [19, 20], "\ub179\uc544\ub4e4\uc5b4\uac00": 19, "\uc9c1\uad00\uc801\uc778": 19, "\uc774\ub04c\uc5b4": 19, "\ub3c5\uc790\uc801\uc774\uba74\uc11c": 19, "\ucf58\uc149\ud2b8\ub97c": 19, "capture\ud558\uae30": 19, "\uc704\ud574\uc11c\ub294": [2, 19, 21, 27], "\ucda9\ubd84\ud558\ub2e4\ub294": 19, "\uc54c\uac8c": 19, "\ub418\uc5c8\ub2e4": 19, "\ub300\uaddc\ubaa8": [9, 19], "\uac1c\ub150\uc744": 19, "\ub3c4\uc785\ud558\ub294": [19, 20], "\uc77c\uc740": 19, "\uc77c\uc774\ub2e4": 19, "\uac1c\ub150\uc5d0": 19, "\ud655\uc7a5\ub41c": 19, "retraining\ud558\ub294": 19, "\uc5c4\uccad\ub098\uac8c": 19, "\ube44\uc6a9\uc774": 19, "\ub4e4\uace0": 19, "\uc608\uc81c\uc5d0": 19, "\uce58\uba85\uc801\uc778": 19, "\ub9dd\uac01\uc744": 19, "\ucd08\ub798\ud55c\ub2e4": 19, "\uacf5\uac04\uc5d0\uc11c": 19, "\ub2e8\uc5b4\ub97c": 19, "\uadf9\ubcf5\ud560": 19, "figure\uc5d0\uc11c": 19, "\uc9c0\ub098\uba74\uc11c": 19, "508": 19, "701": 19, "set\uc73c\ub85c": [19, 21], "\ubcc0\ud658\ub418\uace0": 19, "\ud1a0\ud070\uc740": [19, 21], "\uc790\uccb4": 19, "\ubca1\ud130\ub294": 19, "\ub2e4\uc6b4\uc2a4\ud2b8\ub9bc": 19, "\uc81c\uacf5\ub428": 19, "concept\ub97c": 19, "\ub098\ud0c0\ub0b4\ub294": [19, 20], "word\uc778": 19, "\ub098\ud0c0\ub0b8\ub2e4": [2, 19], "vector\ub294": 19, "\ub2e8\uc5b4\uc640": 19, "\ucc98\ub9ac\ub418\uba70": 19, "query\ub97c": 19, "\uad6c\uc131\ud558\ub294\ub370": 19, "query\ub294": 19, "\uc758\ub3c4\ud55c\ubc14\uc640": 19, "\uc77c\uce58\ud558\ub3c4\ub85d": 19, "\uadf8\ub9bc\uc774\ub77c\uace0": 19, "\uc0dd\uc131\ubaa8\ub378": 19, "ldm\uc774": 19, "\uc4f0\uc784": 19, "untouched\ub418\uc5b4": 19, "\ub530\ub85c": [19, 20], "\ub4e4\uc5b4\uac00\uc9c0": 19, "\uc54a\ub294\ub4ef\ud568": 19, "\ud568\uc73c\ub85c\uc368": [2, 19, 26], "\uc190\uc2e4\ub418\ub294": 19, "text\uc5d0": [19, 21], "\uc774\ud574\ub3c4\ub098": 19, "generalization\uc744": 19, "\uc720\uc0ac\ub2e8\uc5b4": 19, "\ucc3e\uae30": 19, "inversion\uc2dc\ucf1c": 19, "\ud504\ub808\uc784\ud654": 19, "5\uac1c\uc758": [2, 19], "set\uc774": [19, 20], "\uc8fc\uc5b4\uc9c4\ub2e4": 19, "\ubb38\uc7a5\uc744": 19, "\uc124\uc815\ud574": 19, "\uc7ac\uad6c\uc131": 19, "\uc774\uc5b4\uc9c0\ub294": 19, "\ucc3e\ub294": [19, 20], "concept\uc778": 19, "\ud55c\ub2e4\uace0": 19, "found": 19, "palavra": 19, "\ubc14\uafb8\ub294\ub370": 19, "\ucd94\uc815": 19, "object\uc758": 19, "\ubcf5\uad6c": 19, "segmentation\uc744": 19, "palavra\ub294": 19, "\uac1c\uccb4\ub97c": 19, "\ucc38\uc870\ud558\ub294": 19, "clip\uc758": 19, "word\ub97c": 19, "\uc2dd\ubcc4\ud568": 19, "\uac80\uc0c9\uc744": 19, "\uc124\uba85\ud558\uac70\ub098": 19, "\uc7a5\uba74\uc5d0\uc11c": 19, "\ubd84\ud560\ud558\uae30": 19, "\uc0ac\uc6a9\ub428": 19, "\ubcf4\ub4ef\uc774": 19, "\uadf8\ub7f4\ub4ef\ud55c": 19, "\ud569\uc131\uc5d0": [19, 25], "\ucea1\ucc98\ud558\uc9c0": 19, "goal": 19, "specifi": 19, "\uc758\uc5ed": 19, "\uc758\ub3c4\ud55c": 19, "\ucd08\ucca8\uc744": 19, "\ub9de\ucd98": 19, "embedding\uc73c\ub85c": 19, "\uac00\uc774\ub4dc\ud574\uc11c": 19, "\uad1c\ucc2e\uc740": [2, 19], "\uc131\uacfc\ubb3c\uc744": 19, "representation\uc73c\ub85c": 19, "\uc778\ucf54\ub529\ud558\ub294\ub370": 19, "\ucd08\uc810\uc744": [19, 20, 28], "\ub9de\ucda4": 19, "model\uc5d0\uc11c\ub294": 19, "representation\uc5d0": 19, "\ud6c4\ubcf4\uad70\uc744": 19, "\ucc3e\ub294\ub2e4": 19, "\uadf8\ub7ec\ub098": 19, "depth": [19, 20, 22, 27], "visual": [19, 25, 27], "understanding\uc744": 19, "\ud544\uc694\ub85c": 19, "\uc54a\ub294\ub2e4": [19, 22], "\uc0dd\uc131\uc790\uac00": 19, "\uadf8\ub9b0\ub2e4": 19, "inversion\uc5d0\uc11c": 19, "\uc601\uac10\uc744": 19, "\uc81c\uc2dc": [2, 19, 22], "\ucd9c\ucc98": [19, 20], "hyoseok": 19, "tistori": [2, 19], "entri": 19, "vector\ub97c": 19, "vector\ub85c\ubd80\ud130": 19, "\uc774\uc758": 19, "\uc5ed\uacfc\uc815\uc73c\ub85c\uc368": 19, "gan\uc758": [19, 20], "inverting\uc2dc\ucf1c": 19, "\uc54c\uc544\uac00\ub294": 19, "\uc0dd\uc131\ubaa8\ub378\ub85c\uc11c": 19, "\ub9d0\ud588\ub4ef\uc774": 19, "\uac74\ub4e4\uc9c0": 19, "\uc785\ub825\ub41c": [19, 26], "\ubb38\uc790\uc5f4\uc758": 19, "\ud558\uc704": 19, "\ub2e8\uc5b4\ub294": 19, "\ud1b5\uacfc\ud558\uba70": 19, "\uc815\uc758\ub41c": 19, "dictionary\uc5d0\uc11c": 19, "token\uc73c\ub85c": [9, 19], "\ubcc0\ud658\ud568": 19, "\ucc3e\uc744": 19, "\uace0\uc720\ud55c": 19, "\ubca1\ud130\uc5d0": 19, "\uc5f0\uacb0\ub428": 19, "index\uc5d0": 19, "encoder\uc778": 19, "c_\u03b8\uc758": 19, "\uc77c\ubd80\ub85c": 19, "target\uc73c\ub85c": 19, "\uc0bc\uc558\uc74c": 19, "\ub098\ud0c0\ub0b4\uae30": 19, "\uc790\ub9ac\ud45c\uc2dc\uc790": 19, "\ubb38\uc790\uc5f4\uc778": 19, "\uc9c0\uc815\ud568": 19, "palavra\ub97c": 19, "\ucd94\uc815\ud568": 19, "process\uc5d0": 19, "\uac1c\uc785\ud574\uc11c": 19, "tokenize\ub41c": 19, "\ubb38\uc790\uc5f4\uacfc": 19, "\ub300\uccb4\ud558\uc5ec": 19, "\ubcf8\uc9c8\uc801\uc73c\ub85c": 19, "\uc5b4\ud718": 19, "\uc8fc\uc785\ud568": 19, "5\uc7a5": 19, "\ud3ec\uc988\uc640": 19, "\uc124\uc815\uc5d0": 19, "\uac78\uccd0": 19, "\ubb18\uc0ac\ud568": 19, "\ucd5c\uc18c\ud654\ud558\ub294": [19, 24, 27], "v\ub97c": 19, "\ucd5c\uc801\ud654\ud568": 19, "\uace0\uc815\ud558\uae30": 19, "\ud15c\ud50c\ub9bf\uc5d0\uc11c": 19, "\ud30c\uc0dd\ub41c": 19, "\uc911\ub9bd": 19, "\ucee8\ud14d\uc2a4\ud2b8": 19, "\uc5ec\uae30\uc5d0\ub294": 19, "rendit": [19, 23], "\ud615\uc2dd": 19, "\ud504\ub86c\ud504\ud2b8\uac00": 19, "\ud3ec\ud568\ub41c\ub2e4": 19, "\uc544\ub9c8": [19, 25], "\uc6d0\ubcf8\uacfc": 19, "\ube44\uad50\ud558\uae30": 19, "\ubaa9\uc801\uc774": 19, "\uc544\ub2d0\uae4c": 19, "\uc2f6\uc74c": 19, "\ubaa9\ud45c\uc2dd\uc740": 19, "\uac19\uc74c": [2, 19, 20], "loss\ud568\uc218\uc640": 19, "\uc720\uc0ac\ud568": 19, "c\u03b8\uc640": 19, "e\u03b8\ub294": 19, "\ubbf8\uc138\ud55c": 19, "\ud3ec\ucc29\ud560": 19, "\uc788\uc744\uac83\uc73c\ub85c": 19, "\uae30\ub300\ud568": 19, "\ud3ec\ucc29\ud558\ub294": 19, "\uc720\uc0ac\ud558\uba74\uc11c\ub3c4": 19, "guide\uc5d0": 19, "\ub9de\ucdb0\uc11c": 19, "\uc9c4\ud589\ud568": 19, "\uc8fc\uc81c\uc5d0": 19, "\uc815\ud655\ud558\uac8c": 19, "\ubcf4\uc874\ud558\uace0": 19, "\uc784\ubca0\ub529\uacfc": 19, "\ucea1\uc158\ub4e4\uc5d0": 19, "\ucd94\ub860\uc774": 19, "\uac00\ub2a5\ud588\uc74c": 19, "\ub370\uc774\ud130\uc14b\uc73c\ub85c\ub3c4": 19, "\ubcf4\uc874\ud558\uba74\uc11c": 19, "\ud45c\ud604\ud55c": 19, "\uc0ac\uc9c4\uc5d0\uc11c\uc640": 19, "\uc758\uc0ac": 19, "\ubc31\uc778": 19, "\ub0a8\uc131": 19, "\uc758\uc0ac\ub97c": 19, "\uadf8\ub824\ub0c8\uc74c": 19, "\ub9ce\uc558\uc74c\uc744": 19, "imageset\uc5d0\uc11c": 19, "\uc778\uc885\uc801": 19, "\ub2e4\uc591\uc131\uc5d0": 19, "\uc778\uc2dd\uc744": 19, "embedding\uc758": 19, "y\ucd95": 19, "\ubcf5\uc81c\ud558\ub294\uc9c0": 19, "\ubcc0\ud615\uc744": [9, 19], "\uc0dd\uc131\ud558\ubbc0\ub85c": 19, "\uac70\ub9ac\ub97c": 19, "\uace0\ub824\ud558\uc5ec": 19, "\uc720\uc0ac\uc131\uc744": 19, "\ucee8\uc149\uc5d0": 19, "64\uac1c\uc758": 19, "x\ucd95": 19, "\ub09c\uc774\ub3c4\uc640": 19, "\uc124\uc815\uc758": 19, "\uc77c\ub828\uc758": 19, "\ubcc4\ub85c": 19, "prompt\uc758": 19, "embedding\uc5d0\uc11c": 19, "similarity\ub97c": 19, "\uc2a4\ucf54\uc5b4\ub294": 19, "capability\uc640": 19, "\uc2e0\ub8b0\ub3c4\ub97c": 19, "\ubcf4\uc5ec\uc90c": [19, 20, 25], "\ud658\uacbd": 19, "\ub530\ub984": 19, "\uc0dd\ub7b5": 19, "evaluation1": 19, "baseline\uacfc": 19, "quality\ub294": 19, "set\uc5d0\uc11c": 19, "\uc784\uc758\uc758": [2, 9, 19], "\uc0d8\ud50c\ub9c1\ud558\ub294": 19, "\uc5c6\uc5c8\ub2e4": [19, 20], "\ub2ec\uc131\ud558\uace0": 19, "baseline\uc5d0\uc11c": 19, "editablity\uc744": 19, "space\uc758": 19, "\uc778\uc0c1\uc801\uc778": [19, 20, 27], "\uc720\uc5f0\uc131\uc744": 19, "\ub098\ud0c0\ub0b4\uace0": 19, "\ub2e8\uc77c": [9, 19, 20], "word\ub9cc": 19, "\uc815\ud655\ub3c4\ub85c": 19, "\ucea1\ucc98\ud558\ub294\ub370": 19, "distort": 19, "tradeoff": 19, "\uace1\uc120\uc758": 19, "outline\uc744": 19, "\uadf8\ub9ac\uba70": 19, "\uc218\uc815\ub420": 19, "target\uc758": 19, "\ucea1\ucc98\ud558\uc9c0\ub294": 19, "\ubc18\ub300\ub85c": 19, "\uba40\ub9ac": 19, "\ubc97\uc5b4\ub098\uba74": 19, "editability\uac00": 19, "\uac10\uc18c\ud558\ub294": 19, "reconstruction\uc774": 19, "rate\ub97c": [19, 20], "\ubcc0\uacbd\ud574": 19, "\uace1\uc120\uc744": 19, "\uc774\ub3d9\ud560": 19, "\uc788\uc73c\ubbc0\ub85c": 19, "\uc0ac\uc6a9\uc790\uc5d0\uac8c": 19, "tradeoff\uc5d0": 19, "\uc815\ub3c4\uc758": 19, "\uc81c\uc5b4\ub97c": 19, "\uc81c\uacf5\ud568": 19, "description\uc744": 19, "\ud3ec\ucc29\ud558\uc9c0": 19, "\ubabb\ud558\uba74\uc11c\ub3c4": 19, "\uac10\uc18c\ud568": 19, "\uc124\ubb38\uc9c0": 19, "\uc81c\uacf5\ubc1b\uc558\uace0": 19, "\uc774\ubbf8\uc9c0\uc640\uc758": [19, 23], "\uc720\uc0ac\uc131\uc5d0": 19, "\uc21c\uc704\ub97c": 19, "\ub9e4\uae40": 19, "context\ub97c": [9, 19], "\uc9c8\ubb38\ubcc4\ub85c": 19, "600\uac1c\uc529": 19, "200\uac1c\uc758": 19, "\uc751\ub2f5\uc744": 19, "\uc218\uc9d1": [19, 26], "\uc81c\uacf5\ud558\uc9c0\ub9cc": 19, "\uc758\ubbf8\ub860\uc801\uc778": 19, "\ubcf8\uc9c8\uc744": 19, "\ud30c\uc545\ud558\uac70\ub098": 19, "shape\ub97c": 19, "\ud55c\uacc4": [2, 19], "\ucd5c\uc801\ud654\uac00": 19, "\uc624\ub798": 19, "\uac78\ub9b0\ub2e4": 19, "2\uc2dc\uac04\uc774": 19, "\uc18c\uc694\ub428": 19, "\uc124\uc815\uacfc": 19, "\ud65c\uc6a9\ud558\ub294": 19, "\uac1c\uc778\ud654\ub418\uba70": 19, "generation\uc744": 19, "\uc18c\uac1c\ud568": 19, "word\ub85c": 19, "inverse\ud558\uc5ec": 19, "\uc791\ub3d9\ud568": 19, "word\ub294": 19, "\uc7a5\uba74\uc5d0": 19, "\uac04\ub2e8\ud558\uace0": 19, "\uc758\ubbf8\uc5d0\uc11c": 19, "\ud3b8\uc9d1\ud558\uae30": 19, "\uc27d\ub3c4\ub85d": 19, "interpace\ub97c": 19, "\uc0ac\uc6a9\ud558\uc9c0\ub9cc": 19, "\uc790\uc5f0": 19, "\uc5b8\uc5b4\uc758": 19, "\ud55c\uacc4\uc5d0": 19, "\uc811\uadfc\ud560": 19, "\ub2e8\uc11c\ub97c": 19, "\uacf5\uac1c\uc801\uc73c\ub85c": 19, "\uc0ac\uc6a9\uac00\ub2a5\ud55c": 19, "model\uc778": 19, "\uad6c\ud604\ub428": 19, "\uc544\ud0a4\ud14d\ucc98": 19, "\uc815\ubcf4\uc5d0": 19, "\uc758\uc874\ud558\uc9c0": 19, "\uc801\uc6a9\ud560": 19, "\uc0dd\uac01": 19, "\uac70\uae30\uc5d0\uc11c": 19, "preserav": 19, "\ud5a5\uc0c1\ub420": 19, "unpair": 20, "translat": [2, 20], "iccv": 20, "2017": 20, "1703": 20, "10593": 20, "tensorflow": 20, "tutori": 20, "\ub17c\ubb38\ub9ac\ubdf0": 20, "cyclegan\uc744": 20, "\uc0ac\ub78c\uc774": [20, 25, 26], "\ud55c\uad6d\uc778\uc774\ub77c\uace0": 20, "\ub72f\uc5b4\ubcf4\uae30": 20, "kwangsu": 20, "\ub3c4\uba54\uc778\uc744": 20, "\ub3c4\uba54\uc778\uc73c\ub85c": 20, "\ubcc0\ud658\uc2dc\ud0a4\ub294": 20, "vision\uc758": 20, "translation\uc740": 20, "input\uacfc": 20, "\uc9dd\uc774": 20, "\uc9c0\uc5b4\uc9c4": 20, "\uc5bb\ub294": [20, 25], "\uc5b4\ub835\uc2b5\ub2c8\ub2e4": 20, "\uc9dd\uc9c0\uc5b4\uc9c4": 20, "x\ub77c\ub294": 20, "domain\uc73c\ub85c\ubd80\ud130": 20, "\uc5bb\uc740": 20, "domain": [2, 20], "y\ub85c": 20, "\ubc14\uafb8\ub294": [20, 23], "\uc5f0\uad6c\ub294": 20, "\ubd84\ud3ec\uc640": 20, "y\ub85c\ubd80\ud130\uc758": 20, "\uad6c\ubd84\uc774": 20, "\ubd88\uac00\ub2a5\ud558\ub3c4\ub85d": 20, "y\ub85c\uc758": 20, "mapping\uc5d0": 20, "\uc81c\uc57d\uc744": 20, "\uac00\ud574\uc11c": 20, "\uac15\uc81c\ud558\uae30": 20, "\uc5ed\ubc29\ud5a5": 20, "\ub9e4\ud551\uc744": 20, "\uc9c4\ud589\ud558\uace0": 20, "\uc720\uc0ac\ud574\uc9c0\ub3c4\ub85d": 20, "\uac15\uc81c\ud558\ub294": 20, "\ub3c4\uc785\ud588\uc2b5\ub2c8\ub2e4": 20, "pair\uac00": 20, "\ubcf4\uc5ec\uc92c\ub2e4\uace0": 20, "image\ub85c": [9, 20], "\uadf8\ub9bc\uc73c\ub85c": 20, "\ubcc0\ud658\ud55c\ub2e4\uac70\ub098": 20, "\ub0ae\uc5d0": 20, "\ucc0d\uc740": 20, "\ubc24\uc5d0": 20, "\ud754\ud788": 20, "output\uc73c\ub85c": 20, "\ubc14\ud0d5\uc73c\ub85c": 20, "\uc774\ub8e8\uc5b4\uc838": [2, 9, 20, 28], "\uc788\uc5c8\ub294\ub370\uc694": 20, "\uc5b4\ub835\uace0": [2, 20], "\ube44\uc2fc": 20, "\uc77c\uc774": 20, "image\uac00": [9, 20], "\uc77c\ub300\uc77c\ub85c": 20, "\uc9dd\uc9c0\uc5b4\uc9c0\uc9c0": 20, "\ubaa8\uc74c\uc758": 20, "\ucea1\uccd0\ud558\uace0": 20, "\ubaa8\uc74c\uc73c\ub85c": 20, "\ubcc0\ud658\ud560": 20, "x\uc5d0": 20, "\uc138\ud2b8": 20, "y\uc5d0": 20, "\uc81c\uacf5\ub418\uace0": 20, "output\uacfc": [20, 22], "y\uac00": 20, "discriminator\uc5d0": 20, "\uad6c\ubcc4\ud560": 20, "\uc5c6\ub3c4\ub85d": 20, "y\ub97c": [20, 22], "\ud559\uc2b5\ud569\ub2c8\ub2e4": [20, 27], "\uc774\uac8c": 20, "\uac1c\ubcc4": 20, "\ubb34\uc870\uac74": 20, "\uc720\uc758\ubbf8\ud558\uac8c": 20, "\uc30d\uc744": 20, "\uc774\ub8ec\ub2e4\ub294": 20, "\ub73b\ud558\uc9c0\ub294": 20, "g\uac00": 20, "image\uc5d0\ub294": 20, "\ubb34\ud55c\ud55c": 20, "\uc218\uac00": [2, 20, 21], "\ub54c\ubb38": [2, 20], "collapse\uac00": 20, "\uc77c\uc5b4\ub098\uae30\ub3c4": 20, "dl": 20, "blogspot": 20, "08": [2, 20], "problem": 20, "image\ub4e0": 20, "\ub9e4\ud551\ud558\uba74\uc11c": 20, "\ucd5c\uc801\ud654\uc5d0": 20, "\uc2e4\ud328\ud558\ub294": 20, "\ud604\uc0c1": [2, 20, 23], "\ud604\uc0c1\uc740": 20, "\uc785\uc7a5\uc5d0\uc11c": 20, "discriminator\uac00": 20, "\uc0ac\uc9c4\uc774": [20, 22], "\uc9c4\uc9dc": 20, "y\uc778\uc9c0": 20, "\uac00\uc9dc\uc778": 20, "\uc778\uc9c0": 20, "\uad6c\ubcc4\ud558\ub294": 20, "\uc18d\uc774\uae30\ub9cc": 20, "\uc6b0\ub9ac\uc758": 20, "\ubaa9\uc801\uacfc": 20, "\uc0c1\uad00\uc774": 20, "\ub9cc\ub4e4\ub354\ub77c\ub3c4": 20, "\uc0dd\uae30\uc9c0": 20, "\uc54a\uc544\uc11c": 20, "\ubc1c\uc0dd\ud568": 20, "\uc774\uc288\ub85c": 20, "\ud544\uc694\ud574": 20, "\uc84c\uc2b5\ub2c8\ub2e4": 20, "task\ub294": 20, "\uc601\uc5b4": 20, "\ud504\ub791\uc2a4\uc5b4": 20, "\uc601\uc5b4\ub85c": 20, "\ubc88\uc5ed\ud588\uc744": 20, "\ub3c4\ub2ec\ud558\ub294": 20, "\uac83\ucc98\ub7fc": [2, 20], "\ub3cc\uc544\uac00\ub294": 20, "\uac19\uc544\uc57c": 20, "\ud55c\ub2e4\ub294": 20, "\uc758\ubbf8\uc758": 20, "cyclic": 20, "consistency\uc774\ub77c\ub294": 20, "\uc18d\uc131\uc744": 20, "\uc774\uc6a9\ud569\ub2c8\ub2e4": 20, "\ubaa9\uc801\uc2dd\uc744": 20, "\uc815\ub9ac\ud558\uba74": 20, "\uc815\ubc29\ud5a5": 20, "\ub17c\ubb38\uacfc": 20, "\uc5f0\uad6c\uc5d0": 20, "\ub0b4\uc6a9\uc774\uc5c8\uc74c": 20, "\uac1c\ub150\ub4e4\uc740": 20, "introduction\uc5d0\uc11c": 20, "\uc124\uba85\ud588\uace0": 20, "\uc2a4\ud130\ub514\uc640\ub294": 20, "\uad00\ub828\uc774": 20, "\uc2a4\ud0b5\ud588\uc74c": 20, "\ub3c4\uc2dd\ud654": 20, "\uc790\ub8cc": [2, 20], "mapping\ud558\ub294": 20, "function\uc744": [20, 21], "\uc6a9\uc5b4": 20, "\uc815\ub9ac": [2, 20], "pdata": 20, "\ud45c\uc2dc": 20, "dx": [20, 24], "dy\ub294": 20, "dx\ub294": 20, "\uad6c\ubd84": 20, "y\uc640": 20, "\ubaa9\uc801\uc2dd\uc740": 20, "\ub450\uac1c": 20, "domain\uc758": 20, "distribution\uacfc": 20, "\uc77c\uce58\uc2dc\ud0a4\uae30": 20, "g\uc640": 20, "f\uac00": 20, "\ubaa8\uc21c\ub418\ub294": 20, "\ubc29\uc9c0\ud558\uae30": 20, "dy\uc5d0": 20, "l_gan": 20, "gan\uc5d0\uc11c": 20, "\ub300\uc2e0\uc5d0": 20, "\uac08": [20, 22], "x\ub85c": 20, "\uc218\uc2dd\uc774": 20, "\ub098\uc624\uba70": 20, "dx\uc5d0": 20, "dx\ub97c": 20, "\ub123\uc740": 20, "\uc55e\uc11c": 20, "\ub9d0\ud588\ub4ef": 20, "\uc81c\ud55c\uc744": 20, "\ub450\uc5b4": [2, 20], "\uc218\uc2dd\uc73c\ub85c\uc11c": 20, "\uc608\ube44": 20, "\uc2e4\ud5d8\uc5d0\uc11c": [2, 9, 20], "l1": 20, "loss\ub85c": 20, "\ub300\uccb4\ud574\ubd24\ub294\ub370": 20, "\ud5a5\uc0c1\uc744": [20, 22], "\uad00\ucc30\ud560": 20, "\uc5c6\uc5c8\uc74c": 20, "\uc720\ub3c4\ub41c": 20, "loss\uc640\uc758": 20, "\uc0c1\ub300\uc801": 20, "\uc911\uc694\ub3c4\uc5d0": 20, "\uacb0\uc815\ub428": 20, "architecture\ub85c\uc11c": 20, "transfer\uc640": 20, "\ubcf4\uc5ec\uc900": [20, 27], "\ucc44\ud0dd\ud568": [20, 22], "3\uac1c\uc758": 20, "sever": 20, "block": [20, 22, 24, 27], "fraction": 20, "stride": 20, "feature\ub97c": 20, "rgb\ub85c": 20, "\ub9e4\ud551\ud558\ub294": 20, "\uc548\uc815\ud654\uc2dc\ud0a4\uae30": 20, "\ud14c\ud06c\ub2c9\uc744": 20, "function\uc5d0\uc11c": 20, "\ubcc0\uacbd": [2, 20], "50\uac1c\ub97c": 20, "\uc800\uc7a5\ud574": 20, "\ud55c\uaebc\ubc88\uc5d0": 20, "\uc9c4\ub3d9\uc744": 20, "sjinu": 20, "ysbsb": 20, "02": 20, "lsgan": 20, "generator\uc758": 20, "\uc5c5\ub370\uc774\ud2b8\ub97c": 20, "lsgan\uc744": 20, "\uc774\ud574\ub294": 20, "\ubabb\ud588\uace0": 20, "\uc774\ub7f0\uac8c": 20, "\uc788\uad6c\ub098": 20, "\uc815\ub3c4\ub85c\ub9cc": 20, "discriminator\ub294": 20, "\uc774\ubcf4\ub2e4": 20, "\uace0\ucc28\uc6d0\uc774\uc9c0\ub9cc": 20, "\uac04\ub7b5\ud788": 20, "2\ucc28\uc6d0\uc744": 20, "\ud45c\ubc29\ud558\uba74": 20, "\uacb0\uc815\uacbd\uacc4\ub97c": 20, "\ub098\ud0c0\ub0bc": [2, 20], "\ucabd\uc774": 20, "\uac00\uc9dc": [20, 24], "\uc601\uc5ed": 20, "\uc601\uc5ed\uc785\ub2c8\ub2e4": 20, "\uc544\ub798\uc5d0": 20, "\uac70\ub9ac\uac00": [2, 20], "\uba3c": 20, "\uc0ac\uc6a9\ud55c\ub2e4\uba74": 20, "\uc785\uc7a5\uc5d0\uc11c\ub294": 20, "discriminator\ub97c": 20, "\uc18d\uc774\uace0": 20, "vanish": [20, 24], "\uc77c\uc5b4\ub098\uae30": 20, "\uc18d\uc778\ub2e4\ub294": 20, "\uc774\uc720\ub9cc\uc73c\ub85c": 20, "\ud328\ub110\ud2f0\ub97c": 20, "\uc5c6\uac8c": 20, "ls": 20, "generator\ub294": 20, "\uc18d\uc774\ub294": 20, "\ub118\uc5b4\uc11c": 20, "\uac00\uc9c0\uac8c\ub054": 20, "\ud574\uc57c\ud569\ub2c8\ub2e4": 20, "\ub78c\ub2e4\ub97c": 20, "10\uc73c\ub85c": 20, "\uc544\ub2f4\uc744": 20, "\ub124\ud2b8\uc6cc\ud06c\ub294": 20, "\uc5d0\ud3ec\ud06c": 20, "\ub3d9\uc548\uc5d0\ub294": 20, "ln\uc744": 20, "\uc5d0\ud3ec\ud06c\ub9c8\ub2e4": 20, "\uc870\uae08\uc2dd": 20, "\uc218\ub834\ud558\uac8c": 20, "amt": 20, "\ucc38\uac00\uc790\ub4e4\uc740": 20, "\uc0ac\uc9c4\uc774\ubbf8\uc9c0": 20, "\uc9c0\ub3c4": [2, 20], "\uac00\uc9dc\uc774\ubbf8\uc9c0\uc5d0": 20, "\ub178\ucd9c\ub41c": 20, "\uc9c4\uc9dc\ub77c\uace0": 20, "\uc0dd\uac01\ub418\ub294": 20, "\uc120\ud0dd\ud558\uac8c": 20, "1\ubc88": 20, "study\uac00": 20, "\ud14c\uc2a4\ud2b8\uc5d0": 20, "\uc788\uc5b4": [9, 20], "\uae30\uc900\uc784\uc5d0\ub3c4": 20, "\uc2e4\ud5d8\uc774": 20, "\uc591\uc801\uc778": 20, "\uae30\uc900\uc744": 20, "\ucc3e\uc558\ub294\ub370": 20, "score\uc784": 20, "fcn\uc740": 20, "\uc0ac\uc9c4\uc5d0": 20, "\ub808\uc774\ube14": 20, "\ub9f5\uc744": 20, "\ub9f5\uc740": 20, "\ubd84\ud560": 20, "\uba54\ud2b8\ub9ad\uc744": 20, "label\uacfc": 20, "\ube44\uad50\ud560": 20, "\ub3c4\ub85c": 20, "\uc0c1\uc758": 20, "\uc790\ub3d9\ucc28": 20, "label\uc5d0\uc11c": 20, "fcn\uc774": 20, "\uac10\uc9c0\ud558\uba74": 20, "\uc131\uacf5\ud55c": 20, "\ub77c\ubca8\ub9c1": 20, "pixel\ub2f9": 20, "\uc815\ud655\ub3c4": 20, "\ub2f9": 20, "iou": 20, "intersect": 20, "union": 20, "cityscap": 20, "benchmark\uc758": 20, "cogan": 20, "simgan": 20, "pix2pix": [2, 20], "aginst": 20, "6\uc5d0\uc11c": 20, "baseline\uc5d0\uc11c\ub3c4": 20, "\uac15\ub825\ud55c": [9, 20], "\ubc18\uba74\uc5d0": [20, 24], "cyclegan\uc740": 20, "supervise\uc778": 20, "pix2pix\uc640": 20, "translation\uc744": 20, "realism": 20, "\uc9c0\ub3c4\uc5d0\uc11c": 20, "\ud56d\uacf5": 20, "\uc0ac\uc9c4\uc5d0\uc11c": 20, "\ubaa8\ub450\uc5d0\uc11c": 20, "4\uc758": 20, "\ucc38\uac00\uc790\ub97c": 20, "\uc18d\uc77c": 20, "baseline\uc740": 20, "\ub3c4\uc2dc": 20, "\ud48d\uacbd\uc5d0": 20, "\ud3c9\uac00\ud558\uace0": 20, "3\uc740": 20, "\ud3c9\uac00\ud568": [20, 25], "cyclegan\uc774": 20, "baseline\ub4e4\uc758": 20, "\ub2a5\uac00\ud55c\ub2e4": 20, "consistency\uc758": 20, "\ubcf4\uc5ec\uc8fc\ub294": [20, 21, 23, 27], "\uc5c6\uc560\uba74": 20, "cycle\uc744": 20, "\uc81c\uac70\ud558\ub294": 20, "\uc800\ud558\ub428": 20, "\uacb0\ub860\uc744": 20, "\ub0b4\ub9b4": 20, "\ubc29\ud5a5\uc5d0\uc11c\ub9cc": 20, "\uba54\uc18c\ub4dc\ub97c": 20, "cycle\ub9cc": 20, "\ub3cc\ub838\uc744": 20, "backward": [20, 23, 24], "\uc774\ub530\uae08\uc529": 20, "\ubcf4\uc774\uace0": [20, 21], "collapse\ub97c": 20, "\uc720\ubc1c\ud558\ub294": 20, "\ubc1c\uacac\ud568": 20, "\uc81c\uac70\ub41c": [9, 20], "\ub9e4\ud551\uc758": 20, "\ubc29\ud5a5\uc5d0": 20, "7\uc744": 20, "\uc787\uc5c8\uc74c": 20, "\uc7ac\uad6c\uc131\ub41c": 20, "\uc0ac\uc9c4\uacfc": 20, "\ub3c4\uba54\uc778\uc774": 20, "\uacbd\uc6b0\uc5d0\ub3c4": 20, "\ud14c\uc2a4\ud2b8": 20, "\ub9ce\uc558\uc74c": 20, "8\uc740": 20, "cmp": 20, "fa\u00e7ad": 20, "database\uc758": 20, "\uac74\ucd95": 20, "ut": 20, "zapoos50k": 20, "dataset\uc758": 20, "\uc2e0\ubc1c\uacfc": 20, "pix2pix\uc5d0": 20, "cyclegan\uc758": 20, "\ud488\uc9c8\uc740": 20, "\ub300\uc758": 20, "\uc9f1\uc774\ub2e4": 20, "\ub9ce\uc544": 20, "\uc0dd\ub7b5\ud558\uaca0\uc2b5\ub2c8\ub2e4": 20, "\u3160": 20, "data\uac00": 20, "data\uc5d0\uc11c": 20, "transslation\uc774": 20, "\ud55c\uac83\ubcf4\ub2e4": 20, "\ub9e4\ub825\uc801\uc774\ub2e4": 20, "application\uc740": 20, "\uc6f9\uc0ac\uc774\ud2b8\uc5d0": 20, "\uc2e0\uacbd": 20, "\uc804\ub2ec": 20, "\uc791\uc5c5\uacfc": 20, "\uc120\ud0dd\ud55c": 20, "\uc608\uc220": 20, "\uc791\ud488\uc758": 20, "\uc804\ub2ec\ud558\ub294": 20, "\uc791\ud488": 20, "\uceec\ub809\uc158\uc758": 20, "\ubaa8\ubc29\ud558\ub294": 20, "\ubcc4\uc774": 20, "\ube5b\ub098\ub294": 20, "\uadf8\ub9ac\ub294": 20, "\ubc18": 20, "\uace0\ud750": 20, "\ub530\ub77c\ud558\ub294": 20, "\ub290\ub08c\uc744": 20, "\ub530\ub77c\ud55c\ub2e4": 20, "turmukhambetov": 20, "\ubc94\uc8fc\uc758": 20, "\uac1d\uccb4\ub85c": 20, "\uc81c\uc548\ud558\ub294": [2, 20], "\uc2dc\uac01\uc801\uc73c\ub85c": [20, 21], "\ubc94\uc8fc": 20, "\ubcc0\ud615\uc5d0": 20, "\uc911\uc810\uc744": [9, 20, 21], "\ub461\ub2c8\ub2e4": [20, 28], "turn": 20, "hors": 20, "zebra": 20, "\uac04": 20, "\uc0c9": 20, "\uad6c\uc131\uc744": 20, "\ubcf4\uc874\ud558\uae30": 20, "\uc720\uc6a9\ud558\ub2e4\ub294": 20, "\ubc1c\uacac\ud560": 20, "taigman": 20, "49": 20, "\ucc44\ud0dd\ud558\uc5ec": [20, 22], "\uc81c\ub108\ub808\uc774\ud130\uac00": 20, "\ub3c4\uba54\uc778\uc758": 20, "\uc81c\uacf5\ubc1b\uc744": 20, "\uadfc\ucc98\uc5d0": 20, "\uc815\uaddc\ud654\ud569\ub2c8\ub2e4": 20, "lident": 20, "ey_pdata": 20, "lidentity\uac00": 20, "\uc5c6\uc73c\uba74": 20, "\uc0dd\uc131\uc790": 20, "\uad73\uc774": 20, "\uc54a\uc744": [9, 20], "\uc0c9\uc870\ub97c": 20, "\uc790\uc720\ub86d\uac8c": 20, "\ubcc0\uacbd\ud560": 20, "monet\uc758": 20, "flickr": 20, "\uc0dd\uc131\uc790\ub294": 20, "\uadf8\ub9b0": 20, "\uc77c\ubab0": 20, "\uc2dc\uac04\uc5d0": 20, "\ub9e4\ud551\ud569\ub2c8\ub2e4": 20, "\uc801\ub300\uc801": 20, "\uc0ac\uc774\ud074": 20, "\uc77c\uad00\uc131": 20, "\uc190\uc2e4": [20, 21], "\ub9e4\ud551\uc774": 20, "\ub3d9\ub4f1\ud558\uac8c": 20, "\uc720\ud6a8\ud560": 20, "\uc190\uc2e4\uc758": 20, "\ud6a8\uacfc\ub294": 20, "9\uc5d0\uc11c": 20, "\ubcf4\uc5ec\uc9d1\ub2c8\ub2e4": 20, "9\ub294": 20, "\ud3ec\ud568\ub418\uc5b4": 20, "set\uc740": 20, "set\uc73c\ub85c\ubd80\ud130": 20, "\uadf8\ub824\uc9c4": 20, "datqa\ub97c": 20, "\uadf8\ub9bc\uc5d0": 20, "\ud0c0\ub2f9\ud55c": 20, "\uc544\ub2c8\ub2e4": 20, "monet\uc774": 20, "\uc0c8": [20, 21], "\uadf8\ub9b4": 20, "generalization\uc740": 20, "press": 20, "\uc595\uc740": 20, "\uae4a\uc774\uc758": 20, "flickr\uc5d0\uc11c": 20, "\ub2e4\uc6b4\ub85c\ub4dc\ud55c": 20, "\uaf43": 20, "\ud6c8\ub828\ud569\ub2c8\ub2e4": 20, "\uc18c\uc2a4": 20, "\ub3c4\uba54\uc778\uc740": 20, "\uc2a4\ub9c8\ud2b8\ud3f0\uc73c\ub85c": 20, "\ucc0d\ud78c": 20, "\uad6c\uc131\ub418\uc5b4": [20, 27], "\uc870\ub9ac\uac1c\ub85c": 20, "\uae4a\uc740": 20, "dof": 20, "\ucd08\uc810": 20, "\uae4a\uc774": 20, "\ub300\uc0c1\uc740": 20, "\uc870\ub9ac\uac1c\uac00": 20, "dslr\ub85c": 20, "\ucd2c\uc601\ub41c": 20, "\ud3ec\ud568\ud569\ub2c8\ub2e4": 20, "\uc0ac\uc9c4\uc73c\ub85c\ubd80\ud130": 20, "\uc131\uacf5\uc801\uc73c\ub85c": 20, "shallow": 20, "field": [20, 26], "\ucd08\uc810\uc774": 20, "\ub9de\uc740": 20, "\ubc30\uacbd\uc774": 20, "\ud750\ub9bf\ud558\uac8c": 20, "\ud65c\uc6a9": [2, 20, 21, 22], "\uad6c\ubaa9\ud558\uace0\uc790": 20, "\uac15\uc870\ud558\uae30": 20, "domain\uc740": 20, "\uc2a4\ub9c8\ud2b8\ud3f0\uc758": 20, "target\uc740": 20, "discuss": 20, "\ud765\ubbf8\ub85c\uc6b4": [20, 23], "\uade0\uc77c\ud558\uac8c": 20, "\uc544\ub2c8\uc5c8\uc2b5\ub2c8\ub2e4": 20, "\ud574\uc11d": 20, "task\uc640": 20, "\ucd5c\uc18c\ud55c\uc758": 20, "\ubcc0\ud654\ub9cc": 20, "\ubcc0\ud654\uac00": [20, 25], "\uc548\ub418\ub294": 20, "\uc788\uc5c8\uace0": [2, 20], "\ud615\uccb4\uac00": 20, "\uc560\ub9e4\ud574\uc9c4": 20, "\uc774\ub7f0\uac78": 20, "geometri": 20, "\ud45c\ud604\uc744": 20, "\ubcf4\uc544": 20, "\ucf54": 20, "\uc785\uc5d0": 20, "\uad6c\ud604\ud558\ub294\ub370": 20, "\ub9d0": 20, "\uc5bc\ub8e9\ub9d0": 20, "\uc608\uc81c\uc758": 20, "\ub9d0\uc740": [2, 20], "\ud0c0\ub294": 20, "\ub9ce\uc558\ub294\ub370": 20, "\uc5bc\ub8e9\ub9d0\uc758": 20, "\uc5c6\ub2e4\ubcf4\ub2c8": 20, "\ubc30\uacbd\ub3c4": 20, "\uc5bc\ub8e9": 20, "\uadf8\ub9ac\uac70\ub098": 20, "\uc5bc\ub8e9\ub9d0\uc5d0\uc11c": 20, "\ub178\ub797\uac8c": 20, "\uce60\ud55c": 20, "\uc0dd\uae40": 20, "\ub54c\ub54c\ub85c": 20, "\ub098\ubb34\uc640": 20, "\uac74\ubb3c\uc758": 20, "label\uc744": 20, "\ubaa8\ud638\uc131\uc744": 20, "\ud574\uacb0\ud558\ub824\uba74": 20, "weak": 20, "supervision\uc774": 20, "\ub9c8\ubb34\ub9ac": 20, "\ud48d\ubd80\ud558\uac8c": 20, "\uc81c\uacf5\ub418\uba70": 20, "\ud65c\uc6a9\ud574\uc57c": 20, "setting\uc5d0\uc11c": 20, "\uac83\uc758": 20, "\ub298\ub9ac\ub294\ub370": 20, "\uae30\uc5ec\ud569\ub2c8\ub2e4": 20, "12092": 21, "unoffici": 21, "donggeun": [21, 22, 25], "sean": [21, 22, 25], "ko": [21, 22, 25], "june": 21, "22": 21, "\ubaa8\ub378\uc774\uba70": 21, "120\uc5b5\uac1c": 21, "\uc218\uc640": 21, "5\uc5b5": 21, "\ubaa8\ub378\ub9c1\uc744": 21, "\ud1b5\ud558\uc5ec": 21, "2021\ub144": 21, "diverse\ud55c": 21, "3\uc640": 21, "vae\ub97c": 21, "transformer\uc744": 21, "architecture\uc744": [21, 22], "\uad6c\ucd95": [2, 21, 27], "model\uba70": 21, "learning\uc744": [9, 21], "\ub0c4": [2, 21], "\uc218\ub294": 21, "shot\uc744": 21, "\ubd80\ubd84\ub9cc": [21, 22], "1750\uc5b5": 21, "\uac1c\uc218\uc758": 21, "2005": 21, "14165": 21, "jalammar": 21, "how": 21, "gpt3": 21, "encoder\uc5d0\uc11c": 21, "output\uc740": 21, "discret": [2, 21], "categor": 21, "\uac16\ub294\ub2e4\uace0": 21, "cnn": 21, "\uac70\uce5c": [9, 21, 26], "d\ucc28\uc6d0\uc758": 21, "\uc704\uce58\uc5d0": 21, "\uadf8\ub9ac\ub4dc\ub85c": 21, "\ub098\ub204\uace0": 21, "\ud835\udc52_1": 21, "\ud835\udc52_\ud835\udc58": 21, "code\ub85c": 21, "\ubcc0\ud658": [2, 21], "z_": [21, 27], "e_j": 21, "\ucc3e\uc544\uc11c": 21, "\ubd80\uc5ec\ud568": 21, "p2yeong": 21, "explain": 21, "issu": 21, "pixel\uc744": 21, "\uc9c1\uc811\uc801\uc73c\ub85c": 21, "token\uc744": [9, 21], "\uace0\ud654\uc9c8": [21, 22], "\uc774\ubbf8\uc9c0\uc77c\uc218\ub85d": 21, "\uba54\ubaa8\ub9ac\ub7c9\uc774": 21, "\ud544\uc694\ud574\uc11c": 21, "\ube44\ud6a8\uc728\uc801": 21, "short": 21, "depend": [21, 23], "model\ub4e4": 21, "likelihood": [21, 22, 24, 28], "dependency\ub97c": 21, "\uac83\uc774\uba70": 21, "detail\uc5d0": 21, "\uc9d1\uc911\ud558\uac8c": 21, "recognizable\ud574\uc11c": 21, "2\uac00\uc9c0": [21, 25], "\uadf9\ubcf5\ud558\uace0\uc790": 21, "textbf": 21, "rgb": 21, "rightarrow": 21, "\uc555\ucd95": 21, "192\uac1c\uc758": 21, "\uac12": [2, 21], "\uc911\uc5d0": 21, "\ubc30\uc815": 21, "size\ub97c": 21, "bpe": 21, "\ub4e4\uacfc": [21, 25], "\uc5f0\uc18d\uc801\uc73c\ub85c": 21, "\uc785\ub825\ud568": 21, "concaten": [21, 26], "token\uacfc": [9, 21], "\ub4e4\uc758": 21, "\uacb0\ud569": [2, 21], "\ubaa8\ub378\ub9c1\ud558\uc5ec": [21, 22], "\uc2dc\uac01\ud654": [21, 22], "jiho": 21, "ml": [21, 28], "weekli": 21, "nlp": 21, "40": 21, "\ud30c\uc774\ud504\ub77c\uc778": 21, "cqom0r2kmvi": 21, "1729": 21, "\ud835\udc5e": 21, "\u03c6": 21, "dvae": 21, "token\ub97c": 21, "\ud835\udc5d": 21, "\ud835\udf03": 21, "token\uc5d0\uc11c": 21, "decoder\uc5d0\uc11c": 21, "\u03c8": 21, "purpl": 21, "\ubaa8\ub378\ub9c1\ud55c": [2, 21], "text\uc640": 21, "token\ub4e4\uc758": 21, "\ud835\udc5e_\u03c6": 21, "\ud835\udc5d_\ud835\udf03": 21, "\ud559\uc2b5\ud568": 21, "elb": 21, "bound\ub97c": 21, "192": 21, "elb\ub97c": 21, "continuous\ub97c": 21, "\ubc14\uafd4\uc57c": 21, "\ud559\uc2b5\uc2dc\uc5d0\ub294": 21, "argmax\ub97c": 21, "\uc778\ub371\uc2a4\ub97c": 21, "\uc120\ud0dd\ud558\uc5ec": 21, "\uacc4\uc0b0\ud558\uba74": 21, "reparameter": 21, "gradient\ub97c": [9, 21, 22], "\uc5f0\uc0b0": 21, "argmax": 21, "gumbel": 21, "\ud574\uacb0": 21, "underset": 21, "g_i": 21, "e_i": 21, "relaxation\ub97c": 21, "q_": [2, 21, 28], "tau": [21, 27], "temperatur": 21, "relaxation\uc744": 21, "tight\ud558\uac8c": 21, "\uc7a1\uc544\uc90c": 21, "psi": 21, "120\uc5b5\uac1c\uc758": 21, "token\uc740": 21, "logit\uc5d0\uc11c": 21, "\uc18c\ubb38\uc790\ud654": 21, "384": 21, "vocabulary\ub97c": 21, "\ud55c\ubc88\uc5d0": 21, "causal": 21, "row": 21, "column": 21, "\ub300\ud558\uc5ec": [2, 21], "n\uac1c\ub294": 21, "n\uac1c": 21, "\uace8\ub77c\uc11c": 21, "\uace0\ub974\uae30": 21, "\ubc88\uc9f8\ub85c": 21, "\uc120\ud0dd\ud568": 21, "best\ub97c": 21, "\uace0\ub97c\ub54c": 21, "\uc99d\uac00\ud560\uc218\ub85d": 21, "prompt\ub791": 21, "\ub098\uc634": [21, 22], "\uc54c\uace0\ub9ac\uc998\uc744": 21, "score\uc774": 21, "\uc81c\uc77c": [21, 22, 27], "\ubf51\uc74c": 21, "\uc54c\ub9de\uc740": 21, "\uac1c\uc218\uc5d0": [21, 23], "df": 21, "five": 21, "vote": 21, "gan\ubcf4\ub2e4": [21, 22], "\uc555\ub3c4\uc801\uc778": [9, 21], "\ucc28\uc774\ub85c": 21, "\ud22c\ud45c": 21, "\ubc1b\uc558\uc74c": 21, "frechet": 21, "distanc": 21, "\ub0ae\uc744\uc218\ub85d": [21, 22], "\uc88b\uc73c\uba70": 21, "\ub192\uc744\uc218\ub85d": [21, 22], "\ub791": 21, "cub": 21, "coco\uc5d0\uc11c\ub294": 21, "\ubcf4\uc5ec\uc92c\uc74c": 21, "cub\uc5d0\uc11c\ub294": 21, "\ucc0d\uc9c0": 21, "\ubabb\ud558\uc600\uace0": 21, "score\uc5d0\uc11c\ub294": 21, "\uae30\ub85d\ud568": [2, 21], "cub\uc5d0": 21, "\uacc4\uc120\uc744": 21, "\uc0dd\uac01\ud568": 21, "\uacb0\uacfc\uac12": 21, "\ud655\uc7a5": 21, "parameter\uacfc": 21, "\ub6f0\uc5b4\ub098\uac8c": 21, "\ud574\uacb0\ud568": 21, "\ud6cc\ub96d\ud55c": [2, 21, 25], "\uc77c\ubc18\ud654": 21, "\ud3c9\uac00\uc5d0\uc11c": 21, "\uc900\uc218\ud55c": 21, "\uc2f6\uc740": 21, "\uac1d\uccb4\uac00": 21, "\ud3ec\ud568\ub418\uba74": 21, "\uacaa\uc74c": 21, "\uace0\uc2b4\ub3c4\uce58\uac00": 21, "2\ub9c8\ub9ac\uac70\ub098": 21, "\uac15\uc544\uc9c0\uc640": 21, "\uace0\uc2b4\ub3c4\uce58": 21, "\ub458\ub2e4": 21, "\ud06c\ub9ac\uc2a4\ub9c8\uc2a4": 21, "\uc2a4\uc6e8\ud130\ub97c": 21, "\uc785\uace0": 21, "\uc544\uc26c\uc6b4": 21, "\ub370\uc774\ud130\uc14b\uc774": [21, 26], "tuning\uc73c\ub85c": 21, "limitation\uc744": 21, "2105": 22, "05233": 22, "\ubaa8\ub378\ub4e4\uc758": 22, "\ub6f0\uc5b4\ub118\uc74c": 22, "\ubd80\ubd84\uc5d0\uc11c\ub3c4": 22, "\ubcf4\uc5ec\uc900\ub2e4\uace0": [22, 23], "\uc8fc\uc7a5\ud568": 22, "diversity\uc640": 22, "fidelity\uc758": 22, "trade": [9, 22], "off\uc5d0": 22, "model\ub4e4\uc774\uba70": 22, "\uc0dd\uc131\ud574\ub0b4\ub294\ub370\uc5d0": 22, "\uc131\uacf5": 22, "\ud588\uc74c": [2, 22], "deep\uc5d0": 22, "\ub0ae\uc73c\uba70": 22, "\uac1c\uc120\uc0ac\ud56d\uc774": 22, "\ud544\uc694\ud568": 22, "\ub450\uac00\uc9c0": 22, "model\ub4e4\uc758": 22, "\ub04c\uc5b4\uc62c\ub9ac\uba70": 22, "\ub0ae\ucd94\uaca0\ub2e4\uace0": 22, "\uc124\uba85\ub418\uc788\uc73c\ubbc0\ub85c": 22, "\ub17c\ubb38\ub4e4\uc758": 22, "\uadfc\uc0ac\uac12\uc774\ub77c\uace0": 22, "\uac00\uc815\ud558\uba70": 22, "\uacc4\uc0b0\ud55c\ub2e4": 22, "approx": [22, 24, 28], "\ub9cc\ub4e0\ub2e4": 22, "\uc608\uce21\ud55c\ub2e4": [9, 22], "\uacf5\ubd84\uc0b0": 22, "\ubd88\uac00\ub2a5\ud55c": 22, "\ub9e4\uac1c\ubcc0\uc218\ub85c": 22, "\uc124\uc815\ub418\uba70": 22, "\uac00\uc9c4\ub2e4": 22, "pipelin": [22, 25], "ddpm\uc5d0\uc120": 22, "\uc9c0\ud45c\uac00": 22, "\ub0ae\uc558\ub2e4": 22, "scheduling\uc744": 22, "\uc0ac\uc6a9\ud588\uc9c0\ub9cc": 22, "\uc8fc\uc7a5\ud588\ub2e4": 22, "\ud559\uc2b5\uc5d0\ub3c4": 22, "\ub04a\uace0": 22, "\ubc14\uafc8": 22, "iteration\uc73c\ub85c": 22, "\ucc44\ud0dd\ud588\uc9c0\ub9cc": 22, "parameter\uc744": 22, "\ubcc0\uacbd\ud558\uc5ec": 22, "\uc77c\uc815\ud558\uac8c": 22, "\uac00\uc838\uac00\uba74\uc11c": 22, "\uc99d\uac00": [2, 22], "\ubcf4\uae30": 22, "\uc2dc\ucf1c\ubcf4\uae30": 22, "head\uc5d0": 22, "8x8": 22, "16x16": 22, "\ud574\ubcf4\uae30": 22, "\uc77c\ubc18": 22, "block\uc774": 22, "biggan\uc758": 22, "block\uc744": [9, 22], "connection\uc744": 22, "chang": 22, "32\uc77c\ub54c": 22, "\ub0ae\ub2e4": 22, "160": 22, "resolution\uc744": [9, 22], "block\ub9c8\ub2e4": 22, "\uc904\uc774\uae30": 22, "\ud29c\ub2dd\uc744": 22, "adain\uc774\ub791": 22, "\uc5f0\uc0b0\ud558\ub294": 22, "adagn": 22, "\uc18c\uac1c\ud588\ub2e4": 22, "\ubc29\ubc95\ub860\uc778\uc9c0\ub294": 22, "\ubaa8\ub974\uaca0\ub2e4": 22, "normalization\uc744": 22, "adpative\ud558\uac8c": 22, "embedding\uacfc": 22, "adain": 22, "\uacf1\ud558\uace0": 22, "\ub354\ud568": 22, "y_b": 22, "where": [22, 27], "adagn\uc758": 22, "adagn\uacfc": 22, "additon": 22, "normalization\ubcf4\ub2e4": 22, "addit": 22, "layer\uc744": 22, "\uc0ac\uc6a9\ud588\ub294\ub370": 22, "\ub0ae\uac8c": 22, "\uc8fc": 22, "de": 22, "\uc90c\uc73c\ub85c\uc368": 22, "zp_": 22, "normalizing\uc744": 22, "\uc0c1\uc218": 22, "log_": 22, "\uace1\ub960\uc774": 22, "\ubb34\ud55c\uc73c\ub85c": 22, "rightarrow0": 22, "\ud14c\uc77c\ub7ec": 22, "\uae09\uc218\ub97c": 22, "\uc7ac\uc804\uac1c": 22, "classifier\uc758": [9, 22, 25], "\uc2dd": [2, 22], "\uc720\ub3c4\ub294": 22, "\ubcf8\ubb38\uc758": 22, "\ubc88\uc2dd\uc774\ubbc0\ub85c": 22, "\ubc29\ubc95\uc774\ub2e4": 22, "\ub611\uac19\uc774": 22, "sample\ud55c\ub2e4": 22, "ddim\uc5d0\uc11c": [22, 25], "gradient\uc758": 22, "\ube7c": 22, "score\uc744": 22, "\uad6c\ud55c\ub2e4": 22, "scaling\uc758": 22, "\uc601\ud5a5": [2, 22], "\uac12\uc5d0": 22, "classifier\uac00": 22, "scaling\uc774": 22, "\ub2e4\ub974\ub2e4": 22, "\uc8fc\uba74": 22, "\uc6f0\uc2dc\ucf54\uae30\ub77c\ub294": 22, "\uc6f0\uc2dc\ucf54\uae30\uc2a4\ub7ec\uc6b4": 22, "\uac15\uc544\uc9c0\uac00": 22, "\ub418\uc9c0\ub294": 22, "\uc6f0\uc2dc\ucf54\uae30": 22, "class\ub77c\ub294": 22, "\ubd84\uc704\uae30\uc758": 22, "\uac15\uc544\uc9c0\uc758": 22, "epsilon\uc774\ub77c\ub294": 22, "scale\uc5d0": 22, "\ubc1b\ub294\uc9c0": 22, "sampling\ud560": 22, "off": [9, 22], "scale\uc774": 22, "recall\uc740": 22, "\ub0ae\uc9c0\ub9cc": 22, "precision\uc740": 22, "\ub192\ub2e4": 22, "\uc0dd\uae30\ub294\ub370": 22, "recall\uc774": 22, "diveristy\uac00": 22, "\ub0ae\ub2e4\ub294": [22, 28], "\uc758\ubbf8\uc774\uace0": 22, "precision\uc774": 22, "\ub192\ub2e4\ub294": 22, "\ub73b\uc774\ub2e4": 22, "\ub192\uc77c\uc218\ub85d": 22, "label\ucabd\uc73c\ub85c": 22, "guide\uac00": 22, "\uc0dd\uae30\ubbc0\ub85c": 22, "\uc77c\uc815\ud55c": 22, "sfid\ub294": 22, "off\ub85c": 22, "\ub3c4\ucd9c\ub418\ub294": 22, "\uac12\uc774\ubbc0\ub85c": 22, "\ucd5c\uace0\uc758": 22, "\uc9c0\uc810\uc5d0\uc11c": 22, "\ub098\uc654\ub2e4": 22, "adm\uc740": 22, "\uc57d\uc790\uc774\uba70": 22, "adm": [9, 22], "g\ub294": 22, "guidance\uc758": 22, "\uc57d\uc790\uc774\ub2e4": 22, "\uc8fc\uc5c8\uc744": 22, "fid\uac12\uc774": [22, 25], "\ub098\uc654\uc73c\uba70": 22, "vice": 22, "versa": 22, "center": 22, "\ub450\ubc88\uca30": 22, "\ud50c\ub77c\ubc0d\uace0": 22, "\ubcfc\ub54c": 22, "biggan\uc740": 22, "\uc774\ubbf8\uc9c0\uac04\ub4e4\uc758": 22, "\ud50c\ub77c\ubc0d\uace0\uac00": 22, "\ub2e4\uc218": 22, "\ub290\ub08c\uc758": 22, "\ubf51\uc544\ub0b8\ub2e4": 22, "\ub2e4\ucc44\ub85c\uc6b4": 22, "\ud55c\ub9c8\ub9ac\ub9cc": 22, "\uc0ac\uc9c4\ub3c4": 22, "\ub290\ub9ac\ub2e4": 22, "distil": 22, "\ubc95\uc744": 22, "\uace0\ub824": [2, 22], "guidance\ub294": 22, "classif": [9, 22, 24], "function\uc758": 22, "label\uc774": 22, "data\uc5d0\ub294": 22, "\ud655\uc7a5\uc774": 22, "\ubd88\uac00\ub2a5\ud558\ub2e4": 22, "unlabel": 22, "sample\uc744": [9, 22], "cluster": 22, "\ubc29\ubc95\ub860\uc744": 22, "\ud558\ub824": 22, "driven": [9, 23], "12242": 23, "huggingfac": [23, 27], "\ucd5c\uadfc\uc5d0": [23, 24, 25], "\ub4f1\uc7a5\ud558\uc600\uc9c0\ub9cc": 23, "\ubd80\ubd84\uc5d0\uc11c": 23, "\uba74\ub4e4\uc744": 23, "\uac1c\uc120\ud558\uae30": 23, "\uae30\ubc95\uc73c\ub85c": [9, 23, 27, 28], "\uc18c\uac1c\ub418\uc5c8\uace0": 23, "5\uc7a5\uc758": 23, "\ub418\uba70": [23, 26], "nvidia": [23, 27], "5\ubd84": [2, 23], "\uc815\ub3c4\ubc16\uc5d0": 23, "\uc18c\uc694\ub418\uc9c0": 23, "\uc54a\ub294\ub2e4\uace0": 23, "\ubb34\uc5c7\uc778\uc9c0": [23, 28], "\uc54c\uc544\ubcf4\uae30": 23, "\uc815\ub9ac\ub97c": 23, "\ud574\ubcfc": [2, 23], "gamma": 23, "\uc785\ub825\ubc1b\uc544\uc11c": 23, "gen": 23, "\uc218\uc2dd\uc801\uc73c\ub85c": [23, 28], "\ud45c\ud604\ud558\uba74": [23, 28], "w_t": [2, 23], "alpha_tx": 23, "t5": 23, "xxl": 23, "\ud560\ub54c": 23, "\ub54c\ub85c\ub294": 23, "\ud3ec\ud568": 23, "\uace0\uc815\uc2dc\ud0a8\ub2e4\uace0": 23, "\uc55e\uc368": [23, 26, 27], "\uc124\uba85\ub4dc\ub838\ub358": 23, "\ub0b4\uc6a9\ub4e4\uc744": 23, "blob": 23, "main": 23, "text_encoder_cl": 23, "import_model_class_from_model_name_or_path": 23, "arg": [2, 23, 28], "noise_schedul": 23, "ddpmschedul": 23, "from_pretrain": 23, "subfold": 23, "text_encod": 23, "autoencoderkl": 23, "unet2dconditionmodel": 23, "epoch": [23, 24, 27], "first_epoch": 23, "num_train_epoch": 23, "train_dataload": 23, "skip": [23, 25], "until": 23, "reach": 23, "resum": 23, "resume_from_checkpoint": 23, "resume_step": 23, "progress_bar": [23, 27], "continu": [2, 23], "accumul": 23, "pixel_valu": 23, "weight_dtyp": 23, "latent_dist": 23, "config": 23, "scaling_factor": 23, "offset_nois": 23, "randn": 23, "bsz": 23, "randint": 23, "num_train_timestep": 23, "accord": 23, "magnitud": 23, "noisy_lat": 23, "add_nois": 23, "get": 23, "encoder_hidden_st": [23, 27], "input_id": 23, "model_pr": 23, "prediction_typ": 23, "v_predict": 23, "get_veloc": 23, "part": 23, "model_pred_prior": 23, "target_prior": 23, "mse_loss": 23, "float": 23, "prior_loss": 23, "sync_gradi": 23, "params_to_clip": 23, "itertool": 23, "clip_grad_norm_": 23, "max_grad_norm": 23, "zero_grad": [23, 24], "set_to_non": 23, "set_grads_to_non": 23, "noun": 23, "\uc720\uc9c0\ud558\uace0\uc790": 23, "\ub300\uc0c1\uc5d0": 23, "\ub2f4\ub294": 23, "rare": [23, 26], "3\uac1c": 23, "unicod": 23, "charact": 23, "\ub79c\ub364\ud558\uac8c": 23, "\uc0d8\ud50c\ub9c1\ud574\uc11c": 23, "\uc815\uc758\ud569\ub2c8\ub2e4": [23, 26, 27, 28], "drift": 23, "\uc785\ub825\ud558\uc5ec": 23, "\uacc4\uc0b0\ud569\ub2c8\ub2e4": 23, "\uacfc\uc815\uc73c\ub85c": [2, 23], "\ud559\uc2b5\ud558\uace0\uc790": 23, "\uc2dc\ud0a8": 23, "sigma_t": 23, "alpha_": [9, 23], "\ucd94\uac00\ud568\uc73c\ub85c\uc368": 23, "\uc720\uc9c0\ud558\uac8c": 23, "\uc774\ub85c\uc368": [23, 28], "encourag": 23, "\uac00\uc9c0\uc758": 23, "\uccab\ubc88\uc9f8\ub85c\ub294": 23, "dino": 23, "\uc0dd\uc131\ub418\uae30": 23, "\uc120\ud638\ub41c\ub2e4\uace0": 23, "\uc790\uc138\ud558\uac8c\ub294": [23, 27, 28], "\uacc4\uc0b0\ub429\ub2c8\ub2e4": 23, "pairwis": 23, "\ube44\uad50\ud588\uc744\ub54c": 23, "\uacb0\uacfc\ub3c4": [23, 27], "\uc801\uc6a9\ub428\uc73c\ub85c\uc368": 23, "\uc18c\uac1c\ub4dc\ub838\ub358": 23, "div": 23, "\ud574\uacb0\ub418\ub294": 23, "\uc785\ub825\ud588\uc744\ub54c\uac00": 23, "\uc124\uba85\ud569\ub2c8\ub2e4": 23, "randomli": 23, "can": 23, "backpack": 23, "recontextu": 23, "articul": 23, "art": 23, "famou": 23, "painter": 23, "statu": 23, "sculptor": 23, "\ucc44": 23, "\ud615\ud0dc\ub3c4": 23, "novel": 23, "\uac01\ub3c4\uc5d0\uc11c": 23, "\ubcf4\ub294": 23, "\uc0dd\uc131\ub3c4": [23, 25], "properti": [2, 23], "modif": 23, "dog": 23, "speci": 23, "\uace0\uc720": 23, "\ub4e4\uc774": 23, "\uc5d0\uc11c\ub3c4": [2, 23], "\ubc18\uc601\uc774": 23, "\ud55c\uacc4\uc810\ub3c4": 23, "\uc790\uc8fc": 23, "\ub098\ud0c0\ub098\uc9c0": 23, "appear": 23, "\ubcf4\uc778\ub2e4\uace0": [9, 23], "\ubcf8\ubb38\uc5d0": 23, "\uc18c\uac1c\ub418\uace0": 23, "\uc788\uc9c0\ub294": 23, "\uc54a\uc9c0\ub9cc": 23, "\ubd80\ubb38\uc5d0\uc11c\ub3c4": 23, "\ud559\uc2b5\uacb0\uacfc\ub97c": 23, "\ubcf4\uc5ec\uc8fc\ub294\ub370": 23, "\uc7a5\ub9cc\uc73c\ub85c\ub3c4": 23, "\ub9cc\ud654": 23, "\uc0ac\ub840\ub4e4\uc744": 23, "nip": 24, "2014": [24, 28], "1406": 24, "2661": 24, "eriklindernoren": 24, "smart": [24, 28], "lab": [24, 27, 28, 29], "kaist": [24, 28], "\ub525\ub7ec\ub2dd": [24, 28], "chp": 24, "ian": 24, "goodfellow": 24, "2014\ub144\uc5d0": 24, "\ubc1c\ud45c\ud55c": 24, "\uc18c\uac1c\ub418\uae30": 24, "\uc804\uae4c\uc9c0": 24, "\ub144": 24, "\uc0dd\uc131\ubd84\uc57c\uc5d0\uc11c": 24, "\ub300\ud45c\uc801\uc778": 24, "\uc790\ub9ac\uc7a1\uc558\uc5c8\uc2b5\ub2c8\ub2e4": 24, "margin": [2, 24, 28], "\uad6c\ud558\uac8c": 24, "taxonomi": 24, "\uc7a0\uc7ac\ubcc0\uc218": [24, 28], "\uadf8\ub85c\ubd80\ud130": 24, "\uad6c\ubd84\ud558\ub294": 24, "\uad6c\uc131\uc774": 24, "\ub9d0\ud574\uc11c": 24, "\ub4e4\uc5b4\uc624\uba74": 24, "\uac00\uc9dc\ub85c": 24, "\ucd9c\ub825\ud558\ub294": 24, "binari": 24, "\uc9c4\ud589\ud569\ub2c8\ub2e4": 24, "\ucf54\ub4dc\ub3c4": 24, "in_feat": 24, "out_feat": 24, "batchnorm1d": 24, "leakyrelu": 24, "inplac": 24, "opt": 24, "latent_dim": 24, "np": 24, "prod": 24, "img_shap": 24, "tanh": 24, "sigmoid": [24, 28], "img_flat": 24, "d\ub97c": 24, "g\ub97c": 24, "\uc190\uc2e4\ud568\uc218": [24, 28], "min_g": 24, "max_d": 24, "p_z": 24, "\uc54c\uace0\ub9ac\uc998\uacfc": 24, "\ube44\uad50\ud574\ubcf4\uaca0\uc2b5\ub2c8\ub2e4": 24, "n_epoch": 24, "variabl": [2, 24, 28], "fill_": 24, "fake": 24, "real_img": 24, "optimizer_g": 24, "gen_img": 24, "measur": 24, "abil": [24, 27], "fool": 24, "g_loss": 24, "adversarial_loss": 24, "optimizer_d": 24, "real_loss": 24, "fake_loss": 24, "d_loss": 24, "print": 24, "item": 24, "batches_don": 24, "sample_interv": 24, "save_imag": 24, "png": 24, "nrow": 24, "\ucd5c\ub300\ud654\ud558\uace0": 24, "descent": 24, "\uc9c4\ud589\ud558\uac8c": 24, "\ud559\uc2b5\ud558\uc9c0": 24, "\uc0c1\ud669\uc774": 24, "\ubc1c\uc0dd\ud569\ub2c8\ub2e4": [24, 26], "\ucd5c\uc18c\ud654\ud558\uc9c0": 24, "\ucd5c\ub300\ud654\ud558\ub294": 24, "\uae30\ubc95\ub3c4": 24, "\ucd5c\uc801\ud654\ub41c": 24, "solut": 24, "\uc644\ubcbd\ud788": 24, "\ubcf5\uc6d0\ud558\uace0": 24, "\uc5b8\uc81c\ub098": 24, "\ub0b4\ubc49\uac8c": 24, "proposit": 24, "p_g": 24, "\uc99d\uba85\ud558\uc790\uba74": 24, "\uc190\uc2e4\ud568\uc218\ub97c": [24, 27], "int_x": 24, "int_z": 24, "dz": [24, 28], "\uc77c\ub54c": 24, "\uc131\ub9bd\ud558\uace0": 24, "\uc190\uc2e4\ud568\uc218\ub294": [24, 27], "\uac19\uace0": 24, "ast": 24, "jsd": 24, "\ucd5c\uc19f\uac12\uc740": 24, "\uc131\ub9bd\ud569\ub2c8\ub2e4": 24, "mnist": 24, "toronto": 24, "databas": 24, "tfd": 24, "\ud3c9\uac00\ud588\uc2b5\ub2c8\ub2e4": 24, "\ud3c9\uac00\uc2dc\uc5d0\ub294": 24, "parzen": 24, "densiti": 24, "\uac70\uccd0": 24, "vae\ub294": 24, "\ud750\ub9bf\ud558\ub2e4\ub294": 24, "unstabl": 24, "converg": [24, 25], "\ucc28\uc6d0\ucd95\uc18c\ub85c": 24, "\ud65c\uc6a9\ub418\uace0": 24, "\uc0dd\uc131\ud558\ub294\ub370\ub294": [9, 24], "\ud65c\uc6a9\ub418\uc5c8\ub2e4\uace0": 24, "2205": [2, 25], "11487": 25, "learning\uc774": 25, "\ub354\ubd88\uc5b4": 25, "\ub4e4\uc744": [25, 28], "\ub3c5\ucc3d\uc801\uc778": 25, "\ub9d0\ubb49\uce58": 25, "corpu": 25, "llm\ub4e4\uc758": 25, "embedding\ub4e4\uc740": 25, "\ud6a8\uacfc\uc801\uc774\ub77c\uace0": 25, "\ucda9\uc2e4\ub3c4": 25, "\uc0ac\uc774\uc988\ub97c": 25, "\uc911\uc694\ud558\ub2e4\ub294": 25, "\uc99d\uba85\ud568": [2, 25], "\uc81c\uc2dc\ud558\uc5ec": 25, "weight\uc744": 25, "leverag": 25, "\ub9cc\ub4e4\uc5b4": 25, "\ud604\uc2e4\uc801\uc778": 25, "palett": [25, 26], "\uad6c\uc870\ubcf4\ub2e4": 25, "\uc81c\uc2dc\ud568": 25, "27": 25, "\ub2ec\uc131\ud568": 25, "evaluation\uc6a9": 25, "benchmark": [25, 26], "encoder\uc744": 25, "\ud574\ub193\uc74c": 25, "improv": [9, 25], "sr": 25, "\uc774\ub780": 25, "\ud6a8\uacfc\ub97c": [25, 26], "guidance\uac00": [9, 25], "generation\uc774": 25, "\uc77c\uc815\ud558\uc9c0": 25, "\ubabb\ubc1b\uc544\uc11c": 25, "class\ub098": 25, "object\uc774": 25, "\uc77c\uc815\ud558\uace0": 25, "\ubb34\uc5c7\uc744": 25, "\uc0dd\uc131\ud558\ub294\uac83\uc778\uc9c0": 25, "\uc790\uc138\ud558\uac8c": 25, "guide\uc758": 25, "\ub192\uc774\uba74": 25, "\ubd88\uc77c\uce58\uac00": 25, "\uac00\uc911\uce58\uc758": 25, "\ubc94\uc704": [25, 26], "\uc774\ub3d9\uc2dc\ucf1c": 25, "\uc544\uc608": 25, "\ube57\ub098\uac00": 25, "\uc774\uc0c1\ud55c": 25, "satur": 25, "\ub35c\ud55c": 25, "\ub40c": 25, "\ud574\uacb0\ud558\uace0\uc790": 25, "\ubc31\ubd84\uc704\uc218": 25, "\uc808\ub300": 25, "\uc9c0\uc815\ud558\uace0": 25, "s\ub85c": 25, "\ub098\ub208\ub2e4": 25, "90": 25, "\uc9c0\uc810\uc758": 25, "among": 25, "net\uc774\ub77c\ub294": 25, "net\uc5d0\uc11c": 25, "\uc5ec\ub7ec\uac00\uc9c0": 25, "modification\uc744": 25, "\ud558\uc600\ub2e4\uace0": 25, "effu": 25, "net\uc740": 25, "\uc758\ub8cc\ucabd\uc73c\ub85c": 25, "\uc788\ub294\uac78\ub85c": 25, "\uc544\ub294\ub370": 25, "remov": 25, "keep": 25, "connect": 25, "scaling\uc744": 25, "\ud558\uc5ec": [2, 25], "block\uc5d0\uc11c": 25, "blocks\ub97c": 25, "\ucd94\uac00\ud568": 25, "\ubca4\uce58\ub9c8\ud06c": 25, "\ub370\uc774\ud130\uc14b\uc740": [9, 25], "categori": 25, "\uc774\ub8e8\uc5b4\uc84c\ub2e4": 25, "\uae43\ud5c8\ube0c\uc5d0\uc11c": 25, "\ub2e4\uc6b4": 25, "\ubc1b\uc744": 25, "\uac17\ub2e4": 25, "25\uba85\uc758": 25, "\ud3c9\uac00\uc790": 25, "a\uc5d0\uc11c": 25, "\ud3c9\uac00\uc790\ub294": 25, "\uc9c8\ubb38\uc744": 25, "\uae30\uc900\uc810\uc73c\ub85c": 25, "q1": 25, "higher": 25, "q2": 25, "repres": 25, "\uae30\uc900\uc810": 25, "\ub2f5\ubcc0": 25, "\uc120\ud0dd\ud574\uc57c\ud568": 25, "am": 25, "indiffer": 25, "screenshot": 25, "drawbench\uc5d0\uc11c": 25, "\uccb4\ub9ac\ud53c\ud0b9": 25, "\uc5c6\uc774\ub3c4": 25, "\uce74\ud14c\uace0\ub9ac\uc5d0\uc11c\ub3c4": 25, "\uc8fc\uc7a5\uc778": 25, "\ubaa8\ub378\ub4e4": [2, 25], "peopl": 25, "\uc62c\ub77c\uac10": 25, "people\uc744": 25, "\uc0dd\uc131\ud558\uae30\uc5d0": 25, "rater": 25, "xxl\ub85c": 25, "\uc120\ud638\ud568": 25, "\ubc1b\uc74c": [2, 25], "evaul": 25, "\uc911\uc694\ud568": 25, "\uc0ac\uc774\uc988\uc758": 25, "\ub07c\uce68": 25, "boost\uc5d0": 25, "thresholding\uc744": 25, "\ub04c\uc5b4": 25, "\uc62c\ub9b4": 25, "allow": 25, "usag": 25, "much": 25, "editbench": 26, "advanc": 26, "inpaint": [2, 26, 27], "06909": 26, "06": 26, "\uc2dc\uac04\uc5d0\ub294": [26, 27], "googl": 26, "\uc18c\uac1c\ud558\ub294": [26, 27, 28], "impaint": [9, 26], "\ud3c9\uac00\uae30\ubc95": 26, "\uc608\uc815\uc785\ub2c8\ub2e4": [26, 27], "\uae30\uc874\uc5d0\ub294": 26, "\uc601\uc5ed\uc744": 26, "\uc9c0\uc815\ud558\uc5ec": 26, "\ucc38\uc870\ud558\uc9c0": 26, "\uc624\ub85c\uc9c0": 26, "\ub9cc\uc73c\ub85c": 26, "\ucc38\uc870\ud560": [9, 26], "\uc720\ub3c4\ud558\ub294": 26, "ssd": 26, "mobilenet": 26, "v2": 26, "detector": 26, "\uac1c\uc120\ub418\ub294": 26, "\ud2b9\uc9d5\uc740": 26, "cascad": 26, "\uc810\uc785\ub2c8\ub2e4": 26, "sr3": 26, "\ud558\uba74\uc11c": 26, "\uac00\uc9c4\ub2e4\uace0": 26, "\uc791\uc5c5": 26, "\uc785\ub825\ud569\ub2c8\ub2e4": [26, 27], "\ub0b4\uae30": 26, "\ucd94\uac00\ub418\ub294": 26, "\ucd08\uae30\ud654\ud574\uc11c": 26, "\uc18c\uac1c\ub418\uc5c8\ub358": 26, "1\ubd80\ud130": 26, "\ubcc0\ud654\uc2dc\ud0a4\ub294": 26, "oscil": 26, "\uc801\uc6a9\ud568\uc73c\ub85c\uc368": 26, "\ud004\ub9ac\ud2f0": 26, "\uc0c1\uc2b9\ub418\ub294": 26, "240\uac1c\uc758": 26, "\uc30d\uc73c\ub85c": [9, 26], "\uad6c\ucd95\ub418\uc5b4\uc788\uace0": 26, "\uc30d\ub9c8\ub2e4": 26, "3\uac00\uc9c0\uc758": 26, "\uce21\uc815\ud558\uac8c": 26, "\uc73c\ub85c\ub294": [26, 27], "clipscor": 26, "prec": 26, "\uc808\ubc18\uc740": 26, "open": 26, "\ub370\uc774\ud130\uc14b\uc73c\ub85c\ubd80\ud130": 26, "\uc218\uc9d1\ub418\uc5c8\uace0": 26, "\uc0dd\uc131\ud574\uc11c": 26, "\uad6c\ucd95\ud588\uc2b5\ub2c8\ub2e4": 26, "\uc694\uc18c\ub4e4\uc744": 26, "\uac16\ucd94\ub3c4\ub85d": 26, "\uc0dd\uc131\ud588\uc2b5\ub2c8\ub2e4": 26, "materi": 26, "common": 26, "render": 26, "indoor": 26, "outdoor": 26, "\ub4e4\uc5b4\uc11c": 26, "metal": 26, "\ubb38\uad6c\ub97c": 26, "stand": 26, "farm": 26, "\ud574\ub2f9\uc0ac\uc9c4\ucc98\ub7fc": 26, "rich": 26, "\uad6c\ucd95\uc2dc": 26, "\ud06c\uae30\ub3c4": 26, "\ub2e4\uc591\ud558\uac8c": 26, "\uc124\uc815\ud558\uc5ec": [9, 26], "\ud06c\uae30\uc5d0": [9, 26], "\uce21\uc815\ud574\ubcf8": 26, "medium": 26, "\uc131\ub2a5\uc801\uc73c\ub85c": 26, "\uc800\ud558\ub418\ub294": [26, 27], "\uc18d\uc131\ubcf4\ub2e4": 26, "\uc18d\uc131\uc5d0": 26, "\ucde8\uc57d\ud55c": 26, "failur": 26, "\uc0ac\uc9c4\uc785\ub2c8\ub2e4": [26, 27], "maskrich": 26, "dig": 27, "more": 27, "08453": 27, "tencent": 27, "arc": 27, "\ube44\ub86f\ud55c": 27, "\ub09c\ud574\ud55c": 27, "car": 27, "fly": 27, "wing": 27, "iron": 27, "man": 27, "bunni": 27, "ear": 27, "\uc785\ub825\ubc1b\uc744": 27, "textur": 27, "\ud45c\ud604\ud558\uae30": 27, "\ub9cc\uc73c\ub85c\ub294": 27, "\ud544\uc694\ud558\ub2e4\uace0": 27, "\uc11c\uc220\ud569\ub2c8\ub2e4": 27, "intern": 27, "knowledg": 27, "extern": 27, "\uc18c\uac1c\ud558\uace0": 27, "5\uac00\uc9c0": 27, "plug": 27, "plai": 27, "77m": 27, "300m": 27, "\uc5f0\uc0b0\uc791\uc5c5\uc774": 27, "\uc2e4\ud589\ub429\ub2c8\ub2e4": 27, "\uac00\uc838\uc624\uae30": 27, "\uc6a9\ub7c9\uc774": 27, "\ud06c\uace0": 27, "flexibl": 27, "compos": 27, "generaliz": 27, "\uae30\ubc18\uc774": 27, "autoencod": [27, 28], "\ubc14\uafb8\uace0": 27, "\ubcf5\uc6d0\ud558\ub294": 27, "_2": 27, "bar": 27, "_t": [2, 27], "z_0": 27, "\uc785\ub825\ud568\uc73c\ub85c\uc368": 27, "matric": 27, "\uac00\uc9c0\uba70": [9, 27], "unshuffl": 27, "\ubcc0\ud658\uc774": 27, "1\uac1c\uc758": 27, "4\ubc88": 27, "\ud1b5\uacfc\ud558\uac8c": 27, "\uac70\uce58\uace0": 27, "f_c": 27, "\uc0dd\uc131\ub418\uace0": 27, "\uc5d0\uc11c\uc758": [2, 27], "intermedi": 27, "f_": 27, "enc": 27, "\ub354\ud574\uc9c0\uac8c": 27, "\ub3d9\uc77c\ud558\ub3c4\ub85d": 27, "\uc124\uc815\ud588\uae30": 27, "\ub367\uc148": 27, "\uc5f0\uc0b0\ud558\ub294\ub370": 27, "fulladapt": 27, "in_channel": 27, "320": 27, "640": 27, "1280": 27, "num_res_block": 27, "downscale_factor": 27, "pixelunshuffl": 27, "conv_in": 27, "kernel_s": 27, "bodi": 27, "adapterblock": 27, "total_downscale_factor": 27, "out_channel": 27, "downsample2d": 27, "in_conv": 27, "adapterresnetblock": 27, "act": 27, "relu": [27, 28], "adapter_st": 27, "adapter_input": 27, "adapter_conditioning_scal": 27, "num_images_per_prompt": 27, "repeat": 27, "do_classifier_free_guid": 27, "num_warmup_step": 27, "order": 27, "total": [2, 27], "latent_model_input": 27, "scale_model_input": 27, "noise_pr": 27, "prompt_emb": 27, "cross_attention_kwarg": 27, "down_block_additional_residu": 27, "state": [2, 27], "noise_pred_uncond": 27, "noise_pred_text": 27, "previou": 27, "extra_step_kwarg": 27, "prev_sampl": 27, "\uc885\ub958\ub85c\ub294": 27, "\ubd84\ub958\ud560": 27, "sketch": [2, 27], "segment": 27, "keypos": 27, "bicub": 27, "\uc81c\uc678\uc2dc\ud0a4\uace0": 27, "nearest": 27, "\ud06c\uae30\ub85c": 27, "\ubd80\ubd84\ucc98\ub7fc": 27, "\uc815\uc758\ud558\uac8c": [27, 28], "\uace0\uc815\uc2dc\ud0a8": [9, 27], "\ud30c\ub77c\ubbf8\ud130\ub9cc": 27, "t2": 27, "\uc2dc\uc640": 27, "dure": 27, "\ub123\uc73c\uba74\uc11c": 27, "\ub9c8\ub2e4": [2, 27], "expens": 27, "late": 27, "\uc2e4\ud5d8\ud574\ubcf8": 27, "\ud06c\ub2e4\uace0": 27, "earli": 27, "\ud3ec\ud568\ub418\ub3c4\ub85d": 27, "\uc218\uc2dd\ucc98\ub7fc": 27, "uniformli": 27, "\uc9c4\ud589\ud588\uace0": 27, "cubic": 27, "\uc0c1\uc138\uc0ac\ud56d\uc740": 27, "4x": 27, "tesla": 27, "32g": 27, "v100": 27, "dai": 27, "\uc2e4\ud5d8\ubcc4": 27, "coco17": 27, "164k": 27, "pidinet": 27, "stuff": 27, "keypoint": 27, "aesthet": 27, "\ub370\uc774\ud130\uc14b\ub85c\ubd80\ud130": 27, "600k": 27, "\ucd94\ucd9c": 27, "mm": 27, "mida": 27, "\ubaa8\ub378\ub4e4\uacfc": 27, "\uc815\ub7c9\uc801\uc778": 27, "\uc218\uce58\ub85c": 27, "\ube44\uad50\ud558\ub294\ub370": 27, "\uc0ac\uc6a9\ud558\uc600\uace0": [9, 27], "\ud558\ub2e8": 27, "\uc0ac\uc9c4\ucc98\ub7fc": 27, "\uc88b\uc2b5\ub2c8\ub2e4": 27, "quantit": [2, 9, 27], "comparisoin": 27, "\uc608\uc2dc\ub4e4\uc740": 27, "\uc815\ud655\ud558\uc9c0": 27, "\uc9c0\uc5ed\uc744": 27, "\ubabb\ud558\ub2e4\uace0": 27, "\uac83\ub85c": 27, "\uc704\uc5d0\uc11c\ubd80\ud130": 27, "\uc7a5\uc810\ub4e4": 27, "\uba85\uc2dc\ub418\uc5c8\ub358": 27, "\uc0ac\ub840\uc785\ub2c8\ub2e4": 27, "\uc644\ub8cc\ud55c": 27, "\uc801\uc6a9\ud558\uba74\uc11c": 27, "4\ubcf4\ub2e4": 27, "\uc791\uc744": [2, 27], "\uc801\uc6a9\ud588\uc2b5\ub2c8\ub2e4": 27, "\uacbd\ub7c9\ud654\ub41c": 27, "\uc608\uc2dc\ucc98\ub7fc": 27, "\uc22b\uc790\ub97c": 27, "\ubc14\uafd4\uac00\uba70": 27, "tini": 27, "x4": 27, "x8": 27, "compress": 27, "auto": 28, "bay": 28, "1312": 28, "6114": 28, "gunhochoi": 28, "fastcampu": 28, "ch": 28, "\ubb38\uad6c\uac00": 28, "\uc801\ud600\uc788\ub294\ub370\uc694": 28, "bayesian": 28, "vb": 28, "approach": [28, 29], "involv": 28, "\uc81c\uc2dc\ud558\ub294": 28, "aevb": 28, "\uc54c\uace0\ub9ac\uc998": 28, "\ub274\ub7f4": 28, "\ub124\ud2b8\uc6cc\ud06c\ub85c": 28, "\uadfc\uc0ac\ud568\uc73c\ub85c\uc368": 28, "\uc774\uac00": 28, "\ubc14\uac00": 28, "\ubd80\ubd84\uc73c\ub85c": 28, "\ub9cc\ub4e4\uc5b4\ub0b4\uace0": 28, "\ubcf5\uc6d0\ud558\uac8c": 28, "assumpt": 28, "\ub0b4\ub9bd\ub2c8\ub2e4": 28, "\uccab\ubc88\uc9f8\ub85c": 28, "parametr": 28, "\ud558\ub2e4\ub294": 28, "\ub530\ub974\uace0": 28, "\uc131\uc9c8\uc5d0": 28, "bernoulli": 28, "\ub530\ub974\ub3c4\ub85d": 28, "\uacc4\uc0b0\uc774": 28, "\ucd5c\ub300\ud654\uc2dc\ud0a4\ub294": 28, "\uad6c\ud558\ub294": [9, 28], "\uacc4\uc0b0\ud558\uae30": 28, "\ub4f1\uc7a5\ud558\uac8c": 28, "\uadfc\uc0ac\ud654\ud558\ub294": 28, "\ub124\ud2b8\uc6cc\ud06c": 28, "\ub3c4\uc2dd\ud654\ud55c": 28, "\uc815\ub9ac\ud558\uc790\uba74": 28, "\uacc4\uc0b0\ub41c": 28, "\ud655\uc778\ud574\ubcf4\uaca0\uc2b5\ub2c8\ub2e4": 28, "fc1_1": 28, "784": 28, "hidden_s": 28, "fc1_2": 28, "log_var": 28, "reparametr": 28, "std": 28, "mul": 28, "exp_": 28, "ep": 28, "floattensor": 28, "cuda": 28, "add_": 28, "reparam": 28, "fc1": 28, "\ucc3e\uc73c\uba74": 28, "\ubd84\ud560\ud560": 28, "min_": [2, 28], "g_": 28, "\uc720\uc0ac\ud558\ub3c4\ub85d": 28, "\uc7a0\uc7ac\ubcc0\uc218\uc758": 28, "\uc800\ud76c\uac00": 28, "\ubd80\uc5ec\ud55c": 28, "\uac00\uae5d\ub3c4\ub85d": 28, "\uc124\uc815\ud558\ub294": 28, "mont": 28, "carlo": 28, "\uadfc\uc0ac\uac12\uc744": 28, "\uad6c\ud560": [9, 28], "\uc5f0\uc0b0\ub7c9\uc774": 28, "\ub9ce\uc73c\ubbc0\ub85c": 28, "\uc124\uc815\ud569\ub2c8\ub2e4": 28, "\uae30\ubc95\uc740": 28, "\uc0d8\ud50c\ub9c1\ud558\uc9c0": 28, "backpropag": 28, "\ub354\ud558\uace0": 28, "\uacf1\ud558\uac8c": 28, "\ub530\ub978\ub2e4\uace0": 28, "\uc124\uc815\ud588\uc744": 28, "\ub54c\uc774\uace0": 28, "\uac00\uc815\ud560": 28, "\uc2dc\ub3c4\ud560": 28, "\uba85\uc2dc\ub418\uc5b4": 28, "\uc9c0\uc815\ud574\uc92c\ub2e4\uba74": 28, "\ud30c\ub77c\ubbf8\ud130\ub4e4\uacfc": 28, "\uc7a0\uc7ac\ubcc0\uc218\ub97c": 28, "\uc0ac\uc6a9\ud574\ubcf4\uba74": 28, "repositori": 29, "pseudo": 29, "team": 29, "bulb": 29, "aim": 29, "them": 29, "theoret": 29, "conduct": 29, "experi": [2, 29], "\ucc38\uc5ec": 29, "\ub9e4\uc8fc": 29, "\uc218\uc694\uc77c": 29, "\uc624\ud6c4": 29, "9\uc2dc": 29, "\uac00\uc9dc\uc5f0\uad6c\uc18c": 29, "discord": 29, "room": 29, "dh": 29, "\uc785\uc7a5": 29, "brownian": 2, "bridg": 2, "07680": 2, "xuekt98": 2, "linkedin": [], "seonhoonkim": [], "nov": 2, "\uadf9\ubcf5\ud568": 2, "\uc2dc\uac04\uc758": 2, "\ud750\ub984\uc5d0": 2, "\ubd88\ud655\uc2e4\uc131\uc744": 2, "\ubcc0\ud558\ub294": 2, "\ubcc0\uc218\ub4e4\uc758": 2, "\uc9d1\ud569": 2, "\ubcc0\uc218\ub97c": 2, "\ubcc0\uc218\uac00": 2, "\uad00\ucc30\ub41c": 2, "\uad6c\ubd84\ud560": 2, "motion": 2, "wiener": 2, "\uc720\uccb4\uc758": 2, "\ubbf8\uc18c\uc785\uc790\uac00": 2, "\ubd88\uaddc\uce59\ud558\uac8c": 2, "\uc6b4\ub3d9\ud558\ub294": 2, "\uad74\ub69d\uc5d0\uc11c": [], "\ud37c\uc838\ub098\uac04": [], "\uc5f0\uae30": [], "\uc624\ub978\ucabd\uc73c\ub85c": [], "90\ub3c4": [], "\ud68c\uc804\uc2dc\ud0a8": [], "\uc5f0\uc18d": 2, "\uc774\ud574\ud574\ubcf4\uc790": 2, "\uac00\uc815\ud574\ubcf4\uc790": 2, "\ud558\uc790": 2, "\uc774\ud574\ud558\uae30": 2, "\ud558\ub2e4\uace0": 2, "\ubd80\uc5ec\ub418\uc5b4\uc57c": 2, "\uac04\uaca9\uacfc": 2, "\ube44\ub840\ud574\uc57c": 2, "notat": 2, "ld0rxwajpkm": 2, "finrgb": 2, "\uac04\uaca9": 2, "\uc0b4\ud3b4\ubcf4\uace0\uc790": 2, "\uac04\uaca9\uc758": 2, "epsilon_t": [2, 9], "\uc2dc\uc810\uc5d0\uc11c": 2, "\uac04\uaca9\uae4c\uc9c0": 2, "\uc815\uc758\ud574": 2, "\uadfc\uac70\ub97c": 2, "\ucc3e\uc544\ubcf4\uba74": 2, "\ubcc0\uc218": 2, "\ub3c4\uc785\ud568\uc73c\ub85c\uc368": 2, "\ubd80\uc5ec": 2, "\uac04\uaca9\ub3c4": 2, "\ud558\ud544": 2, "\uacf1\ud588\uc744\uae4c": 2, "\uac00\uae4c\uc6cc\uc9c8": 2, "\ucc9c\ucc9c\ud788": 2, "\uc218\ub834": 2, "\ud558\ub2e4\uba74": 2, "\ub77c\uba74": 2, "\uc791\uc544\uc9d0": 2, "\ucee4\uc9c8": 2, "\ucee4\uc9d0": 2, "\uc8fc\uc758\ud560": 2, "\uc0ac\ud56d": 2, "w_1": 2, "\ub3c5\ub9bd": 2, "\ub9de\uc9c0\ub9cc": 2, "\ub3c5\ub9bd\uc774\ub77c\ub294": 2, "\uc544\ub2d8": 2, "epsilon_0": 2, "var": 2, "\uacf5\ubd84\uc0b0\uc740": 2, "\uc810\ub4e4\uc740": 2, "\ubcf4\ub77c\uc0c9": 2, "\uc810\ucc98\ub7fc": 2, "\ud655\ub960\uc5d0": 2, "\uc874\uc7ac\ud560": 2, "\uc218\ud589\ud558\uba74": 2, "\ubcc0\ud55c\ub2e4": 2, "t_2": 2, "t_1": 2, "10\ubd84\uc73c\ub85c": 2, "\uc9c4\ud589\ud558\uba74": 2, "w_5": 2, "\uc544\ub2d0": 2, "\uc788\uc73c\ub098": 2, "\ubcc0\ud654\ub7c9": 2, "t_5": 2, "\ub530\ub978\ub2e4": 2, "\uc2dc\uc810\uacfc": 2, "\uc54c\uace0": 2, "\ubb34\uc5c7\uc77c\uae4c": 2, "sine": 2, "qua": 2, "158": 2, "\uc120\ud615\uc73c\ub85c": 2, "\uc5f0\uacb0\ub41c": 2, "\uc2dc\uc810": 2, "\uac12\uc778": 2, "\ud45c\ud604\ud574\ubcf4\uc790": 2, "\uc77c\uae4c": 2, "\uadf8\ub7ec\uae30": 2, "\uc774\uc5b4\uc57c": 2, "\ud3b8\ucc28\uc758": 2, "\uc81c\uacf1\uc758": 2, "\ud3c9\uade0\uc758": 2, "\uc81c\uacf1": 2, "\uc5f0\uacb0\ud55c": 2, "\ub9cc\ub4e4\uae30": 2, "\uc6b0\ubcc0\uc5d0": 2, "\ub354\ud574\ubcf4\uc790": 2, "\ub3c5\ub9bd\uc778": 2, "\uc2dd\uc5d0\ub294": 2, "\ub300\uc785\ud574\ub3c4": 2, "\ub098\uc624\uace0": 2, "\uc5f0\uacb0\ud558\ub294": 2, "\ub2e4\ub9ac\uac00": 2, "\ub85c\uc11c\uc758": 2, "\uc131\uc9c8": 2, "\uc99d\uba85\ud558\uae30": 2, "\ud45c\uc900\uc815\uaddc\ubd84\ud3ec\ub97c": 2, "\uc815\uaddc\ubd84\ud3ec": 2, "\ud3c9\uade0\uc740": 2, "\ub3c5\ub9bd\uc774\ubbc0\ub85c": 2, "t_0": 2, "\uc810": 2, "abstrcat": 2, "\ubcc0\ud658\uc744": 2, "\ub2e4\ub8f8": 2, "\uc0c1\uc774\ud55c": 2, "\ubaa8\ub378\ub9c1\ud558\ubbc0\ub85c": 2, "bidirect": 2, "\uc784": 2, "\ubcc0\ud658\uc5d0": 2, "\uc811\ubaa9\ud55c": 2, "\ub17c\ubb38\uc784": 2, "introduct": 2, "i2i": 2, "\ubcc0\ud658\uc5d0\uc11c": 2, "fideltii": 2, "\ub192\uc558\uc73c\ub098": 2, "\ub5a8\uc5b4\uc9c4\ub2e4": 2, "\uc548\ub098\uc624\uace0": 2, "applic": 2, "\ud1b5\ud569": 2, "\uc2dc\ud0b4\uc73c\ub85c\uc368": 2, "desir": 2, "\ucd94\ub860\ud574\ub0b8\ub2e4\ub294": 2, "\uba85\ub8cc\ud55c": 2, "\uc774\ub860\uc801": 2, "\uadfc\uac70\uac00": 2, "\uc548\ub418\ubbc0\ub85c": 2, "\uc5d0\uc11c\ub9cc": 2, "\uc218\ud589\ud568\uc73c\ub85c\uc368": 2, "\ud558\uae34": 2, "\ud588\uc73c\ub098": 2, "\uc8fc\uc5b4\uc9c0\ubbc0\ub85c": 2, "\uc81c\uc2dc\ud558\uae30\uac00": 2, "\ud798\ub4e6": 2, "\uac00\uc18d\uc744": 2, "\uc218\ud589\ud568": 2, "duffus": 2, "simplifi": 2, "\ub4dc\ub7ec\ub098": 2, "\uc54a\uc73c\ubbc0\ub85c": 2, "\ub3c4\ub2ec\ud560": 2, "\ubcf4\uc7a5\uc774": 2, "\ub3d9\uc548\uc758": 2, "start": 2, "\uc774\uc5c8\ub2e4": 2, "\ubc14\uafd4\ubcf4\uc790": 2, "\ud5a5\ud574": 2, "vqgan": 2, "\uc601\uc0c1\uc758": 2, "\u03b4_t": 2, "\ub098\ud0c0\ub09c": 2, "\uc0ac\uc6a9\ud558\uac8c": 2, "\ub418\uba74": 2, "\ubd84\uc0b0\uac12": 2, "\ubd84\uc0b0\uac12\uc778": 2, "\u03b4_": 2, "\ucee4\uc9c0\uba74": 2, "\ubd84\uc0b0\uac12\ub3c4": 2, "\ucee4\uc9c0\ub294\ub370": 2, "\ub2e4\ub8e8\uae30\uc5d0": 2, "\uc774\uba74\uc11c": 2, "\ub3c5\ub9bd\uc77c": 2, "\uc815\uc218\uc758": 2, "\ucd5c\ub313\uac12\uc778": 2, "delta_t": 2, "\uc2dc\uc791\ud558\ub294": 2, "m_0": 2, "\ubd84\uc0b0\uc740": 2, "\ub05d\ub098\ub294": 2, "m_t": 2, "\uc9c0\uc810\uae4c\uc9c0\ub294": 2, "\ud558\ub2e4\uac00": 2, "\uc9c0\uc810\ubd80\ud130": 2, "\uac10\uc18c": 2, "\ubd84\uc0b0\uac12\uc5d0": 2, "\uacb0\uc815": 2, "\uc2a4\ucf00\uc77c\ub9c1\ud558\ub294": 2, "\uc870\uc808": 2, "\ub514\ud3f4\ud2b8": 2, "\uc11c\ub294": 2, "transit": 2, "bb": 2, "\uc54c\uc544\uc57c\ud568": 2, "m_ty": 2, "m_": 2, "\uc4f0\ub294": 2, "\uc633\uc74c": 2, "\uc720\ub3c4\ub428": 2, "\uc99d\uba85": 2, "\ub300\uc785": 2, "\uad6c\ud558\uba74": 2, "\uc544": 2, "\ud655\uc2e4\ud788": 2, "\ub3c4\uba54\uc778\uc73c\ub85c\ubd80\ud130": 2, "\ub3c4\uba54\uc778\uc73c\ub85c\uc758": 2, "\uc815\uc758\ud558\ub294\uad6c\ub098": 2, "\uc81c\uac70\ud574\ub098\uac10": 2, "\ub460\uc73c\ub85c\uc368": 2, "\uc790\uccb4\uc5d0\uc11c": 2, "\ud3c9\uade0\uac12": [], "\ub178\uc774\uc988\uc758": 2, "\ubb34\uc2dc\ud560": 2, "\ubca0\uc774\uc988": 2, "\uc774\ub860\uacfc": 2, "\ub3c4\ucd9c": 2, "\uc131\ub9bd\ub428\uc744": 2, "\uc815\ub9ac\ub428": 2, "\ud1b5\ud569\ud558\uace0reparameter": 2, "mu_t": 2, "\ubcc0\ud615\ud560": 2, "\ud559\uc2b5\ub428": 2, "\uc2dd\uc5d0": 2, "\uba85\uc2dc\ud558\uae30": 2, "\uba85\uc2dc\ub41c": 2, "\ubcc0\ud615": 2, "\ud574": 2, "14": 2, "\uadfc\uc0ac\ud558\ub3c4\ub85d": 2, "\ud559\uc2b5\ub418\uc5b4\uc57c\uaca0\ub124": 2, "\ub2e8\uc21c\ud654\ub420": 2, "\uac00\uc18d\uc2dc\ud0ac": 2, "\uae38\uc774\ub97c": 2, "\ub450\uc5c8\uc744": 2, "varibal": 2, "subset": 2, "\uc815\uc758\ub428": 2, "atent": 2, "\ub450\uc5c8\uc74c": 2, "\ud558\uc774\ud37c\ub9c8\ub77c\ubbf8\ud130": 2, "\ud504\ub808\uc784\uc6cc\ud06c\ub294": 2, "\uc774\ub8e8\uc5b4\uc9d0": 2, "lpip": 2, "\uc0dd\uc131\ubb3c\uc758": 2, "\ub9c8\ub2e4\uc758": 2, "\ud45c\uc900\ud3b8\ucc28\uc758": 2, "\uad6c\ud568": 2, "\uc2e4\ud5d8\ud568": 2, "celebamask": 2, "\uc8fc\uace0": [2, 9], "edges2sho": 2, "edges2handbag": 2, "faces2com": 2, "\ud3c9\uac00\ud588\ub2e4\uba74": 2, "\uc2e4\ud5d8\uc5d0\uc11c\ub294": 2, "\ud559\uc2b5\ud558\ubbc0\ub85c": 2, "cyclegan": 2, "\uc2a4\ucf00\uc77c\uc758": 2, "\ub5a8\uc5b4\uc9d0": 2, "drit": 2, "\uc911\uc5d0\uc11c\ub294": 2, "\ub0c8\uc73c\ub098": 2, "\ubcc0\ud658\ub41c": 2, "oversmooth": 2, "\uacfc\ub294": 2, "\uba40\uc5c8\uc74c": 2, "cde": 2, "\ubaa8\ub378\ub4e4\ubcf4\ub2e4\ub294": 2, "rregular": 2, "occlus": 2, "\ub098\ud0c0\ub098\ub294\ub370": 2, "\uc9c1\uc811\uc801\uc778": 2, "\ubb38\uc81c\ub85c\ubd80\ud130": 2, "\uc790\uc720\ub85c\uc6c0": 2, "\ud2b9\uc131\uc73c\ub85c": 2, "\uc0dd\uc131\ud574\ub0c4": 2, "\uae30\ub85d\ud588\uc73c\uba70": 2, "\uac80\uc99d": 2, "\uc2e4\ud5d8\ud588\uc74c": 2, "campar": 2, "\uae30": 2, "\ub85d\ud568": 2, "factor": 2, "\uc870\uae08\ub9cc": 2, "\ub298\ub824\ub3c4": 2, "conclus": 2, "futur": 2, "\uc5d0\ub3c4": 2, "\uc801\uc6a9\ud574\ubcfc": 2, "\uc608\uc815": 2, "toward": 9, "10741": 9, "sehwan": 9, "e\ubcf4\ub2e4": 9, "\ud3c9\uac00\uac00": 9, "\uc6b0\uc218\ud558\ub2e4\uace0": 9, "powerful\ud55c": 9, "editing\uc774": 9, "natur": 9, "language\ub85c": 9, "realistic\ud55c": 9, "\uc0dd\uaca8\ub098\uace0": 9, "prompts\uc5d0": 9, "\uc815\ud655\ud788": 9, "\ub300\uc751\ud558\ub294": 9, "photorealistic\ud55c": 9, "\uc0dd\uc131\ud558\uae30\uc5d0\ub294": 9, "\uacaa\uace0": 9, "\uc911\uc2ec\uc73c\ub85c": 9, "\ub5a0\uc624\ub974\uba70": 9, "unconditional\ud55c": 9, "\ucc0d\uc5c8\ub2e4\uace0": 9, "conditional\ud55c": 9, "\uc774\ub8e8\uc5b4\uc84c\ub294\ub370": 9, "beat": 9, "synthesis\ub77c\ub294": 9, "noise\ud55c": 9, "class\ub97c": 9, "\ucd94\uac00\ud558\uc5ec": 9, "sampling\uacfc\uc815\uc5d0\uc11c": 9, "label\uc5d0": 9, "control\uc2dc\ud0a4\ub294": 9, "classifier\uc5c6\uc774": 9, "\uc18c\uac1c\ub418\uc5c8\ub2e4": 9, "synthesis\ub97c": 9, "guidance\ub77c\ub294": 9, "\uc81c\uc2dc\ud558\uba70": 9, "guidance\uc640": 9, "\ube44\uad50\ub97c": 9, "\uacb0\uacfc\uc801\uc73c\ub85c\ub294": 9, "shot\uc73c\ub85c": 9, "\uc0dd\uc131\ud558\ub294\ub370\uc5d0": 9, "\ubcf4\uc600\uc73c\ub098": 9, "photorealistc\ud55c": 9, "\uacaa\uc744": 9, "generation\ubfd0\ub9cc": 9, "\ud3b8\uc9d1\ud560": 9, "impainting\uae30\ub2a5\ub3c4": 9, "\uaef4\uc788\ub294": 9, "\uc5bc\ub9cc\ud07c\uc778\uc9c0": 9, "constant\ud55c": 9, "\uace0\uc815\uc2dc\ud0a8\ub2e4": 9, "process\uc640": 9, "alpha_t": 9, "epsilon\uc744": 9, "\ubc29\ud5a5\uc131\uc744": 9, "\ub764\ub2e4\ub77c\uace0": 9, "\uc8fc\uc7a5\ud55c\ub2e4": 9, "proof": 9, "proport": 9, "relationship": 9, "find": 9, "constant\uac12\uc73c\ub85c": 9, "step\ub9cc\uc73c\ub85c": 9, "\uc81c\uc2dc\ud55c\ub2e4": 9, "dharwial": 9, "image\uc0dd\uc131\uc744": 9, "\ub17c\ubb38\uc5d0\uc11c\uc758": 9, "guidance\uc774\ub2e4": 9, "image\ub85c\ubd80\ud130": 9, "\uc720\uc9c0\ud558\ub418": 9, "\uc18d\ud558\ub294\uc9c0": 9, "\uc124\uc815\ud55c\ub2e4": 9, "\uacfc\uc815\uc758": 9, "score\uc5d0\uac8c": 9, "\uc18c\uac1c\ub418\uc5c8\ub294\ub370": 9, "classifiy\ub97c": 9, "\ud574\uc57c\ud558\ubbc0\ub85c": 9, "\uc5c6\uace0": 9, "heavy\ud574\uc9c0\ub294": 9, "\ubc29\ubc95\uc5d0": 9, "\uac1c\uc120\uc810\uc744": 9, "\uc2dd\uc5d0\uc11c": 9, "model\ub9cc\uc73c\ub85c": 9, "clip\uc740": 9, "representation\uc744": 9, "\uc774\ub8e8\uc5b4\uc9c4": 9, "\uc9c4\ud589\uc2dc\ud0a8": 9, "pair\uc5d0": 9, "\uc720\uc0ac\ub3c4": 9, "\ucee4\uc9c0\ub3c4\ub85d": 9, "\uc791\uc544\uc9c0\ub3c4\ub85d": 9, "guidance\uc5d0\uc11c\ub294": 9, "guidance\uc5d0\uc11c": 9, "classifier\ub300\uc2e0\uc5d0": 9, "clip\ubaa8\ub378\uc744": 9, "\ubc29\uc2dd\ub3c4": 9, "classifier\ub300\uc2e0": 9, "\uad6c\ud55c": 9, "text\uac04\uc758": 9, "\uc720\uc0ac\ub3c4\ub97c": 9, "billion": 9, "\uc99d\uac00\uc2dc\ud0a4\ub294\ub370": 9, "base\ub85c": 9, "\uc218\ud589\ud574\uc57c\ud55c\ub2e4": 9, "k\uac1c\uc758": 9, "encoding\ud55c": 9, "input\uac12\uc73c\ub85c": 9, "\ub123\uc5b4\uc900\ub2e4": 9, "output\uc758": 9, "encoding\uc744": 9, "\uc5f0\uc0b0\ud558\uace0\uc790": 9, "projection\ud558\uc5ec": 9, "\ub354\ud55c": 9, "adain\uae30\ubc95\uc744": 9, "block\uc758": 9, "\ub3c4\ucd9c\ud55c\ub2e4": 9, "layer\ub294": 9, "block\ub4a4\uc5d0": 9, "\ubd99\ub294": 9, "e\uc640": 9, "architecture\ub85c\ub294": 9, "up\ub41c": 9, "2b": 9, "paremeters\ub97c": 9, "transformer\ub97c": 9, "upsampling\ud558\ub294": 9, "model\ub3c4": 9, "\ud559\uc2b5\uc2dc\ucf30\ub2e4\uace0": 9, "ddpm\uc5d0\uc11c\uc758": 9, "upsampler\uc640": 9, "\ube44\uc2b7\ud558\ub2e4\uace0": 9, "\uc9c4\ud589\ud588\uc744\ub54c\ub294": 9, "generation\uc5d0": 9, "generation\uc758": 9, "condition\uc5d0": 9, "sequence\ub97c": 9, "impainting\uc744": 9, "\uac70\uce58\uc9c0": 9, "\uc54a\uc558\ub2e4": 9, "\uc54c\ub824\uc9c4": 9, "\uc601\uc5ed\uc5d0": 9, "\ub300\uccb4\ud558\ub294": 9, "\uc0ac\uc6a9\ud588\uae30\uc5d0": 9, "\uc5c6\ub2e4\ub294": 9, "tuning\uacfc\uc815\uc5d0\uc11c": 9, "example\uc758": 9, "\uc9c0\uc6b4\ub2e4\uc74c": 9, "\ub0a8\uc740": 9, "\uc870\uac74": 9, "\uc815\ubcf4\ub85c\uc11c": 9, "\ucc44\ub110\uacfc": 9, "\uc785\ub825\ub418\ub3c4\ub85d": 9, "\uc124\uacc4\ud558\uc600\ub2e4": 9, "guidance\uc5d0": 9, "\uc801\ud569\ud558\uac8c": 9, "\ube44\uad50\ud588\uc74c\uc744": 9, "\uc5b8\uae09\ud588\ub2e4": 9, "models\ub97c": 9, "\uc0ac\uc6a9\ud588\uc74c\uc744": 9, "\ubc1d\ud78c\ub2e4": 9, "\uc5b8\uae09\ud588\ub4ef\uc774": 9, "\uc88b\uc558\ub2e4\uace0": 9, "precision\uacfc": 9, "\uba85\ud655\ud55c": 9, "\uad00\ucc30\ud558\uace0": 9, "\uc5b8\uae09\ud55c\ub2e4": 9, "\ucd5c\uc801\uc73c\ub85c": 9, "\uc218\ud589\ub418\uc5c8\uc73c\uba70": 9, "\ubc29\ubc95\uc784\uc744": 9, "\ud5a5\uc0c1\uc2dc\ud0ac": 9, "\ud3c9\uac00\uc5d0": 9, "caption\uacfc": 9, "\uc77c\uce58\uc2dc\ud0a4\ub294": 9, "\ub6f0\uc5b4\ub098\uc9c0": 9, "\uac00\uc124\uc744": 9, "\uc778\uac04": 9, "\ud3c9\uac00\uc790\ub97c": 9, "\uc9c4\ud589\ud558\uc600\uace0": 9, "\uc778\uac04\ub4e4\uc774": 9, "\uc758\uacac\uc744": 9, "guida": 9, "nce\uac00": 9, "\uc77c\uce58\ud558\ub294": 9, "\uc0dd\uc131\ud55c\ub2e4\uace0": 9, "\ud310\ub2e8\ud588\ub2e4": 9, "table1\uc740": 9, "unguid": 9, "evaluation\uc744": 9, "\uacb0\uacfc\uc774\ub2e4": 9, "\ud56d\ubaa9\uc5d0": 9, "\ubcf4\uc784\uc744": 9, "table2\ub294": 9, "glide\uc640": 9, "model\ub4e4\uc744": 9, "\ud45c\uc774\ub2e4": 9, "\uad6c\ud558\uc600\ub2e4": 9, "coco\uc5d0": 9, "\uacbd\ud5d8\uc774": 9, "result\ub97c": 9, "100\ubc88": 2, "md": 2, "pic": [], "img_04": [], "alt": [], "bg": [], "primari": [], "mb": [], "350px": [], "w_1000": 2, "123": 2, "100\uac1c\uc758": 2, "\uc0d8\ud50c\ub9c1\ud55c": 2, "\ud3c9\uade0\uac12\uc774\uba70": 2}, "objects": {}, "objtypes": {}, "objnames": {}, "titleterms": {"inform": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28], "synthet": [0, 18], "data": [0, 3, 8, 18], "stabl": [0, 27], "diffus": [0, 5, 8, 9, 11, 12, 14, 16, 18, 19, 22, 23, 25, 27, 29], "foliar": 0, "diseas": 0, "classif": [0, 18], "1": [0, 3, 5, 7, 8, 9, 11, 13, 14, 16, 18, 21, 22, 27], "\uac1c\uc694": 0, "2": [0, 3, 5, 7, 8, 9, 11, 13, 14, 15, 16, 18, 21, 22, 27], "baselin": [0, 20], "\uad6c\ucd95": 0, "3": [0, 3, 5, 7, 8, 9, 11, 13, 14, 15, 16, 18, 21, 22, 27], "fine": [0, 3, 5, 9, 18, 23], "tune": [0, 3, 5, 9, 18, 23], "4": [0, 3, 5, 7, 8, 9, 11, 13, 14, 15, 16, 18, 22, 27], "\uc131\ub2a5": 0, "\ube44\uad50": [0, 19], "5": [0, 5, 7, 11, 13, 15, 16, 18, 22], "discuss": [0, 5], "6": [0, 7, 11, 18, 22], "appendix": [0, 1, 23], "train": [1, 3, 4, 5, 8, 9, 15, 18, 20, 21, 24], "dreambooth": [1, 10, 23], "naver": 1, "webtoon": 1, "face": [1, 16], "dataset": 1, "introduct": [1, 3, 5, 7, 8, 9, 10, 11, 13, 14, 15, 16, 18, 19, 20, 21, 22, 23, 24, 25, 27, 28], "ablat": [1, 23, 25, 27], "studi": [1, 20, 23, 25, 27], "prior": [1, 21], "preserv": 1, "loss": [1, 8, 20], "neg": 1, "prompt": 1, "instanc": 1, "guidanc": [1, 3, 9, 22, 25], "scale": [1, 8, 11, 18], "cm3leon": 3, "abstract": [3, 5, 7, 9, 11, 13, 14, 15, 16, 19, 20, 22], "pretrain": [3, 25], "imag": [3, 4, 5, 9, 20, 22, 23, 29], "token": 3, "retriev": 3, "augment": 3, "object": [3, 8, 20], "function": [3, 8, 20], "model": [3, 5, 8, 9, 11, 12, 14, 15, 16, 18, 19, 22, 23, 25, 27], "text": [3, 5, 9, 19, 23, 29], "To": 3, "result": [3, 4, 9, 18, 20, 21, 22, 24, 25], "import": 3, "decod": [3, 8], "strategi": 3, "temperatur": 3, "sampl": [3, 7, 8, 11, 18], "topp": 3, "classifi": [3, 9, 22, 25], "free": [3, 9, 25], "cfg": 3, "contrast": 3, "topk": 3, "cd": 3, "k": 3, "quantit": 3, "evalu": [3, 25], "supervis": 3, "instruct": 3, "gener": [3, 5, 7, 14, 18, 20, 29], "guid": [3, 9, 19], "edit": [3, 5], "ground": 3, "spatial": 3, "caption": 3, "visual": [3, 21], "question": 3, "answer": 3, "task": 3, "controlnet": 4, "addit": [4, 13], "control": 4, "base": [4, 14], "condit": [4, 9, 15], "block": 4, "zero": 4, "convolut": 4, "implement": [4, 20, 27], "custom": 5, "relat": [5, 10, 14, 16, 18, 19, 20], "work": [5, 10, 14, 16, 18, 19, 20, 21, 22], "deep": 5, "transfer": [5, 20], "learn": [5, 21], "adapt": [5, 22, 27], "method": [5, 10, 13, 14, 16, 19, 27], "singl": 5, "concept": 5, "multipl": 5, "composit": 5, "detail": [5, 20, 21, 27], "experi": [5, 7, 8, 10, 12, 13, 14, 16, 23, 24, 27], "limit": [5, 19, 20, 21, 22, 23], "dalle2": 6, "ddim": [7, 22], "background": [7, 8, 9, 18, 21, 22], "ddpm": [7, 8, 11, 22], "variat": [7, 17], "infer": [7, 13], "For": 7, "non": 7, "markovian": 7, "forward": [7, 8], "process": [7, 8], "from": [7, 18, 20, 25], "code": 7, "q": 8, "mathbf": 8, "x": 8, "_t": 8, "_": 8, "t": [8, 13], "revers": 8, "p": 8, "l": 8, "denois": [8, 11], "encod": 8, "l_t": 8, "l_": 8, "l_0": 8, "simplifi": 8, "qualiti": [8, 18, 20], "hyperdreambooth": 10, "contribut": [10, 25], "prelimiari": 10, "lightweight": 10, "lidb": 10, "hypernetwork": 10, "rank": [10, 13], "relax": 10, "fast": 10, "finetun": 10, "comparison": [10, 11, 20, 27], "follow": 10, "up": 10, "conclus": [10, 16, 18, 21, 25], "i": 11, "probabilist": 11, "improv": [11, 15, 18, 22], "log": 11, "likelihood": 11, "improc": 11, "speed": 11, "gan": [11, 19, 22, 24], "size": 11, "latent": [12, 19], "lora": 13, "0": 13, "terminolog": 13, "convent": 13, "problem": 13, "statement": 13, "aren": 13, "exist": 13, "solut": 13, "good": 13, "enough": 13, "our": 13, "low": 13, "parameter": 13, "updat": 13, "matric": 13, "No": 13, "latenc": 13, "appli": 13, "transform": [13, 21], "empir": 13, "ia3": 13, "aa": 13, "\uc0ac\uc6a9\ubc95": 13, "refer": 13, "sdedit": 14, "score": [14, 18], "sde": 14, "smld": 14, "sdxl": 15, "micro": 15, "crop": 15, "paramet": [15, 18, 22], "multi": 15, "aspect": 15, "autoencod": 15, "put": 15, "everyth": 15, "togeth": 15, "refin": 15, "stage": [15, 21], "styo": 16, "styliz": 16, "framework": 16, "stylegan": 17, "map": 17, "network": 17, "style": [17, 20], "adain": 17, "stochast": 17, "mix": 17, "regular": 17, "\uc2e4\ud5d8": 17, "\uacb0\uacfc": [17, 19, 20], "imagenet": 18, "imagen": [18, 25, 26], "protocol": 18, "fid": 18, "IS": 18, "accuraci": 18, "differ": 18, "merg": 18, "real": 18, "textual": 19, "invers": 19, "cf": 19, "\uc774\ud574": 19, "\ubabb\ud568": 19, "ldm": 19, "embed": 19, "\uc131\ub2a5\ud3c9\uac00": 19, "dall": [19, 21], "e": [19, 21], "2\uc640": 19, "synthesi": [19, 22], "pseudo": 19, "word": 19, "\ub450": 19, "\uac1c": 19, "\uc0ac\uc6a9": 19, "bia": 19, "reduct": 19, "\uc815\ub7c9\ud3c9\uac00": 19, "\ud3c9\uac00": 19, "setup": 19, "\uc8fc\ubaa9\ud560": 19, "\uc810": 19, "\uc0ac\uc6a9\uc790\ud3c9\uac00": 19, "\ub9c8\ubb34\ub9ac": 19, "cyclegan": 20, "\ucc38\uace0": 20, "translation\uc774\ub780": 20, "mode": 20, "collapse\ub780": 20, "\uad00\ub828": 20, "\uc5f0\uad6c": 20, "formul": 20, "adversari": 20, "cycl": 20, "consist": 20, "full": 20, "\uc804\uccb4": 20, "\ubaa9\uc801\uc2dd": 20, "least": 20, "squar": 20, "\ucd94\uac00": 20, "\uc124\uba85": 20, "\uae30\ud0c0": 20, "against": 20, "human": [20, 25], "fcn": 20, "\ub4f1": 20, "analysi": 20, "reconstruct": 20, "pair": 20, "dataset\uc5d0": 20, "\ub300\ud55c": 20, "applic": [20, 23, 27], "collect": 20, "transfigur": 20, "season": 20, "photo": 20, "paint": 20, "enhanc": 20, "gati": 20, "discusss": 20, "gpt": 21, "vq": 21, "vae": [21, 28], "methodolog": [21, 25], "previou": 21, "overview": [21, 27], "an": 21, "autoregress": 21, "pipelin": 21, "\uc608\uc2dc": 21, "equat": 21, "\ud559\uc2b5\uacfc\uc815": 21, "codebook": 21, "beat": 22, "architectur": 22, "group": 22, "normal": 22, "algorithm": 22, "7": 22, "impact": 22, "s": 22, "8": 22, "9": 22, "futur": 22, "procedur": 24, "theoret": 24, "summari": [24, 28], "t5": 25, "xxl": 25, "cascad": 25, "larg": 25, "weight": 25, "sampler": 25, "static": 25, "threshold": 25, "dynam": 25, "super": 25, "resolut": 25, "drawbench": 25, "qualit": 25, "tabl": 25, "editor": 26, "t2i": 27, "preliminari": 27, "design": 27, "optim": 27, "intract": 28, "reparameter": 28, "trick": 28, "pseudolab": 29, "feat": 29, "bbdm": 2, "glide": 9, "clip": 9, "inpaint": 9, "nois": 9}, "envversion": {"sphinx.domains.c": 2, "sphinx.domains.changeset": 1, "sphinx.domains.citation": 1, "sphinx.domains.cpp": 6, "sphinx.domains.index": 1, "sphinx.domains.javascript": 2, "sphinx.domains.math": 2, "sphinx.domains.python": 3, "sphinx.domains.rst": 2, "sphinx.domains.std": 2, "sphinx.ext.intersphinx": 1, "sphinx": 56}})
\ No newline at end of file
+Search.setIndex({"docnames": ["docs/experiments/js_exp", "docs/experiments/swjo_exp", "docs/review/BBDM", "docs/review/CM3leon", "docs/review/ControlNet", "docs/review/CustomDiffusion", "docs/review/DALLE2", "docs/review/DDIM", "docs/review/DDPM", "docs/review/GLIDE", "docs/review/HyperDreamBooth", "docs/review/I-DDPM", "docs/review/Latent_Diffusion_Model", "docs/review/LoRA", "docs/review/SDEdit", "docs/review/SDXL", "docs/review/StyO", "docs/review/StyleGAN", "docs/review/Synthetic_Data_from_Diffusion_Models_Improves_ImageNet_Classification", "docs/review/Textual_Inversion", "docs/review/Your_Diffusion_Model_is_Secretly_a_Zero_Shot_Classifier", "docs/review/cycleGAN", "docs/review/dalle", "docs/review/diffusion_beats_GANs", "docs/review/dreambooth", "docs/review/gan", "docs/review/imagen", "docs/review/imagen_editor", "docs/review/t2i_adapter", "docs/review/vae", "intro"], "filenames": ["docs\\experiments\\js_exp.md", "docs\\experiments\\swjo_exp.md", "docs\\review\\BBDM.md", "docs\\review\\CM3leon.md", "docs\\review\\ControlNet.md", "docs\\review\\CustomDiffusion.md", "docs\\review\\DALLE2.md", "docs\\review\\DDIM.md", "docs\\review\\DDPM.md", "docs\\review\\GLIDE.md", "docs\\review\\HyperDreamBooth.md", "docs\\review\\I-DDPM.md", "docs\\review\\Latent_Diffusion_Model.md", "docs\\review\\LoRA.md", "docs\\review\\SDEdit.md", "docs\\review\\SDXL.md", "docs\\review\\StyO.md", "docs\\review\\StyleGAN.md", "docs\\review\\Synthetic_Data_from_Diffusion_Models_Improves_ImageNet_Classification.md", "docs\\review\\Textual_Inversion.md", "docs\\review\\Your_Diffusion_Model_is_Secretly_a_Zero_Shot_Classifier.md", "docs\\review\\cycleGAN.md", "docs\\review\\dalle.md", "docs\\review\\diffusion_beats_GANs.md", "docs\\review\\dreambooth.md", "docs\\review\\gan.md", "docs\\review\\imagen.md", "docs\\review\\imagen_editor.md", "docs\\review\\t2i_adapter.md", "docs\\review\\vae.md", "intro.md"], "titles": ["Synthetic Data with Stable Diffusion for Foliar Disease Classification", "Training DreamBooth on Naver Webtoon Face Dataset", "BBDM", "CM3leon", "ControlNet", "Custom Diffusion", "DALLE2", "DDIM", "DDPM", "GLIDE", "HyperDreamBooth", "I-DDPM", "Latent Diffusion Model", "LoRA", "SDEdit", "SDXL", "StyO", "StyleGAN", "Synthetic Data from Diffusion Models Improves ImageNet Classification", "Textual Inversion", "YDMSZC \ubc1c\ud45c \uc790\ub8cc", "CycleGAN", "DALL-E", "Diffusion Models Beat GANs on Image Synthesis", "DreamBooth", "GAN", "Imagen", "Imagen Editor", "T2I-Adapter", "VAE", "[PseudoLab] Text-to-Image Generation (feat. Diffusion)"], "terms": {"titl": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29], "author": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29], "jisu": [0, 4, 17], "kim": [0, 2, 4, 6, 17, 20], "last": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29], "updat": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29], "jul": [0, 1], "05": [0, 15], "2023": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29], "\uc0ac\uacfc": 0, "\ub098\ubb34\uc758": 0, "\uc78e\uc5d0": 0, "\uc0dd\uae30\ub294": [0, 18], "\uc9c8\ubcd1\uc744": 0, "\uc774\ubbf8\uc9c0\ub85c": [0, 1, 5, 10, 15, 16, 26, 27, 28], "\ud310\ubcc4\ud558\ub294": 0, "kaggl": 0, "competit": [0, 20, 23], "\ub9c1\ud06c": [0, 4], "\uc5d0\uc11c": [0, 2, 4, 6, 8, 9, 11, 13, 18, 20, 22, 24, 26, 27, 28, 29], "\uc544\uc774\ub514\uc5b4\ub97c": 0, "\uc5bb\uc5b4\uc11c": 0, "\uc9c4\ud589\ud55c": [0, 2, 9], "\ud504\ub85c\uc81d\ud2b8\uc785\ub2c8\ub2e4": 0, "\ud574\ub2f9": [0, 5, 8, 9, 10, 12, 14, 18, 19, 20, 24, 28, 29], "competition\uc740": 0, "\uc0ac\uacfc\ub098\ubb34": 0, "\uac78\ub9b0": 0, "\uc9c8\ubcd1\uc5d0": 0, "\ub530\ub77c": [0, 2, 3, 6, 9, 10, 11, 13, 15, 18, 19, 20, 21, 22, 23, 24, 29], "\uc78e": 0, "\uc774\ubbf8\uc9c0\ub97c": [0, 3, 4, 5, 6, 7, 8, 9, 10, 11, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 28], "4\uac1c\uc758": [0, 6, 19, 28], "class\ub85c": 0, "\ubd84\ub958\ud558\ub294": [0, 9, 28], "task\uc785\ub2c8\ub2e4": 0, "class": [0, 4, 5, 7, 8, 9, 11, 17, 18, 20, 21, 23, 24, 25, 26, 28, 29], "leav": [0, 20], "competition\uc744": 0, "\uc124\uba85\ud55c": [0, 28], "articl": 0, "\uc804\uccb4\uc801\uc778": [0, 6, 17], "accuracy\ub294": 0, "97": [0, 20], "\uc774\uc9c0\ub9cc": 0, "multipl": [0, 28], "class\uc758": [0, 23], "\uacbd\uc6b0": [0, 1, 2, 4, 5, 10, 11, 12, 13, 14, 15, 16, 17, 18, 20, 21, 25, 28], "accuracy\uac00": 0, "51": 0, "\uc5d0": [0, 1, 2, 3, 4, 5, 6, 8, 9, 10, 11, 12, 13, 16, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29], "\ubd88\uacfc\ud588\ub2e4\uace0": 0, "\uc5b8\uae09\ud569\ub2c8\ub2e4": 0, "\uc774\ubbf8\uc9c0": [0, 3, 4, 5, 6, 7, 9, 10, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28], "\uac1c\uc218\uac00": 0, "\ub2e4\ub978": [0, 5, 7, 8, 9, 10, 11, 13, 16, 17, 18, 19, 20, 21, 22, 24, 26, 27, 28, 29], "class\uc5d0": [0, 9], "\ube44\ud574": [0, 3, 4, 5, 7, 9, 10, 11, 16, 18, 23, 27], "\uc801\uc740": [0, 3, 4, 5, 7, 8, 9, 11, 13, 19, 20, 23], "\uc810\uc5d0": 0, "\uc8fc\ubaa9\ud588\uace0": 0, "diffusion\uc744": [0, 14], "\uc0ac\uc6a9\ud558\uc5ec": [0, 8, 10, 17, 18, 21, 22, 24, 26], "\ud074\ub798\uc2a4\uc758": [0, 18], "\ub370\uc774\ud130": [0, 14, 15, 18, 19, 20, 21, 22, 25, 26, 29], "\uac1c\uc218\ub97c": [0, 8], "\ub298\ub824\uc11c": 0, "classifi": [0, 18, 20, 25, 27, 28], "\ud559\uc2b5\uc5d0": [0, 3, 11, 13, 18, 21], "\uc0ac\uc6a9\ud558\uba74": [0, 11, 19, 22], "\ub354": [0, 1, 4, 5, 6, 7, 8, 9, 10, 11, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 26, 27, 28, 29], "\uc88b\uc740": [0, 1, 5, 9, 14, 15, 16, 18, 21, 23, 24, 26, 28], "\uc131\ub2a5\uc758": [0, 18], "classifier\ub97c": [0, 9], "\uc5bb\uc744": [0, 15, 18, 19, 20, 21], "\uc218": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29], "\uc788\uc744": [0, 1, 4, 8, 13, 17, 18, 20, 21], "\uac83\uc73c\ub85c": [0, 9, 13, 18, 19, 20], "\uae30\ub300\ud588\uc2b5\ub2c8\ub2e4": 0, "\ubb38\uc81c": [0, 28], "\uc0c1\ud669\uc744": 0, "\uc7ac\ud604\ud558\uae30": 0, "\uc704\ud574": [0, 2, 3, 4, 5, 6, 8, 9, 10, 13, 15, 16, 17, 19, 21, 22, 24, 27, 28], "\uae30\uc874": [0, 3, 4, 5, 9, 10, 11, 12, 13, 15, 16, 17, 18, 19, 21, 23, 26, 28], "\ub370\uc774\ud130\ub85c": [0, 1, 4, 15, 18, 20], "imag": [0, 1, 2, 6, 7, 10, 12, 14, 15, 16, 17, 18, 19, 20, 22, 25, 26, 27, 28, 29], "\ud559\uc2b5\ud558\uc5ec": [0, 22, 23], "baseline\uc73c\ub85c": 0, "\uc7a1\uc558\uc2b5\ub2c8\ub2e4": 0, "\ubaa8\ub378\uc740": [0, 3, 6, 10, 11, 12, 13, 14, 15, 17, 18, 19, 21, 28], "pretrained\ub41c": 0, "resnet18\uc5d0": 0, "linear": [0, 2, 8, 9, 11, 13, 17, 23, 25, 29], "layer\ub97c": [0, 13, 17], "\ubd99\uc5ec\uc11c": 0, "\uc0ac\uc6a9\ud588\uc2b5\ub2c8\ub2e4": [0, 6, 10, 27], "\uc804\uccb4": [0, 4, 5, 9, 10, 11], "7": [0, 1, 3, 7, 8, 11, 14, 26, 28], "class\ubcc4": 0, "healthi": 0, "99": 0, "73": [0, 19], "rust": 0, "scab": 0, "98": 0, "class\ub294": 0, "\uac1c\uc218": 0, "91\uac1c\ub85c": 0, "\ud074\ub798\uc2a4\ub4e4\uc5d0": 0, "\ube44\ud574\uc11c": [0, 6], "\uc801\uc2b5\ub2c8\ub2e4": 0, "imbalance\uac00": 0, "\uc131\ub2a5\uc744": [0, 3, 4, 5, 7, 9, 11, 12, 13, 15, 16, 17, 18, 20, 21, 22, 23, 24, 26, 27, 28], "\ub0ae\ucd94\ub294": 0, "\uc6d0\uc778\uc77c": [0, 18], "\uac83\uc774\ub77c": [0, 13], "\uac00\uc815\ud558\uace0": 0, "diffusion\uc73c\ub85c": [0, 18], "data\ub97c": [0, 5, 21], "\ucd94\uac00\ub85c": [0, 3, 11, 15, 16, 21], "\uc0dd\uc131\ud574\ubcf4\uae30\ub85c": 0, "\ud588\uc2b5\ub2c8\ub2e4": [0, 1, 6, 17, 18], "\uc608\uc2dc": [0, 3, 21, 26, 27, 28], "pretran": 0, "diffusion\uc758": 0, "\ub300\ud55c": [0, 1, 4, 5, 6, 8, 9, 10, 11, 15, 16, 18, 19, 24, 25, 28], "\uc815\ubcf4\uac00": [0, 6, 10, 16, 24], "\uc5c6\uc5b4\uc11c": 0, "\uc0dd\uc131\ud560": [0, 1, 3, 4, 6, 10, 14, 15, 18, 21, 24], "\uc544\ub798\uc640": [0, 4, 12, 17, 18, 21, 23], "\uac19\uc774": [0, 2, 4, 5, 6, 8, 10, 12, 17, 19, 21, 22, 23, 24, 25, 28, 29], "\uad00\ub828\uc5c6\ub294": 0, "\uc774\ubbf8\uc9c0\uac00": [0, 4, 6, 8, 10, 11, 14, 15, 16, 18, 19, 21, 23, 25, 26], "\uc0dd\uc131\ub429\ub2c8\ub2e4": 0, "prompt": [0, 4, 5, 6, 9, 10, 16, 19, 20, 24, 26, 27, 28], "photo": [0, 1, 5, 19], "\ub530\ub77c\uc11c": [0, 2, 3, 4, 6, 8, 9, 10, 11, 14, 15, 16, 18, 19, 20, 21, 27, 28], "model": [0, 2, 4, 6, 7, 10, 13, 20, 22, 25, 27, 30], "\uc815\ubcf4\ub97c": [0, 4, 6, 8, 10, 16, 18, 19, 21, 24, 28], "\ub123\uc5b4\uc8fc\uae30": 0, "dreambooth": [0, 5], "\ub97c": [0, 1, 2, 3, 4, 5, 6, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29], "tuning\ud588\uc2b5\ub2c8\ub2e4": 0, "training\uc5d0": [0, 7, 23], "\uc0ac\uc6a9\ud55c": [0, 3, 10, 15, 17, 18, 19, 23], "prompt\ub294": [0, 10], "disea": 0, "leaf": 0, "\uc774\uba70": [0, 19], "\uc0dd\uc131\ud55c": [0, 4, 6, 22, 24, 26, 28], "\uc774\ubbf8\uc9c0\uc758": [0, 1, 4, 5, 6, 8, 16, 17, 18, 19, 21, 22, 24, 26], "\uc608\uc2dc\ub294": [0, 26, 28], "\uac19\uc2b5\ub2c8\ub2e4": [0, 1, 4, 6, 17, 18, 21, 24, 25, 28], "\uc0dd\uc131": [0, 3, 6, 7, 8, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 22, 23, 24, 26, 28], "engineering\uc744": 0, "\uc218\ud589\ud558\ub358": 0, "\uc911": [0, 3, 5, 7, 10, 11, 13, 14, 15, 16, 17, 18, 21, 22, 23, 24, 26, 28, 29], "\uc758\ub3c4\ud558\uc9c0\uc54a\uc740": 0, "\uacb0\uacfc\ub97c": [0, 3, 4, 6, 7, 9, 10, 11, 15, 16, 17, 18, 19, 21, 26, 27], "\ubc1c\uacac\ud588\uc2b5\ub2c8\ub2e4": [0, 1, 6], "\uc544\ub798\ub294": [0, 4, 14, 29], "\uc774\uc5d0": [0, 4, 10, 16, 18, 24, 28], "\uc608\uc2dc\ub85c": [0, 20], "\uc804\uc758": [0, 15], "model\uc758": [0, 4, 5, 8, 10, 11, 13, 18, 19, 21, 23], "\uacb0\uacfc\uc640": [0, 6], "\ube44\uad50\uc785\ub2c8\ub2e4": 0, "\uc0c1\ud6691": 0, "\uc804": [0, 8, 13, 15, 18, 23], "\ud6c4": [0, 1, 3, 6, 8, 9, 13, 14, 15, 21, 22, 24, 26, 27, 28], "\uc0c1\ud6691\uc744": 0, "\ubcf4\uba74": [0, 5, 9, 11, 15, 17, 18, 19, 21, 22], "\ub2f4\uc740": 0, "uniqu": [0, 1, 24], "identifi": [0, 1, 16, 24], "\uac00": [0, 1, 2, 4, 6, 8, 9, 10, 11, 13, 15, 20, 21, 23, 24, 25, 26, 27, 28, 29], "\uc5c6\uc74c\uc5d0\ub3c4": [0, 9], "diseases\uc758": 0, "\uc78e\ub4e4\ub9cc": 0, "\uc774\ub294": [0, 3, 4, 7, 10, 15, 17, 18, 19, 25, 27, 28, 29], "\uac19\uc740": [0, 1, 3, 4, 6, 8, 9, 10, 11, 12, 13, 15, 17, 18, 19, 20, 21, 23, 24, 27, 28, 29], "\uc18d\ud558\ub294": 0, "\uc774\ubbf8\uc9c0\ub4e4\uc744": [0, 1, 4, 6, 24, 27], "\uc0dd\uc131\ud574\ub0b4\uc9c0": [0, 5], "\ubabb\ud558\uace0": [0, 8], "\uc788\ub2e4\ub294": [0, 9, 10, 13, 17, 19, 26, 27], "\uac83\uc785\ub2c8\ub2e4": [0, 4, 6, 10, 17, 18, 21, 27], "\uc774": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 28, 29], "\ud604\uc0c1\uc744": [0, 5, 17], "languag": [0, 3, 5, 6, 13, 18, 19, 22, 24, 26], "drift\ub77c\uace0": 0, "\ud558\uba70": [0, 22], "\ubaa8\ub378\uc774": [0, 1, 3, 4, 5, 7, 8, 10, 15, 16, 17, 18, 23, 24, 27, 28], "leaf\uac00": 0, "\uc544\ub2cc": [0, 1, 4, 7, 13, 19, 21, 23], "\uc77c\ubc18\uc801\uc778": [0, 3, 5, 10, 15, 19, 20, 23], "\uad00\ud55c": [0, 11, 16, 17], "\uc78a\uc5b4\ubc84\ub838\uae30": 0, "\ub54c\ubb38\uc785\ub2c8\ub2e4": [0, 21], "\uc0c1\ud6692": 0, "\uc0c1\ud6692\ub97c": 0, "photo\ub77c\ub294": 0, "prompt\ub9cc": [0, 16], "\uc0ac\uc6a9\ud558\uc600\ub294\ub370\ub3c4": 0, "\uc774\ubbf8\uc9c0\ub4e4\uc5d0": [0, 6], "\ud2b9\uc9d5\ub4e4\uc774": 0, "\ub098\ud0c0\ub0a9\ub2c8\ub2e4": 0, "dreambooth\uc5d0\uc11c\ub294": 0, "drift\ub97c": 0, "prior": [0, 6, 24, 29], "preserv": [0, 24], "loss\ub97c": [0, 3, 8, 19, 21], "\uc0ac\uc6a9\ud574\uc11c": [0, 4, 6, 9, 22, 23, 26], "\ud574\uacb0\ud558\uc600\uc73c\ubbc0\ub85c": 0, "\ubc29\ubc95\uc744": [0, 3, 9, 10, 11, 15, 17, 18, 19, 20, 21, 23, 28], "\ud574\uacb0\ud558\uae30": [0, 13, 15, 19, 24, 27, 28], "train": [0, 6, 7, 10, 11, 13, 16, 17, 19, 20, 23, 24, 26, 28], "prompt\uc5d0\uc11c": 0, "\uc81c\uc678\ud558\uace0": [0, 13, 15], "\ucd5c\ub300\ud55c": [0, 15, 19, 21, 28, 29], "\ub2e8\uc21c\ud55c": [0, 5], "model\uc744": [0, 4, 5, 7, 9, 10, 11, 13, 15, 16, 19, 23], "\ub2e4\uc2dc": [0, 8, 13, 14, 17, 21, 24, 25, 28, 29], "\uacb0\uacfc": [0, 1, 2, 3, 5, 6, 10, 11, 12, 15, 18, 20, 23, 26, 27, 28], "\uc7ac\ud6c8\ub828": 0, "\uc774\ud6c4\uc5d0\ub3c4": 0, "model\ub85c": [0, 9], "\uc0dd\uc131\ud558\uc600\uc744": 0, "\ub54c\uc640": [0, 21], "\ube44\uc2b7\ud55c": [0, 3, 5, 11, 19, 21, 23, 24], "\uc758": [0, 1, 2, 4, 6, 7, 8, 9, 10, 11, 13, 15, 17, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29], "\uacbd\uc6b0\uc5d0\ub294": [0, 18], "\uc5ec\uc804\ud788": [0, 5, 10, 26], "\uc601\ud5a5\uc744": [0, 3, 8, 11, 16, 17, 18, 23, 26], "\ubc1b\uc740": [0, 19], "\uac83\uac19\uc740": 0, "\uc774\ubbf8\uc9c0\ub4e4\uc774": [0, 4], "photo\uc758": 0, "\uc5ec\ub7ec": [0, 2, 10, 12, 18, 19, 20, 24, 28], "\ub300\uc0c1\ub4e4\uacfc": 0, "\uc0ac\uc6a9\ub418\ub294": [0, 10, 18, 19, 24], "\ud2b9\uc131\uc744": [0, 10, 21], "\uac00\uc9c0\uace0\uc788\uc5b4\uc11c": 0, "\uadf8\ub7f0": [0, 14, 17, 21], "\uac83\uc774\ub77c\ub294": [0, 18], "\uc0dd\uac01\uc774": [0, 18], "\ub4e4\uc5c8\uace0": 0, "\uc774\ub97c": [0, 4, 8, 10, 12, 13, 15, 17, 18, 19, 20, 21, 24, 25, 27, 28, 29], "\uccb4\ud06c\ud574\ubcf4\uae30": 0, "\ud2b9\uc815\ud55c": [0, 2, 6, 17, 19, 20], "photo\uc640": 0, "\uc6a9\ub3c4\ub85c": 0, "prompt\ub4e4\ub85c": 0, "\uc0dd\uc131\ubcf4\uc558\uc2b5\ub2c8\ub2e4": 0, "\ub300\uc0c1": [0, 21], "\uc138\uac00\uc9c0\ub85c\ub294": 0, "cat": [0, 8, 27, 28], "sea": 0, "pirate\uc744": 0, "\uc0ac\uc6a9\ud588\uace0": [0, 3, 9, 15, 21], "\ube44\uc2b7\ud558\uac8c": [0, 19], "\ud14d\uc2a4\ud2b8": [0, 3, 6, 10, 18, 19, 26], "\uc138\uac00\uc9c0\ub294": 0, "illustr": 0, "anim": [0, 22], "wallpaper\ub97c": 0, "\uc774\ubbf8\uc9c0\ub294": [0, 5, 10, 15, 16, 21, 26], "\uae00": 0, "\ub9c8\uc9c0\ub9c9": [0, 8, 9, 17, 18], "\ubd80\ubd84\uc758": 0, "appendix\uc5d0": 0, "\uc788\uc2b5\ub2c8\ub2e4": [0, 1, 4, 6, 10, 17, 18, 21, 24, 25, 27, 28, 29], "\ub300\uc0c1\uc744": [0, 21], "\uc9c0\uce6d\ud558\ub294": 0, "\ud14d\uc2a4\ud2b8\uc758": 0, "\ub300\uc0c1\uc758": [0, 24], "\ud2b9\uc9d5\uc774": 0, "\uc798": [0, 1, 3, 4, 5, 9, 10, 11, 14, 15, 16, 17, 18, 19, 21, 24, 25, 29], "\ub4dc\ub7ec\ub098\ub294": 0, "\uc0dd\uc131\ub418\uc5c8\uc9c0\ub9cc": 0, "\ub300\uc0c1\uacfc": [0, 10, 21], "\ud568\uaed8": [0, 9, 10, 13, 15, 20, 21, 29], "\uc4f0\uc774\ub294": [0, 18, 21, 25], "\uc78e\uc0ac\uadc0\uc758": 0, "\ud2b9\uc9d5\uc744": [0, 4, 24], "\uac00\uc9c0\ub294": [0, 1, 17], "\uc77c\ubd80": [0, 3, 9, 10, 13, 17], "\uc0dd\uc131\ub418\uc5c8\uc2b5\ub2c8\ub2e4": 0, "tuning\ud55c": 0, "400\uc7a5": 0, "\uc0dd\uc131\ud558\uc5ec": 0, "\ud6c8\ub828\ud588\uc2b5\ub2c8\ub2e4": 0, "result_bas": 0, "\ucd94\uac00": [0, 5, 8, 10, 14, 15, 20, 28], "\ud65c\uc6a9\ud55c": [0, 6, 9, 20, 24, 25], "9": [0, 3, 11, 14, 15, 21], "84": 0, "result_now": 0, "kaggle\uc5d0\uc11c": 0, "\uc81c\uacf5\ud558\ub294": [0, 6, 19], "test": [0, 19, 20, 21, 26], "set\uc5d0": [0, 18], "\uc801\uc6a9\ud588\uc744": 0, "\ub54c\ub294": [0, 2, 18], "baseline\uc774": [0, 19], "94": 0, "\uacbd\uc6b0\uac00": [0, 4, 7, 21], "93": 0, "\uc5ec\uc11c": 0, "baseline\ubcf4\ub2e4": 0, "\uc5bb\uc9c0\ub294": 0, "\ubabb": 0, "\ud6c8\ub828": [0, 4, 15, 18, 21, 26], "\uc911\uac04\uc911\uac04\uc5d0": 0, "\uc77c\uc815": 0, "step\ub9c8\ub2e4": 0, "\uc0dd\uc131\ud558\uac8c\ud574\uc11c": 0, "\ud6c8\ub828\uc5d0": [0, 17], "\ubaa8\ub2c8\ud130\ub9c1\uc774": 0, "\uc788\uc73c\uba74": 0, "\uc88b\uaca0\ub2e4\ub294": 0, "\uc0dd\uac01\uc744": 0, "\ud6c8\ub828\uc2dc": 0, "hyperparamet": [0, 7, 10, 16, 23, 28], "tuning\uc744": [0, 4, 10, 13, 18, 19], "\uc880": [0, 4, 6, 16, 26], "\ucca0\uc800\ud558\uac8c": 0, "\ud574\uc57c\uaca0\ub2e4\ub294": 0, "\uc2e4\uc81c\ub85c": [0, 3, 11, 15, 17, 18, 21, 25, 29], "\uc870\uac74\uc744": [0, 10, 19], "\ub9cc\uc871\ud558\ub294\uc9c0": 0, "\uac80\uc218\ud560": 0, "\ubc29\uc548\uc774": 0, "\ud544\uc694\ud569\ub2c8\ub2e4": 0, "\ub0b4\uc5d0\uc11c\ub3c4": 0, "\uce74\ud14c\uace0\ub9ac\ub97c": 0, "\ub098\ub20c": 0, "\uc788\ub2e4\uba74": [0, 6, 8, 20], "\ub098\ub220\uc11c": [0, 26], "\uac01\uac01\uc5d0": [0, 6, 17, 18], "tuning\ud560": [0, 5, 13], "\uc218\ub3c4": [0, 6, 17, 20, 21, 28], "\ud65c\uc6a9\ud574\ubcfc": 0, "submiss": 0, "score\uc5d0\uc11c": [0, 18], "baseline\uc744": 0, "\uc774\uae30\uc9c0": 0, "\ud588\uc9c0\ub9cc": 0, "text": [0, 1, 4, 6, 8, 10, 12, 15, 16, 17, 18, 20, 22, 26, 27, 28], "\uc774\uc6a9\ud55c": [0, 16, 18], "data\uc758": [0, 11, 16], "\uac00\ub2a5\uc131\uc744": [0, 7], "\ubcfc": [0, 1, 6, 10, 15, 17, 18, 19, 21, 22, 23, 27], "\uc788\uc5c8\ub2e4\uace0": [0, 13, 27, 28], "\uc0dd\uac01\ud569\ub2c8\ub2e4": [0, 17], "\uc55e\uc5d0\uc11c": 0, "\uc5b8\uae09\ud55c": [0, 4, 15, 27], "prompt\uc5d0": [0, 5, 9], "\uc608\uc2dc\uc785\ub2c8\ub2e4": [0, 1], "nsfw\ub85c": 0, "\ud310\ub2e8\ub418\uc5b4": 0, "\uac80\uc740\uc0c9\uc73c\ub85c": 0, "\ub098\uc654\uc2b5\ub2c8\ub2e4": [0, 17], "pirat": 0, "wallpap": 0, "sangwoo": [1, 24, 25, 27, 28, 29], "jo": [1, 24, 25, 27, 28, 29], "09": [1, 20], "\uc774\ubc88": [1, 27, 28], "\ud3ec\uc2a4\ud305\uc5d0\uc11c\ub294": [1, 6], "\uc9c1\uc811": [1, 11, 14, 20, 25, 29], "\ud559\uc2b5\ud574\ubcf4\uace0": 1, "\uc2e4\ud5d8\ud55c": [1, 10], "\uacb0\uacfc\ub4e4\uc744": [1, 6, 24, 28], "\uacf5\uc720\ud560\ub824\uace0": 1, "\ud569\ub2c8\ub2e4": [1, 4, 6, 10, 18, 21, 24, 25, 27, 28, 29], "\uc6b0\uc120\uc801\uc73c\ub85c": [1, 22, 28, 29], "\ud559\uc2b5\ub370\uc774\ud130\ub294": 1, "bryandle": 1, "data": [1, 13, 17, 20, 21, 25], "\uacf5\uac1c\ub41c": [1, 13, 27], "yolov5": 1, "\ubaa8\ub378": [1, 2, 3, 5, 6, 7, 9, 10, 11, 13, 14, 15, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28], "\ubc0f": [1, 3, 6, 10, 13, 15, 18, 19, 21, 23, 25, 26, 27, 28], "waifu2x": 1, "\ud6c4\ucc98\ub9ac": [1, 15], "\uae30\ubc95\uc744": [1, 5, 9, 11, 27], "\ud65c\uc6a9\ud558\uc5ec": [1, 9, 10, 15, 22, 23, 24], "\ud504\ub9ac\ub4dc\ub85c\uc6b0\uc5d0": 1, "\ub4f1\uc7a5\ud558\ub294": 1, "\uc778\ubb3c": [1, 21], "\uc0ac\uc9c4\ub4e4\uc744": [1, 24], "\uc218\uc9d1\ud588\uc2b5\ub2c8\ub2e4": 1, "\ub17c\ubb38\uc5d0\uc11c\ub294": [1, 4, 6, 8, 9, 10, 12, 13, 17, 18, 19, 20, 21, 22, 23, 24, 27, 28, 29], "3": [1, 6, 12, 19, 20, 24, 25, 26, 29], "5": [1, 2, 8, 9, 10, 14, 20, 21, 25, 28, 29], "\uc7a5\uc73c\ub85c": 1, "fine": [1, 4, 6, 10, 13, 16, 17, 19, 22, 26, 30], "tune": [1, 6, 10, 13, 22, 26, 30], "\uac00\ub2a5\ud558\ub2e4\uace0": [1, 17], "\uc81c\uc2dc\ub418\uc5b4\uc788\uc9c0\ub9cc": 1, "\uc0ac\uc9c4": [1, 2, 5, 19, 21, 26], "\ub9ce\uc740": [1, 4, 6, 9, 15, 18, 19, 21, 22, 26], "\ud559\uc2b5\ud558\uba74": [1, 8, 24], "\uc131\ub2a5\uc774": [1, 8, 11, 13, 15, 18, 20, 23, 27, 28], "\uc88b\uc544\uc838\uc11c": 1, "15": [1, 3, 10], "20": [1, 3, 9, 11, 24], "\uc7a5\uc758": [1, 6, 20], "\ud559\uc2b5\ud558\uc600\uc2b5\ub2c8\ub2e4": 1, "\ud559\uc2b5\ud55c": [1, 5, 6, 9, 11, 13, 16, 18, 20, 24, 26, 27], "\uc774\ubbf8\uc9c0\ub4e4": [1, 15], "\uc2e4\ud5d8\ud558\uba74\uc11c": 1, "\ub300\ud45c\uc801\uc73c\ub85c": [1, 24, 28, 29], "\uadf8\ub9ac\uace0": [1, 10, 18, 19, 24, 25, 27, 28, 29], "\ub9c8\uc9c0\ub9c9\uc73c\ub85c": [1, 10, 17, 24, 27, 28, 29], "\ubc18\uc601\ud558\ub294": 1, "\uc815\ub3c4\ub97c": [1, 7, 11], "\uc870\uc808\ud558\ub294": [1, 4, 7, 10, 18], "prior_loss_weight": [1, 24], "\ubc14\uafd4\uac00\uba74\uc11c": 1, "\ud559\uc2b5\ud574\ubcf4\uc558\uc2b5\ub2c8\ub2e4": 1, "\uc0ac\uc804\ud559\uc2b5\ub41c": [1, 18, 24], "\ubaa8\ub378\ub85c": [1, 5, 10, 17, 18, 22, 25, 27, 28], "\ucc98\uc74c\uc5d0\ub294": [1, 3, 13, 18], "hakurei": 1, "waifu": 1, "diffus": [1, 2, 4, 6, 7, 10, 13, 15, 20, 25, 27], "\ubaa8\ub378\uc744": [1, 3, 5, 6, 7, 8, 9, 10, 11, 14, 15, 18, 19, 20, 21, 24, 26, 27, 28], "\uc2dc\ub3c4\ud574\ubd24\uc9c0\ub9cc": 1, "\uacb0\uacfc\uac00": [1, 8, 9, 18, 21, 23], "\ub9cc\uc871\uc2a4\ub7fd\uc9c0": 1, "\ubabb\ud574": 1, "runwayml": 1, "stabl": [1, 4, 10, 11, 13, 15, 18, 20, 24, 27], "v1": [1, 10], "\uc791\uc5c5\uc744": [1, 19, 28], "\uc9c4\ud589\ud588\uc2b5\ub2c8\ub2e4": [1, 10, 25, 27, 28], "\uc81c\uc678\ud55c": 1, "\ub3d9\uc77c\ud55c": [1, 3, 10, 11, 15, 18, 20, 21, 24, 27, 28], "configur": [1, 23, 25], "\uc73c\ub85c": [1, 2, 6, 10, 13, 19, 20, 22, 24, 26, 27, 28], "\uacb0\uacfc\uc785\ub2c8\ub2e4": [1, 12, 18, 28], "model_nam": 1, "instance_prompt": 1, "A": [1, 3, 4, 5, 6, 10, 13, 17, 19, 20, 24, 26, 28], "sk": [1, 16, 19], "girl": 1, "class_prompt": 1, "python3": 1, "train_dreambooth": [1, 24], "py": [1, 20, 24], "pretrained_model_name_or_path": [1, 24], "pretrained_vae_name_or_path": 1, "stabilityai": 1, "sd": [1, 15, 20, 28], "vae": [1, 5, 11, 24, 25], "ft": 1, "mse": [1, 8], "output_dir": 1, "revis": [1, 24], "fp16": 1, "with_prior_preserv": [1, 24], "1": [1, 2, 4, 6, 10, 12, 15, 17, 19, 20, 21, 24, 25, 26, 29], "0": [1, 2, 3, 4, 5, 6, 7, 8, 12, 14, 15, 17, 18, 20, 21, 22, 24, 25, 28, 29], "seed": 1, "1337": 1, "resolut": [1, 9, 11, 12, 18, 20, 23, 27], "512": [1, 15, 25], "train_batch_s": 1, "train_text_encod": [1, 24], "mixed_precis": 1, "use_8bit_adam": 1, "gradient_accumulation_step": [1, 24], "gradient_checkpoint": 1, "learning_r": 1, "1e": [1, 16], "6": [1, 3, 5, 14, 15, 16, 20, 21], "lr_schedul": [1, 24], "constant": [1, 11, 23], "lr_warmup_step": 1, "num_class_imag": 1, "200": [1, 15, 26], "sample_batch_s": 1, "4": [1, 6, 12, 17, 20, 21, 25], "max_train_step": 1, "800": 1, "save_interv": 1, "100": [1, 11, 18, 20, 21], "save_sample_prompt": 1, "concepts_list": 1, "json": 1, "w": [1, 2, 4, 5, 8, 12, 13, 17, 22, 26], "o": [1, 16, 27], "\uc544\ub798": [1, 2, 4, 6, 11, 17, 18, 21, 22, 24, 25, 26, 28, 29], "\uadf8\ub9bc\ucc98\ub7fc": [1, 6, 13, 25, 26], "infer": [1, 8, 15, 20, 28, 29], "\uc785\ub825\ud588\uc744": 1, "\ub54c": [1, 2, 3, 4, 5, 6, 7, 8, 10, 11, 12, 14, 15, 16, 17, 18, 19, 20, 21, 23, 24, 25, 29], "\uc81c\uc678\ud568\uc73c\ub85c\uc368": 1, "input": [1, 3, 4, 5, 6, 13, 17, 19, 20, 21, 22, 24, 25, 27, 28], "\uac00\uae4c\uc6b4": [1, 3, 19, 21, 22], "\uc6f9\ud230": 1, "\uc788\uc5c8\uc2b5\ub2c8\ub2e4": [1, 4, 6, 10, 18, 27], "\ub610\ud55c": [1, 3, 4, 5, 8, 9, 10, 12, 13, 15, 18, 21, 24, 27, 28], "\ud551\ud06c\uc0c9": 1, "\uba38\ub9ac\ub97c": 1, "\ud55c": [1, 2, 6, 8, 9, 10, 11, 13, 15, 17, 18, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29], "\uc774\ubbfc\uc9c0": 1, "\uce90\ub9ad\ud130\ub97c": 1, "\uc5b4\ub290": [1, 17, 18, 19], "\uc815\ub3c4": [1, 3, 7, 11, 13, 17, 18], "\uc0dd\uc131\ud558\ub294": [1, 3, 4, 6, 8, 9, 10, 12, 14, 17, 18, 21, 24, 25, 26, 27, 29], "\ubd80\ubd84\ub3c4": [1, 27], "\ud655\uc778\ud560": [1, 11, 14, 15, 18, 24, 27, 28], "pink": 1, "hair": [1, 16, 17], "With": 1, "without": [1, 13, 16, 17], "\ub3c4": [1, 3, 6, 8, 10, 16, 20, 24, 28, 29], "\uce90\ub9ad\ud130\uc758": [1, 24], "\ubd80\uc790\uc5f0\uc2a4\ub7ec\uc6b4": 1, "\ubd80\ubd84\uc774\ub098": 1, "\uc800\ud574\uc0c1\ub3c4": 1, "\uacbd\uc6b0\ub4e4\uc774": 1, "\uc885\uc885": [1, 21], "\ubc1c\uc0dd\ud588\ub294\ub370": 1, "\ud1b5\ud574": [1, 3, 5, 7, 8, 9, 10, 13, 14, 15, 17, 18, 19, 21, 22, 23, 24, 25, 28, 29], "\ud004\ub9ac\ud2f0\uc758": [1, 11, 14, 16, 18], "ugli": 1, "disfigur": 1, "deform": 1, "low": [1, 6, 10, 11, 14, 22, 28], "\ub17c\ubb38\uc5d0\uc11c": [1, 4, 9, 12, 17, 18, 20, 21, 22, 24, 25, 29], "\uc81c\uc2dc\ud55c": [1, 5, 6, 9, 14, 20, 22, 25, 26], "\uc678\uc5d0": 1, "style": [1, 6, 10, 16, 19, 24], "\ub77c\ub294": [1, 2, 4, 6, 10, 18, 19, 21, 23, 26], "\ub85c": [1, 2, 3, 4, 6, 8, 9, 10, 11, 12, 13, 14, 15, 17, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30], "\ud559\uc2b5\uc744": [1, 5, 9, 12, 15, 17, 18, 21, 23, 27], "\uc2dc\ub3c4\ud574\ubcf4\uae30\ub3c4": 1, "\ud2b9\uc815": [1, 5, 6, 7, 8, 9, 10, 11, 13, 15, 16, 18, 19, 26, 28], "\uc5ec\uc790": 1, "\uce90\ub9ad\ud130\uc5d0": 1, "\uc815\ubcf4\ubfd0\ub9cc": 1, "\uc544\ub2c8\ub77c": [1, 5, 6, 8, 9, 13, 15, 17, 21, 24], "\ud504\ub9ac\ub4dc\ub85c\uc6b0": 1, "\uadf8\ub9bc\uccb4": 1, "\uc790\uccb4\ub97c": [1, 6, 11], "\ub2f4\uc544\ub0b4\uae30": 1, "\uc704\ud55c": [1, 3, 6, 10, 11, 12, 13, 15, 18, 19, 21, 23, 29], "\ubaa9\uc801\uc774\uc600\uc2b5\ub2c8\ub2e4": 1, "differ": [1, 6, 13, 17], "\uc2dc": [1, 22, 23, 24, 25, 26, 27, 28], "\ud504\ub9ac\ub4dc\ub85c\uc6b0\uc758": 1, "\uadf8\ub9bc\uccb4\uac00": [1, 6], "\ubc18\uc601\ub41c": [1, 6], "\ub0a8\uc790\uac00": 1, "\uc0dd\uc131\ub418\ub3c4\ub85d": 1, "boi": 1, "\uc785\ub825\ud588\uc744\ub54c\uc758": 1, "\ud639\uc740": [1, 2, 5, 6, 12, 15, 24, 29], "\uc791\uac00\ub2d8\uc758": 1, "\uc7a5\uba74\ub4e4\ub85c": 1, "\uc804\uccb4\uc801\uc73c\ub85c": 1, "\ud559\uc2b5\ud558\uac8c": [1, 27, 28], "\ub41c\ub2e4\uba74": [1, 15], "\ub2e4\uc591\ud55c": [1, 3, 5, 6, 10, 11, 12, 15, 16, 18, 19, 21, 22, 23, 24, 26, 28], "\uac83": [1, 2, 6, 8, 18, 19, 20, 21, 26], "num_inference_step": [1, 28], "24": [1, 18], "step": [1, 5, 6, 7, 8, 9, 11, 16, 18, 20, 23, 24, 25, 28], "\uc744": [1, 2, 3, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15, 17, 19, 20, 21, 22, 23, 24, 25, 26, 28, 29], "\ub298\ub824\uac00\uba74\uc11c": 1, "\ucd94\ub860\ub41c": 1, "\ud004\ub9ac\ud2f0\uac00": [1, 3, 15, 28], "\uc0c1\uc2b9\ud558\ub294": 1, "\uc2e4\ud5d8\ub3c4": 1, "\uc9c4\ud589\ud588\ub294\ub370": 1, "\uc791\uc744\uc218\ub85d": [1, 18], "\uc640": [1, 2, 3, 5, 6, 7, 8, 9, 10, 11, 12, 13, 16, 17, 19, 20, 22, 24, 25, 26, 27, 28, 29], "\ubb34\uad00\ud55c": [1, 27], "random": [1, 2, 5, 7, 8, 10, 13, 14, 15, 19, 24, 25, 27, 28], "\uc0dd\uc131\ud558\uac8c": [1, 18, 24, 26, 28, 29], "\ub429\ub2c8\ub2e4": [1, 4, 6, 10, 17, 18, 21, 24, 25, 27, 28, 29], "\ucd5c\uc885\uc801\uc73c\ub85c": [1, 17, 18, 28], "num_infer": 1, "\uac12\uc740": [1, 11, 18, 23, 24], "\uac01\uac01": [1, 2, 3, 4, 5, 6, 9, 10, 19, 20, 24, 25, 28, 29], "\uacfc": [1, 2, 3, 6, 7, 8, 11, 15, 16, 19, 20, 22, 24, 26, 28], "\uc124\uc815\ud558\uc600\uc2b5\ub2c8\ub2e4": 1, "increas": [1, 6], "number": [1, 23, 28], "guidance_scal": [1, 28], "\uc81c\uc678\ud574\ubcf8": 1, "\uc0dd\uc131\ub41c": [1, 6, 9, 10, 14, 15, 17, 18, 19, 20, 21, 23, 24, 25, 26, 27, 28, 29], "\ub0a8\uc790\uc758": 1, "\uba38\ub9ac\uce74\ub77d\uc774": 1, "\uae38\uc5b4\uc9c0\uace0": 1, "\uc5ec\uc131\uc2a4\ub7ec\uc6b4": 1, "\uc0dd\uae40\uc0c8\ub97c": [1, 19], "\ub180\ub77c\uc6b4": [1, 6, 14, 18], "\uc0ac\uc2e4\ub3c4": 1, "\uadf8": [1, 2, 3, 6, 8, 10, 12, 14, 15, 17, 18, 19, 21, 28], "\uc678": [1, 14, 21], "\ub530\ub978": [1, 4, 6, 11, 18, 20, 22, 24, 27, 29], "\uc7ac\ubbf8\uc788\ub294": 1, "\uc2e4\ud5d8\uacb0\uacfc\ub4e4\uc744": 1, "\uacf5\uc720\ud569\ub2c8\ub2e4": [1, 24, 28], "\uc544\uc9c1": [1, 6, 19, 23], "\uc190\uc758": 1, "\ubaa8\uc591\uc744": 1, "\uc0dd\uc131\ud558\uc9c0": 1, "\ubabb\ud558\ub294": [1, 10, 25], "\uc7ac\ucc28": 1, "climb": 1, "up": [1, 3, 8], "mountain": 1, "paint": [1, 24, 27], "2": [1, 2, 6, 10, 12, 17, 19, 20, 21, 24, 25, 26], "hand": 1, "draw": [1, 16], "\ud558\ub2e8\uc758": 1, "\uc88c\uce21\uacfc": 1, "\uc6b0\uce21": 1, "\uc0ac\uc9c4\uc740": 1, "\uc774\ub77c\ub294": [1, 23, 26], "\ub098\ube44\ub97c": 1, "\uc0dd\uc131\ud558\ub77c\ub294": 1, "\ucd94\ub860\ud574\ubcf8": 1, "\uc218\uc2dd\ud558\ub294": 1, "\uba85\uc0ac\uac00": 1, "\uc774\ub3c4\ub85d": 1, "\uc218\uc815\ud568\uc73c\ub85c\uc368": [1, 11], "butterfli": 1, "\uc0ac\uc9c4\uc744": [1, 2, 21, 23], "\uc0dd\uc131\ud560\ub54c": 1, "\uc870\uae08\uc774\ub098\ub9c8": 1, "\uc6f9\ud230\uc758": 1, "\uadf8\ub9bc\uccb4\ub97c": 1, "\ubc18\uc601\ud560": 1, "\uc788\uc5c8\ub358": 1, "scale": [3, 5, 6, 9, 13, 17, 23, 26, 28], "autoregress": 3, "multi": [3, 5, 6, 19, 26, 28], "modal": [3, 6, 19, 26], "refer": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29], "paper": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30], "http": [2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29], "scontent": 3, "gmp1": 3, "xx": 3, "fbcdn": 3, "net": [3, 8, 12, 20, 24, 26], "v": [2, 3, 5, 6, 8, 10, 12, 13, 16, 19, 22, 24, 25, 28], "t39": 3, "2365": 3, "358725877_789390529544546_1176484804732743296_n": 3, "pdf": [3, 6, 10, 14, 19, 20, 22, 27], "_nc_cat": 3, "108": 3, "ccb": 3, "_nc_sid": 3, "3c67a6": 3, "_nc_ohc": 3, "plfu_ur_vyaax_nagu8": 3, "_nc_ht": 3, "oh": 3, "00_afdrhahxv1pcf0lqicjiynmorpvcgeq0emv5_ve2_tncvg": 3, "oe": 3, "652ff632": 3, "code": [2, 3, 4, 5, 8, 12, 13, 17, 19, 20, 21, 22, 23, 24, 25, 28, 29, 30], "x": [2, 3, 4, 6, 7, 9, 10, 12, 13, 17, 20, 21, 22, 23, 24, 25, 26, 28, 29], "jun": 3, "hyoung": 3, "lee": 3, "oct": [3, 9, 10, 14, 18, 28], "\ubcf5\uc7a1\ud558\uac8c": 3, "\uad6c\uc131\ub41c": [3, 28, 29], "\uac1d\uccb4": [3, 6, 21, 27], "\uc190": 3, "\uc0dd\uc131\ud55c\ub2e4": [3, 15], "\ud14d\uc2a4\ud2b8\uc640": [3, 6, 9, 10, 19], "\ub458": [3, 9], "\ub2e4": 3, "\ub2a5\ub825\uc744": [3, 4, 6, 10, 18, 19, 20], "\uac00\uc9c4": [3, 9, 11, 15, 16, 18, 19, 21, 22, 24, 28, 29], "\uac80\uc0c9": 3, "\uc99d\uac15": 3, "\ud1a0\ud070": 3, "\uae30\ubc18": [3, 10, 14, 15, 18, 19, 22], "\ub514\ucf54\ub354": 3, "\uc804\uc6a9": [3, 26], "\uba40\ud2f0": 3, "\ubaa8\ub2ec": 3, "\ubaa8\ub378\uc774\ub2e4": [3, 9, 15], "cm3": 3, "\uc544\ud0a4\ud14d\ucc98\ub97c": [3, 15], "\uc0ac\uc6a9\ud558\uba70": 3, "\uad6c\uc870\uc801": 3, "\uc2a4\ud0c0\uc77c": [3, 21], "\ub370\uc774\ud130\uc5d0": [3, 15, 17, 18], "tun": 3, "\ud560": [3, 4, 6, 8, 10, 15, 16, 17, 18, 19, 20, 22, 23, 24, 26, 28], "\uc788\ub294": [3, 4, 6, 9, 10, 11, 15, 17, 18, 19, 20, 21, 23, 24, 27, 28], "\uac00\uc84c\ub2e4": 3, "\ubaa8\ub378\uc5d0": [3, 5, 6, 8, 9, 11, 15, 16, 18, 19, 20, 24, 27, 28], "\ub9de\ub3c4\ub85d": [3, 15], "\ud559\uc2b5\ud588\ub2e4": [3, 15], "larg": [3, 5, 18, 20, 28], "scale\uc758": 3, "\ub2e8\uacc4\ub97c": [3, 10, 15, 25], "\ud3ec\ud568\ud55c\ub2e4": [3, 15], "\ub370\uc774\ud130\ub294": 3, "\ub77c\uc774\uc13c\uc2a4\uac00": 3, "shutterstock\uc758": 3, "scale\ub85c": 3, "\ud559\uc2b5\ud55c\ub2e4": [3, 8, 15, 20], "sft": 3, "\ub2e8\uacc4\ub85c": 3, "\uc9c4\ud589\ud588\ub2e4": 3, "\uc785\ub825\uacfc": [3, 21], "\ucd9c\ub825": [3, 21], "\ubaa8\ub450": [3, 5, 6, 7, 10, 11, 14, 15, 16, 17, 18, 19, 21, 23, 24, 27, 28], "\uc774\ubbf8\uc9c0\uc640": [3, 6, 7, 9, 10, 16, 19, 24, 27, 29], "\ud1a0\ud070\uc744": [3, 19], "\uc11e\uc744": 3, "\uc788\ub2e4": [2, 3, 7, 9, 11, 13, 14, 15, 16, 19, 20, 21, 23, 26], "\ud504\ub86c\ud504\ud2b8\uc5d0": 3, "\ub9de\ub294": [3, 24], "\uc774\ubbf8\uc9c0\ub9cc": [3, 23], "\uc0dd\uc131\ud558\ub294\ub370": [3, 28], "cm3leon\uc740": 3, "\uace0\ud574\uc0c1\ub3c4": [3, 15, 18, 21], "output\uc744": [3, 4, 9, 15], "self": [3, 4, 7, 8, 10, 13, 17, 25, 26, 28, 29], "contain": 3, "\uc18c\uac1c\ud55c\ub2e4": [3, 9, 15], "iamg": 3, "\ubd80\ud130": [2, 3, 9, 22], "control": [3, 16, 17, 28], "segmentation\uae4c\uc9c0": 3, "\uac00\ub2a5\ud558\ub2e4": [3, 7, 8, 14, 16, 23], "3\uc5b5": 3, "\uac1c\uc758": [3, 10, 13, 15, 17, 19, 20, 21, 22, 24], "\ud1a0\ud070\uc73c\ub85c": [3, 22], "\ud559\uc2b5\ud588\ub294\ub370": 3, "generation\ub3c4": 3, "\uc218\ud589\ud55c\ub2e4": [3, 20], "\ud559\uc2b5": [3, 4, 5, 6, 7, 8, 9, 11, 12, 13, 14, 15, 16, 18, 20, 21, 22, 23, 24, 25, 26, 28], "\uc5f0\uc0b0\uc744": 3, "5\ubc30\ub85c": 3, "\uc904\uc600\ub2e4": 3, "zero": [3, 6, 9, 13, 18, 20, 22, 26], "shot": [3, 6, 9, 16, 18, 20, 22, 26], "coco\ub85c": [3, 26], "fid\ub97c": [3, 7], "\uce21\uc815\ud55c": 3, "88": 3, "\uc810\uc73c\ub85c": 3, "google\uc758": 3, "parti": 3, "\ubaa8\ub378\uc758": [3, 4, 5, 6, 9, 10, 11, 13, 15, 16, 17, 18, 19, 23, 28], "\uc131\ub2a5\uacfc": [3, 14], "\uc218\uc900\uc744": 3, "\ub2ec\uc131\ud588\ub2e4": 3, "ra": 3, "cm3\ub97c": 3, "\uae30\ubc18\uc73c\ub85c": [3, 6, 10, 12, 14, 16, 21, 24, 29], "t2i": [3, 6, 10, 15], "\ub3c4\uba54\uc778\uc5d0\uc11c": 3, "\uc7a0\uc7ac\ub825\uc744": 3, "\uc5f0\uad6c\ud588\ub2e4": 3, "gafni\uc758": 3, "tokenizer\ub97c": [3, 19], "\uc0ac\uc6a9\ud588\ub2e4": [3, 15, 21], "tokenizer\ub294": 3, "256x256": [3, 9, 15, 18, 27], "8192\uac1c\uc758": 3, "vocabulary\uc5d0\uc11c": 3, "1024\uac1c\uc758": 3, "\uc778\ucf54\ub529\uc744": 3, "\uc9c4\ud589\ud55c\ub2e4": [3, 20], "\ud14d\uc2a4\ud2b8\uc5d0\uc11c\ub294": 3, "zhang\uc758": 3, "\ucee4\uc2a4\ud140": 3, "56320": 3, "vocabulari": 3, "size": [3, 13, 15, 17, 21, 25, 26, 27, 28, 29], "\uc0c8\ub85c\uc6b4": [2, 3, 5, 10, 14, 15, 17, 18, 19, 23, 24, 25, 26, 27, 29], "\uc2a4\ud398\uc15c\ud55c": 3, "\ud1a0\ud070\uc778": 3, "break": 3, "figure_8_9": 3, "modality\uac04": 3, "transition\uc744": 3, "\ud558\uac8c": [3, 6, 15, 20, 23, 25, 27, 28, 29], "\ud55c\ub2e4": [2, 3, 9, 12, 19, 20, 23, 26], "\ubaa9\uc801": 3, "\uc785\ub825": [3, 18, 19, 21, 24, 27, 29], "sequence\uc5d0": 3, "\ub9de\ucdb0": [3, 9, 22], "\uad00\ub828\uc131\uc774": 3, "\ub192\uace0": 3, "\ubb38\uc11c": 3, "from": [3, 6, 8, 17, 25], "memori": [3, 13, 22, 26], "bank": 3, "\uac80\uc0c9\ud558\ub294": 3, "\uac83\uc774\ub2e4": [3, 9, 14, 20], "dens": [3, 13], "strategy\uc744": 3, "\ud3ec\ud568\ud558\uace0": [3, 20, 21], "\ucffc\ub9ac": 3, "q": [3, 9, 12, 22, 23], "\uc608": 3, "sequenc": [3, 6, 13], "mathcal": [3, 4, 8, 9, 12, 13], "m": [3, 6, 7, 8, 12, 16], "\ub85c\ubd80\ud130": [3, 4, 6, 9, 21, 24, 25, 28, 29], "\ud6c4\ubcf4": 3, "\uac00\uc9c0\uace0": [2, 3, 6, 9, 10, 15, 17, 21, 22, 25, 28, 29], "\uad00\ub828\uc131": 3, "\uc810\uc218": [3, 26], "r": [3, 8, 11, 12, 13, 16, 27], "return": [3, 4, 5, 7, 8, 13, 17, 20, 25, 28, 29], "\ud574\uc900\ub2e4": [3, 23], "retriv": 3, "\ubc29\ubc95\uc740": [3, 9, 10, 13, 19, 21, 24], "clip": [3, 5, 6, 10, 15, 19, 20, 22, 24, 26, 27, 28], "\uae30\ubc18\uc778": 3, "bi": 3, "encod": [3, 6, 9, 12, 22, 24, 25, 26, 27, 28, 29], "\uad6c\uc870\ub97c": [3, 4, 12, 17, 21, 22, 24, 26, 28], "\ub530\ub790\ub2e4": 3, "karpukhin": 3, "\ubb38\uc11c\ub97c": 3, "\ud30c\ud2b8\ub85c": 3, "\ubd84\ub9ac\ud558\uace0": 3, "\uc778\ucf54\ub354": 3, "vit": [3, 9, 15, 20, 24], "b": [2, 3, 4, 8, 9, 10, 13, 17, 20, 22, 26], "32": [3, 7, 8, 13, 17, 18, 20, 22, 23, 28], "\ubb38\uc11c\uc758": 3, "vector": [3, 6, 12, 17, 19, 22, 24], "representation\ub85c\uc368": 3, "\ub450": [2, 3, 4, 6, 8, 10, 11, 15, 17, 18, 20, 21, 24], "\uac1c\ub97c": 3, "\ud3c9\uade0\uc744": 3, "\ub0b8\ub2e4": [3, 19], "\ucd5c\uc885": [3, 8, 15, 20, 28], "\uac80\uc0c9\uc740": 3, "\uc810\uc218\uc5d0": [3, 26], "\uc815\ub82c\ub41c": 3, "\ubaa9\ub85d\uc744": 3, "\uc5bb\uae30": 3, "maximum": 3, "inner": [3, 13], "product": [3, 6], "search\ub85c": 3, "generator\ub97c": [3, 17, 25], "\uc720\uc6a9\ud55c": 3, "\ucd94\ucd9c\ud558\uae30": 3, "\uc138": [3, 6, 15, 17, 21, 24], "\uac00\uc9c0": [3, 4, 6, 10, 15, 17, 19, 21, 24], "\uc694\uc18c\ub97c": [3, 10, 18], "\uace0\ub824\ud588\ub2e4": 3, "relev": [3, 7], "\uac80\uc0c9\ub41c": 3, "\ubb38\uc11c\ub294": 3, "\uad00\ub828\uc788\uc5b4\uc57c": 3, "\uc810\uc218\ub97c": [3, 6, 9, 22, 26], "\uc0ac\uc6a9\ud55c\ub2e4": [3, 9, 15], "\ud14d\uc2a4\ud2b8\ub85c": [3, 4], "\ubb38\uc11c\ub85c": 3, "\ub610\ub294": [3, 9, 10, 14, 19, 21], "divers": [3, 6, 10, 11, 23, 24], "\ub2e4\uc591\uc131\uc740": 3, "\ubb38\uc11c\uc5d0\uc11c": 3, "\uc911\ubcf5\uc131\uc744": 3, "\ud53c\ud558\uae30": 3, "\ud544\uc218\uc801\uc778": 3, "\uc808\ucc28\ub2e4": 3, "\ub2e8\uc21c\ud558\uac8c": 3, "\uae30\ubc18\ud574": [3, 13], "top": [3, 6, 15, 20], "\ubb38\uc11c\ub9cc": 3, "\uac00\uc838\uc628\ub2e4\uba74": 3, "\uc911\ubcf5\uc774": 3, "\ubc1c\uc0dd\ud560": 3, "downstream": [3, 13], "\uc548\uc88b\uc740": 3, "\ub07c\uce60": 3, "\uc810\uc218\uac00": [3, 22, 26], "\uc774\ud558\ub85c": 3, "queri": [3, 5, 12, 13, 19], "dropout": 3, "\uac80\uc0c9\uc5d0": 3, "\uc0ac\uc6a9\ub41c": [3, 21], "\ucffc\ub9ac\uc758": 3, "\uc0ad\uc81c": [3, 8], "\uc801\uc6a9\ud588\ub2e4": 3, "\ub2e4\uc591\uc131\uacfc": [3, 18], "\uc815\uaddc\ud654\ub97c": [3, 17], "\uc2dc\ucf30\ub2e4": [3, 15], "\ud14d\uc2a4\ud2b8\ub97c": [3, 6, 17, 18, 19], "\uac80\uc0c9\ud55c\ub2e4": 3, "\ud559\uc2b5\uc5d0\uc11c\ub294": 3, "\ub370\uc774\ud130\uc14b\uc758": [3, 6, 9, 18, 20, 27], "\ubaa8\ub4e0": [3, 5, 7, 10, 13, 14, 15, 16, 17, 19, 20, 21, 23], "\ucea1\uc158": [3, 9], "\uc30d\uc5d0": 3, "\ub300\ud574": [3, 4, 5, 8, 9, 10, 11, 15, 18, 19, 20, 21, 24, 25, 27, 28, 29], "\uc0d8\ud50c": [3, 6, 12, 17, 18], "3\uac1c\ub97c": 3, "\ubb34\uc791\uc704\ub85c": [3, 19], "\uc120\ud0dd\ud55c\ub2e4": 3, "\uc0ac\uc2e4\uc0c1": 3, "\uc0ac\uc804": [3, 6, 19, 24], "\ud559\uc2b5\uc5d0\uc11c": 3, "\uc0ac\uc6a9\ud560": [3, 4, 9, 13, 18], "\uc218\uc758": [3, 20], "4\ubc30\uc774\ub2e4": 3, "chameleon": 3, "\ubcc0\ud615\uc2dc\ucf1c": 3, "mask": [3, 22, 27, 28], "infil": 3, "\ud45c\ud604\ud55c\ub2e4": 3, "\ucd94\uac00\ub418\uc5c8\uace0": 3, "\ub2e8\uc5b4\uc758": 3, "\uc7ac\ubc30\uce58\uac00": 3, "\uc9c4\ud589\ub410\ub2e4": 3, "\ud559\uc2b5\uc5d0\ub294": 3, "\ub2e4\uc74c": [2, 3, 17, 19, 21, 26, 28], "\uc608\uce21\ud558\ub294": [3, 8, 9, 10], "\ub2e4\uc6a9\ub3c4": 3, "\uac00\uc838\uc654\ub2e4": [3, 15], "generation\uc5d0\uc11c\ub294": 3, "cm3\uac00": 3, "\ud504\ub86c\ud504\ud2b8\ub85c": [3, 18], "\uc0dd\uc131\ud558\uace0": [3, 19, 22, 24], "cm3\ub294": 3, "\ud504\ub86c\ud504\ud2b8\ub97c": [3, 10, 15, 18], "\ud65c\uc6a9\ud55c\ub2e4": 3, "\ub514\ucf54\ub354\ub9cc": 3, "\uc0ac\uc6a9\ud558\ub294": [3, 4, 6, 8, 17, 21, 23, 24, 26, 27], "transform": [3, 8, 9, 11, 15, 18, 20], "\uc544\ud0a4\ud14d\uccd0\ub97c": [3, 6], "zhang\uc5d0": 3, "bia": [3, 6, 8], "term": [3, 11, 21, 29], "layer": [3, 4, 7, 8, 13, 17, 20, 21, 25, 26, 28], "norm\uc758": 3, "\uac00\ub2a5\ud55c": [3, 7, 8, 12, 16, 20, 21, 28], "\ud30c\ub77c\ubbf8\ud130\ub97c": [3, 10, 13, 15, 18], "\uc81c\uac70\ud588\ub2e4": [3, 15], "length\ub97c": 3, "2048": 3, "4096\uae4c\uc9c0": 3, "\ud655\uc7a5\ud588\ub2e4": 3, "weight": [3, 5, 7, 10, 13, 18, 27, 28], "\ucd08\uae30\ud654": 3, "\ud3c9\uade0": [2, 3, 8, 24, 29], "\ud45c\uc900": [3, 21], "\ud3b8\ucc28": 3, "006": 3, "\uc778": [2, 3, 6, 12, 20], "truncat": 3, "3\uc73c\ub85c": [3, 26], "\uc798\ub9b0": [3, 15], "normal": [3, 7, 8, 17, 21, 25, 26], "distribut": [3, 6, 7, 8, 20, 22, 23, 28, 29], "output": [3, 4, 6, 8, 13, 21, 22, 24, 27], "0\uc73c\ub85c": [3, 4, 18, 21, 23, 27], "0\uc5d0": 3, "0002\ub85c": [3, 21], "posit": [3, 8, 9, 16], "embed": [3, 4, 5, 6, 8, 9, 13, 16, 22, 23, 24, 28], "\ucd08\uae30\ud654\ud55c\ub2e4": 3, "metaseq": 3, "\ud559\uc2b5\ub410\ub2e4": 3, "\uc0ac\uc774\uc988": 3, "350m": 3, "760m": 3, "7b": 3, "4t": 3, "trillion": 3, "9t": 3, "\uc8fc\uc694\ud55c": [3, 10], "\ud558\uc774\ud37c": 3, "\ud30c\ub77c\ubbf8\ud130\ub294": [3, 28], "learn": [3, 6, 13, 16, 17, 18, 19, 21, 26, 28], "rate": [3, 5, 7, 16, 28], "batch": [3, 5, 13, 21, 24, 25, 28], "size\ub85c": 3, "\uba40\ud2f0\ubaa8\ub2ec": 3, "\ub9de\uac8c": [3, 6, 9, 13, 15, 18], "\uc124\uc815\ud588\ub2e4": [3, 21], "\ucc38\uace0": [2, 3, 6, 18, 20], "perplex": 3, "ppl": [3, 24], "\uc5b8\uc5b4": [3, 20], "\ud3c9\uac00": [3, 5, 14, 21, 24], "\ubc29\ubc95": [3, 6, 20, 23, 26, 30], "\ud558\ub098\uc774\ub2e4": 3, "\ud5f7\uac08\ub9ac\ub294": 3, "\uac12\uc774": [3, 5, 7, 11, 16, 21, 22, 23, 26], "\ub0ae\uc744": [3, 4], "\uc218\ub85d": 3, "\uc88b\ub2e4": [3, 6, 13, 23], "\ubaa8\ub378\uc5d0\uc11c": [3, 15, 18, 19, 20, 23, 26], "\uc54c\uace0\ub9ac\uc998\uc5d0": 3, "\uc0c1\ub2f9\ud55c": 3, "\uc5f0\uad6c\uac00": [3, 9, 10], "\uc9c4\ud589\ub418\uc5b4": 3, "\uc654\ub2e4": [3, 11], "dall": [3, 6, 9, 10, 24, 26, 27], "e\ub294": [3, 22], "\uc544\uc6c3\ud48b\uc758": 3, "\ud5a5\uc0c1\ub418\ub294": [3, 18], "e": [2, 3, 5, 6, 7, 8, 11, 12, 14, 16, 17, 24, 25, 26, 28, 29], "\ub294": [2, 3, 4, 6, 8, 9, 10, 11, 12, 13, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 27, 28, 29], "\uc0d8\ud50c\ub9c1\uacfc": 3, "512\uac1c": [3, 22], "re": 3, "rank": 3, "\uc804\ub7b5\uc744": [3, 10], "\ucc44\ud0dd\ud588\ub2e4": 3, "make": [3, 13], "scene": [3, 6, 27], "\uae30\ubc18\uc758": [3, 10, 20, 27], "guidance\ub85c": 3, "ranking\uc5d0": 3, "\uc624\uc9c1": [3, 21], "16": [3, 8, 12, 13, 15, 17, 20, 22, 23, 24, 28], "\uc0d8\ud50c\ub9cc": 3, "\ud544\uc694\ud558\uac8c": 3, "\ub428\uc73c\ub85c\uc368": [3, 27], "\ud6c4\ubcf4\uc758": 3, "\uc218\ub97c": [3, 15, 22], "\ud655\ub960\uc801": 3, "\uae30\uc220\ub85c": [3, 19], "\uc0ac\uc6a9\ub41c\ub2e4": [3, 8, 15], "\uc0d8\ud50c\ub9c1\uc5d0\uc11c": 3, "softmax\uc758": 3, "temperature\ub97c": 3, "\uc218\uc815\ud574": [3, 6], "\uc608\uce21": [3, 6, 7, 8, 10, 22], "\ubb34\uc791\uc704\uc131\uc744": 3, "\uc81c\uc5b4\ud55c\ub2e4": 3, "nucleu": 3, "\uc0d8\ud50c\ub9c1\uc73c\ub85c\ub3c4": 3, "\ubd88\ub9ac\uace0": 3, "\ubbf8\ub9ac": [3, 19], "\uc815\uc758\ud55c": [3, 11], "\uc784\uacc4\uac12\uc744": [3, 26], "\ucd08\uacfc\ud558\ub294": 3, "\ub204\uc801": 3, "\ud655\ub960\uc744": [3, 6, 25], "\uac00\uc7a5": [2, 3, 5, 6, 9, 10, 17, 18, 19, 20, 21, 24], "\uc791\uc740": [2, 3, 5, 8, 10, 13, 15, 18, 19, 20, 21], "\uc0c1\uc704": 3, "\uc138\ud2b8\uc5d0\uc11c": 3, "\uc0d8\ud50c\ub9c1\uc744": [3, 18, 25], "begin": [3, 8, 28], "align": [3, 5, 8, 18, 19, 26, 27, 28], "operatornam": 3, "logit": 3, "_": [3, 4, 10, 12, 13, 17, 20, 24, 25, 28, 29], "cond": [3, 15], "t": [2, 3, 4, 5, 6, 7, 9, 11, 12, 20, 22, 23, 24, 28], "left": [3, 8, 10, 12, 13, 15, 23, 26, 29], "t_y": 3, "mid": [3, 8, 13, 14], "t_x": 3, "right": [3, 8, 10, 12, 13, 23, 26, 29], "uncond": 3, "bf": [3, 23], "mathrm": [3, 8], "cf": 3, "alpha_c": 3, "cdot": [3, 12, 25, 28], "end": [3, 8, 17], "cfg\ub294": 3, "uncondit": [3, 8, 9, 23], "\uc0d8\ud50c\uc744": [3, 9, 10, 19, 21, 29], "condit": [2, 3, 5, 6, 8, 11, 14, 18, 20, 23, 24, 27, 28], "\uc0d8\ud50c\uc5d0": [3, 15, 18], "\ud558\ub294": [2, 3, 4, 8, 9, 10, 11, 14, 15, 16, 17, 18, 19, 21, 23, 24, 26, 28, 29], "\uac83\uc744": [3, 4, 6, 8, 10, 11, 14, 15, 17, 18, 19, 21, 22, 23, 24, 26, 27, 28], "\uc758\ubbf8\ud55c\ub2e4": [3, 14], "text\ub97c": [3, 9, 22], "\ubaa9\ud45c\uc758": 3, "\ub9c8\uc2a4\ud06c": [3, 9], "\ub300\uccb4\ud55c\ub2e4": 3, "\ubaa9\ud45c\ub97c": 3, "\ud559\uc2b5\uc758": 3, "\ud575\uc2ec": [3, 10, 14, 18, 20, 23], "\uc774\uc810": 3, "\ud558\ub098\uc774\uba70": 3, "finetun": [3, 5], "\uc5c6\uc774": [3, 5, 7, 8, 9, 10, 15, 16, 18, 19, 20, 21, 22, 23, 26, 28], "\uc5c6\ub294": [3, 15, 16, 21, 23, 24, 26], "guidance\ub97c": [3, 9, 23, 26], "\uc218\ud589\ud560": [3, 6, 20], "\ucd94\ub860\uc5d0\uc11c\ub294": 3, "stream\uc744": 3, "\ud14d\uc2a4\ud2b8\uc5d0": [3, 6], "\ub2ec\ub77c\uc9c0\ub294": [3, 24], "stream\uacfc": 3, "\ud1a0\ud070\uc5d0": 3, "condition\ub41c": 3, "stream": 3, "cfg\uc5d0\uc11c": 3, "logit\uc758": 3, "\ube84\uc148": 3, "\uc5f0\uc0b0\uc774": [3, 12], "\ud14d\uc2a4\ud2b8\uc5d0\uc11c": [3, 10], "\ubc29\ubc95\uc758": [3, 19], "log": [3, 8, 13, 18, 22, 23, 25, 29], "probability\ub97c": 3, "\ube84\uc148\ud558\ub294": 3, "\uc5f0\uc0b0\uacfc": 3, "\ube44\uc2b7\ud558\ub2e4": [3, 15], "ms": [3, 9, 22, 26], "coco": [3, 9, 22, 26, 28], "30k": 3, "fid": [3, 6, 8, 9, 11, 12, 15, 17, 22, 23, 26, 28], "\uce21\uc815\ud588\ub2e4": [3, 20], "onli": [3, 7, 15, 16, 22], "\ud6a8\uc728\uc131\uc774": 3, "\ucd94\ub860\uc5d0\uc11c": 3, "1\uac1c": [3, 22], "2\uac1c\ub85c": 3, "\uc608\uc81c\ub85c": 3, "\ub3d9\uc791\ud560": [3, 18], "\uc6b0\uc218\ud55c": [3, 7, 10, 15, 21, 22], "\uae30\ub85d\ud588\ub2e4": [3, 16], "\uace0\ud488\uc9c8": [3, 6, 10, 15], "\ud655\uc7a5\uc2dc\ud0a4\ub294": 3, "\uac80\uc0c9\uc758": 3, "\uc911\uc694\uc131\uc744": [3, 10, 21], "\ubcf4\uc5ec\uc900\ub2e4": [3, 7, 9, 15, 16, 19, 26], "figure5": 3, "llm\uc5d0\uc11c": 3, "\uc911\uc694\ud55c": [3, 6, 7, 11, 17, 19, 21], "\ub2e8\uacc4\uc774\ub2e4": 3, "\uba85\ub839\uc5b4": 3, "\uc774\ud574\ud558\ub294": 3, "\ub3c4\uc640\uc8fc\uba70": 3, "task\uc5d0\uc11c\ub3c4": 3, "\uc5bb\uc5c8\ub2e4": [3, 15], "\ud29c\ub2dd\uc774": 3, "task\uc5d0": [3, 4, 12, 13, 19, 21], "\ub208\uc5d0": 3, "\ub744\uac8c": 3, "\uc99d\ud3ed\uc2dc\ud0a4\ub294": 3, "\ubc1c\uacac\ud588\ub2e4": 3, "cm3leon\uc744": 3, "task\ub97c": [3, 13, 14, 19, 22], "\uc11e\uc5b4": 3, "\ub113\uc740": 3, "\ubc94\uc704\uc5d0\uc11c": 3, "\ud588\ub2e4": [3, 15, 20, 21], "\uacfc\uc815\uc740": 3, "\ub530\ub974\uba70": 3, "instruction\uacfc": 3, "\ucd9c\ub825\uc744": 3, "\uacb0\ud569\ud574": 3, "objective\ub97c": [3, 13, 19], "figure6": 3, "\uae30\ubc18\ud55c": [3, 7], "initi": [3, 13], "image\ub97c": [3, 5, 8, 9, 10, 12, 14, 18, 19, 21, 22], "\uc218\uc815\ud558\ub294": [3, 19], "task\uc774\ub2e4": 3, "instructpix2pix": 3, "\ud558\ub298\uc758": 3, "\uc0c9\uc744": 3, "\ud30c\ub780\uc0c9\uc73c\ub85c": 3, "\ubcc0\uacbd\ud574\uc918": 3, "\ud3b8\uc9d1\uc774": 3, "\uc774\uac83\uc740": [3, 6, 10, 18], "cm3leon\uc774": 3, "\ub3d9\uc2dc\uc5d0": [3, 5, 15, 16], "\uc774\ud574\ud558\uace0": 3, "\uc788\uc5b4\uc11c": 3, "feature\uacfc": [3, 15], "\uc0dd\uc0b0\ud558\ub294": 3, "controlnet": [3, 28], "\uc0dd\uc131\uc5d0": [3, 9, 10, 18, 19, 28], "\uacf5\uac04\uc801": 3, "\uc815\ubcf4": [3, 16], "\uc704\uce58": 3, "\ud1b5\ud569\uc2dc\ud0ac": [3, 6], "\uc788\ub3c4\ub85d": [3, 10, 15, 19, 21, 27], "figure16": 3, "flamingo": 3, "1000\uc5b5": 3, "openflamingo": 3, "400\uc5b5": 3, "30\uc5b5": 3, "\uc740": [2, 3, 5, 6, 8, 15, 17, 19, 20, 21, 23, 24, 25, 26, 28, 29], "\ud1a0\ud070\uc784\uc5d0\ub3c4": 3, "\ubd88\uad6c\ud558\uace0": [3, 6, 9, 18, 21], "\ub3d9\ub4f1\ud55c": 3, "ad": [4, 28], "arxiv": [2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29], "org": [2, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29], "ab": [2, 4, 5, 7, 8, 9, 11, 12, 13, 14, 16, 17, 18, 19, 21, 22, 23, 24, 25, 26, 28, 29], "2302": [4, 28], "05543": 4, "lllyasviel": 4, "mai": [4, 12, 19, 23, 24], "28": [4, 29], "\uae30\uc874\uc758": [4, 6, 14, 18, 28], "\ubaa8\ub378\ub4e4\uc740": [4, 5, 6, 23], "prompt\ub85c": [4, 16, 19], "\uc870\uc808\ud560": [4, 17, 18], "\ud558\uc9c0\ub9cc": [4, 5, 6, 7, 9, 10, 11, 13, 14, 16, 18, 19, 20, 21, 23, 24, 25, 29], "\uc774\ub7f0": [4, 6, 17, 18], "control\ub9cc\uc73c\ub85c": 4, "\uc870\uc808\ud558\ub294\ub370": 4, "\ud55c\uacc4\uac00": [4, 16, 19, 21, 26, 28], "condition\uc744": [4, 5, 15], "\ucd94\uac00\uc801\uc73c\ub85c": [4, 6, 9, 15, 18], "\uc918\uc11c": 4, "\uc0dd\uc131\ub418\ub294": [4, 14, 16, 17, 18, 23, 28], "controlnet\uc774\ub77c\ub294": 4, "\uc2e0\uacbd\ub9dd": [4, 10], "\uc81c\uc548\ud569\ub2c8\ub2e4": [4, 6, 10], "\uadf8\ub9bc\uc740": [4, 6, 12, 15, 17, 18], "high": [4, 6, 10, 11, 12, 14, 15, 17, 18, 22, 25, 26, 28], "qualiti": [4, 7, 16, 22, 23, 25, 26, 29], "detail": [4, 6, 10, 16, 20], "profession": 4, "prompt\uc640": [4, 5, 9], "\uc67c\ucabd": [4, 10, 15, 18], "\uc544\ub798\uc758": [4, 6, 12, 15, 26], "canni": 4, "edge\ub97c": 4, "input\uc73c\ub85c": [4, 12, 15, 17], "\ubc1b\uc544\uc11c": [4, 6, 10, 17, 29], "\uc624\ub978\ucabd\uc758": 4, "\uc2dd\uc73c\ub85c": 4, "\ucd94\uac00\uc801\uc778": [4, 5, 8, 9, 12, 13, 14, 17, 19, 21, 28], "\uadf8\ub9bc\uc5d0\uc11c\ub294": 4, "edg": [4, 21, 28], "\ubc1b\uc544": [4, 8, 18, 23], "\uac83\uc774": [2, 4, 6, 8, 9, 10, 13, 17, 18, 19, 21, 23, 24, 26, 27, 28, 29], "controlnet\uc774": 4, "\uc5ed\ud560\uc785\ub2c8\ub2e4": 4, "gener": [4, 6, 9, 10, 11, 16, 17, 19, 20, 22, 23, 24, 25, 26, 28], "conrolnet": 4, "\uadf8\ub7ec\uba74": [4, 17], "\uc5b4\ub5a4": [4, 6, 8, 9, 13, 15, 18, 19, 20, 21, 29], "\uac00\ub2a5\ud558\uac8c": [4, 13, 17], "\ud588\uc744\uae4c\uc694": [4, 6], "\uc774\uc81c\ubd80\ud130": 4, "\uc54c\uc544\ubcf4\ub3c4\ub85d": [4, 17], "\ud558\uaca0\uc2b5\ub2c8\ub2e4": [4, 10, 17, 29], "controlnet\uc758": 4, "\uad6c\uc870\ub294": [4, 17, 18, 28], "\ub2e4\uc74c\uacfc": [2, 4, 6, 9, 10, 12, 17, 19, 21, 24, 25, 28, 29], "\uac00\uc9d1\ub2c8\ub2e4": [4, 12], "pretrain": [4, 5, 9, 13, 14, 15, 16, 18, 19, 20, 22], "lock": 4, "copy\uc640": 4, "trainabl": [4, 7, 10, 11, 13], "copy\ub97c": 4, "\uc0ac\uc6a9": [4, 5, 8, 9, 11, 16, 21, 22, 23, 26], "\uc65c": [2, 4, 6, 11, 20], "\uc774\ub807\uac8c": [4, 10, 18, 19, 20, 25], "\uc124\uacc4\ud588\ub294\uc9c0": 4, "\uc54c\uc544\ubd05\uc2dc\ub2e4": 4, "\uc6b0\uc120": [4, 6, 9, 15, 24], "\uc774\uc720\ub294": [4, 8, 15], "\uae30\uc874\uc5d0": [4, 5, 12, 13, 14, 17, 18], "\ubc29\ub300\ud55c": 4, "\uc591\uc758": 4, "\ud559\uc2b5\uc2dc\ud0a8": [4, 9, 20], "\uc720\uc9c0\ud558\uae30": 4, "\uc704\ud574\uc11c\uc785\ub2c8\ub2e4": 4, "\ub370\uc774\ud130\uac00": [4, 8, 15, 17, 18, 19, 21, 25, 29], "\uc591\uc774": [4, 18], "\uacbd\uc6b0\uc5d0": [4, 18, 21, 28], "\uc624\ubc84\ud53c\ud305\uc744": 4, "\ud53c\ud560": 4, "\ud6a8\uacfc\ub3c4": 4, "convolution\uc774\ub780": 4, "weight\ub791": 4, "bias\uac00": [4, 17], "\ucd08\uae30\ud654\ud55c": 4, "1x1": 4, "convolution\uc744": 4, "\ub9d0\ud569\ub2c8\ub2e4": [4, 21], "\ud6c8\ub828\uc774": 4, "\uc2dc\uc791\ub418\uae30": 4, "\uc804\uc5d0\ub294": 4, "input\uc5d0": [4, 22], "model\uacfc": [4, 9, 11, 12], "output\uc774": [4, 21], "\ub611\uac19\uc544\uc9d1\ub2c8\ub2e4": 4, "\ubaa8\ub378\uc774\ub791": 4, "\ub611\uac19\uc740": 4, "\uac00\uc9c0\uac8c\ub418\ubbc0\ub85c": 4, "\uc720\uc9c0\ud560": [4, 10, 14, 19], "\uc788\uc73c\uba70": [4, 10, 18, 21, 22, 26], "\uac83\uacfc": [4, 9, 19, 21, 27], "\ube44\uc2b7\ud558\ubbc0\ub85c": 4, "scratch\ubd80\ud130": 4, "\ud559\uc2b5\ud558\ub294": [4, 8, 11, 14, 17, 21, 25], "\uac83\uc5d0": [4, 19, 21], "\ube60\ub974\uac8c": [4, 10, 11, 23, 26], "\ud6c8\ub828\uc2dc\ud0ac": 4, "\uc788\uac8c\ub429\ub2c8\ub2e4": 4, "convolution\uc740": 4, "\uc5b4\ub5bb\uac8c": [4, 6, 8, 18, 19], "\ud558\ub294\uc9c0": 4, "\uc790\uc138\ud788": [4, 6, 10, 17, 24], "\uba3c\uc800": [4, 10, 17, 18, 20, 22, 25], "\uc704\uc758": [2, 4, 6, 7, 9, 11, 18, 20], "\uadf8\ub9bc\uc5d0\uc11c": [4, 6, 17, 18, 21, 24], "\ud574\ub2f9\ud558\ub294": [4, 9, 27], "\ubd80\ubd84\uc744": [4, 9, 15, 18, 24, 27, 28], "\uc218\uc2dd\uc73c\ub85c": 4, "\ud45c\ud604\ud558\uaca0\uc2b5\ub2c8\ub2e4": 4, "mathbf": [4, 17], "y": [2, 4, 6, 7, 8, 13, 17, 21, 22, 23, 28], "f": [4, 7, 8, 9, 12, 13, 20, 21, 24, 25], "theta": [4, 8, 9, 10, 12, 13, 22, 23, 24, 25, 28, 29], "featur": [4, 5, 6, 8, 15, 16, 17, 20, 24, 28], "map": [4, 6, 8, 16, 21, 28, 29], "neural": [4, 7, 11, 21], "network": [4, 7, 8, 10, 14, 21, 25], "paramet": [4, 7, 8, 10, 13, 16, 22, 24, 26, 28], "\uc758\ubbf8\ud569\ub2c8\ub2e4": [4, 10, 18], "\uc704": [2, 4, 6, 8, 10, 11, 13, 16, 18, 19, 20, 21], "\uadf8\ub9bc\uc758": [4, 6], "\ud45c\ud604\ud558\uae30\uc704\ud574": 4, "\ub9cc\ub4e4\uc5b4\uc11c": [4, 6, 19], "parameter\ub97c": [4, 5, 8, 18, 22, 23], "theta_": 4, "c": [4, 5, 8, 9, 10, 12, 16, 17, 20, 24, 28], "\ub77c\uace0\ud558\uace0": 4, "\uace0\uc815\uc2dc\ucf1c\ub450\uaca0\uc2b5\ub2c8\ub2e4": 4, "z": [2, 4, 8, 12, 13, 16, 22, 23, 25, 29], "\ud45c\ud604\ud558\uace0": 4, "convolution\uc758": 4, "z1": 4, "z2": 4, "\ub450\uaca0\uc2b5\ub2c8\ub2e4": 4, "\ud45c\ud604\ud560": [2, 4, 10], "\uadf8\ub7f0\ub370": [4, 17], "weight\uc640": [4, 18], "bias\uc758": 4, "\ucd08\uae43\uac12\uc774": 4, "0\uc774\ubbc0\ub85c": 4, "\uc9c4\ud589\ub418\uc9c0": 4, "\uc54a\uc558\uc744": [4, 18], "\uc785\ub2c8\ub2e4": [4, 6, 10, 18, 21, 25, 29], "\uc2dc\uc791": 4, "controlnet\uacfc": 4, "\ub0b4\ubbc0\ub85c": 4, "\ubcf4\uc874\ud560": 4, "\uc804\ubd80": 4, "\ucd08\uae30\ud654\ub418\uc5b4\uc788\uc73c\uba74": 4, "gradient\uac00": [4, 21], "0\uc774\ub77c\uc11c": 4, "\uc548": [4, 6, 21], "\ub418\ub294\uac70": 4, "\uc544\ub2d0\uae4c\uc694": 4, "\ud655\uc778\ud558\uae30": [4, 9], "\uac04\ub2e8\ud55c": [2, 4, 15, 19], "\uacbd\uc6b0\ub97c": [4, 10], "\uc0dd\uac01\ud574\ubcf4\uc8e0": 4, "wx": 4, "gradient\ub294": 4, "frac": [4, 8, 12, 17, 23, 25, 29], "partial": [4, 5, 8], "0\uc774\uace0": [4, 18], "neq0": 4, "\uc774\ub77c\uace0": [2, 4, 6, 19], "\ud558\uba74": [4, 6, 18, 20, 21], "\uccab": [4, 15, 17, 21], "\ubc88\uc9f8": [4, 15, 17, 18], "gradient": [4, 7, 8, 11, 13, 23, 25], "step\uc5d0\uc11c": [4, 8, 11], "weight\ub294": [4, 13], "0\uc774": [4, 7, 8], "\uac12\uc73c\ub85c": [4, 9, 11, 13, 29], "\uac00\uac8c\ub418\uace0": 4, "\ub418\ubbc0\ub85c": 4, "\uc5ec\uae30\uc11c": [2, 4, 8, 10, 12, 17, 18, 19, 21, 29], "\ud575\uc2ec\uc801\uc778": [4, 9], "\uac00\uc815\uc774": 4, "\uc778\ub370": 4, "\ubd80\ubd84\uc740": [4, 9, 10, 12, 17, 18, 28], "\ud6c8\ub828\ub41c": [4, 10, 18, 19], "\uc0ac\uc6a9\ud558\uace0": [4, 5, 10, 13, 16, 18, 19, 24, 27], "\uc788\uae30": [4, 21], "\ub54c\ubb38\uc5d0": [4, 6, 8, 10, 13, 17, 18, 21, 24, 28, 29], "\uc704\ubc30\ub420": 4, "\uac00\ub2a5\uc131\uc774": [4, 11], "\uc9c0\uae08\uae4c\uc9c0": [4, 8], "\uc598\uae30\ud55c": 4, "diffusion\uc5d0": 4, "\uc801\uc6a9\ud55c": [4, 9, 11, 14, 15, 28], "\uadf8\ub9bc\uacfc": [4, 17, 21, 22, 25, 26, 29], "overal": [4, 6, 26], "structur": [4, 12, 13, 17, 28], "loss\ub294": [4, 8], "diffusion\uc5d0\uc11c": 4, "\ucd94\uac00\ub41c": [4, 8], "\ud615\ud0dc\uc785\ub2c8\ub2e4": [4, 17], "loss": [4, 7, 10, 11, 12, 14, 16, 20, 22, 23, 24, 25, 29], "training\uc744": [4, 9], "50": [4, 8, 18, 19, 20, 21], "\ud655\ub960\ub85c": 4, "empti": [4, 9], "string\uc73c\ub85c": 4, "\ubc14\uafd4\uc8fc\uc5c8\ub2e4\uace0": 4, "prompt\uac00": [4, 5], "\uc8fc\uc5b4\uc9c0\uc9c0\uc54a\uc744": 4, "semantics\ub97c": 4, "\ubc30\uc6b0\ub294": 4, "\uacbd\ud5a5\uc774": [4, 5, 6, 16], "\uc0dd\uc131\uc744": [4, 10, 19, 26], "\ud5a5\uc0c1\uc2dc\ucf1c\uc904": 4, "\uc788\ub2e4\uace0": [4, 6, 8, 9, 15, 18, 19, 22, 28, 29], "\uacb0\uacfc\ub294": [4, 9, 11, 18, 21], "training\uc774": 4, "\ubc29\ubc95\ubcf4\ub2e4": 4, "\ud6a8\uc728\uc801\uc774\ub77c\ub294": 4, "\ubcf4\uc5ec\uc90d\ub2c8\ub2e4": [4, 21, 24, 28], "effici": [4, 6, 7, 13, 26], "\uacb0\uacfc\ub4e4\uc740": 4, "\uacb0\uacfc\ub4e4\uc785\ub2c8\ub2e4": 4, "\ub17c\ubb38\uc5d0": [4, 8, 18, 29], "\uc788\uc73c\ub2c8": [2, 4], "\ucc38\uace0\ud558\uc2dc\uae30": 4, "\ubc14\ub78d\ub2c8\ub2e4": 4, "pose": [4, 16, 17, 24, 28], "limitation\uc774\ub77c\uace0": 4, "\uc774\ubbf8\uc9c0\uc785\ub2c8\ub2e4": [4, 6], "\uc8fc\uc5c8\uc74c\uc5d0\ub3c4": 4, "\uc6d0\ud558\ub294": [4, 5, 6, 14, 16, 17, 18, 21], "\uc0dd\uc131\ub418\uc9c0": 4, "\uc54a\ub294": [4, 6, 10, 18, 21, 24], "\ubc1c\uc0dd\ud588\uc2b5\ub2c8\ub2e4": 4, "limit": 4, "\ucf54\ub4dc\ub294": 4, "\uacf5\uc2dd": 4, "\uad6c\ud604": [4, 20, 25, 29], "\uac00\uc838\uc654\uc2b5\ub2c8\ub2e4": 4, "\ucd08\uae30\ud654\ud558\ub294": 4, "\ucf54\ub4dc\ub85c": [4, 15], "\ub9cc\ub4e4": [2, 4, 6, 19, 22], "\uc0ac\uc6a9\ub429\ub2c8\ub2e4": [4, 10], "def": [4, 7, 8, 13, 17, 20, 25, 28, 29], "zero_modul": 4, "modul": [4, 8, 13, 17, 25, 28, 29], "out": [4, 8, 20, 28, 29], "p": [4, 6, 11, 23, 24, 29], "detach": [4, 20, 25], "zero_": 4, "\uae30\ubcf8\uc801\uc73c\ub85c": [4, 13, 16, 26, 28], "nn": [4, 8, 13, 17, 25, 28, 29], "sequential\uacfc": 4, "\uac19\uc740\ub370": 4, "time": [2, 4, 5, 7, 8, 9, 11, 12, 13, 15, 16, 20, 22, 23, 26, 28], "step\uac19\uc740": 4, "input\uc744": 4, "\ubc1b\uc544\uc904": 4, "\uc788\uac8c": [4, 6, 15, 18, 26], "\ub9cc\ub4e0": [4, 17, 18, 21], "timestepembedsequenti": 4, "sequenti": [4, 8, 17, 25, 28], "timestepblock": 4, "pass": [4, 6], "timestep": [4, 5, 6, 20, 24, 28], "children": 4, "support": 4, "an": [4, 6, 17, 19, 29], "extra": [4, 6], "forward": [4, 9, 11, 13, 14, 17, 21, 24, 25, 28, 29], "emb": [4, 8], "context": [4, 6, 8, 13, 15, 16, 22, 24], "none": [2, 4, 8, 17, 20, 28], "isinst": 4, "elif": [4, 8, 20, 24], "spatialtransform": 4, "els": [4, 7, 8, 13, 17, 20, 24, 28], "github\uc758": 4, "cldm": 4, "py\uc5d0": 4, "class\uc785\ub2c8\ub2e4": 4, "init": [4, 13], "\uae38\uc5b4\uc11c": 4, "\uc0dd\ub7b5\ud588\uc2b5\ub2c8\ub2e4": 4, "__init__": [4, 7, 8, 17, 25, 28, 29], "make_zero_conv": 4, "channel": [4, 8, 17, 23, 27, 28], "conv_nd": 4, "dim": [4, 8, 17, 20, 24, 28], "pad": [4, 8, 28], "hint": [4, 5], "kwarg": 4, "t_emb": 4, "timestep_embed": 4, "model_channel": 4, "repeat_onli": 4, "fals": [4, 7, 8, 13, 20, 25, 28], "time_emb": 4, "guided_hint": 4, "input_hint_block": 4, "h": [4, 8, 12, 13, 20, 22, 23, 28], "type": [4, 24, 25], "dtype": [4, 8, 20, 24], "zero_conv": 4, "zip": [4, 7, 8], "input_block": 4, "append": [4, 8, 17, 25, 28], "middle_block": 4, "middle_block_out": 4, "customizi": 5, "To": [5, 6, 7], "cvpr": [2, 5, 11, 12, 17, 24, 27], "2212": [5, 27], "04488": 5, "offici": [5, 7, 22, 23], "seunghwan": [5, 7, 11, 14, 16], "ji": [5, 7, 11, 14, 16], "aug": [5, 11, 16], "\ub6f0\uc5b4\ub09c": [5, 7, 11, 18, 22], "\ubcf4\uc774\ub294": [5, 7, 11, 14, 21], "\ucd94\uc138": 5, "user\uc758": 5, "private\ud55c": 5, "concept\uc744": [5, 19], "\uc0dd\uc131\ud558\uace0\uc790\ud558\ub294": 5, "\uc695\uad6c\ub294": 5, "\ud480\uc9c0": 5, "\ubabb\ud568": 5, "diffusion\uc740": 5, "partial\ud55c": 5, "\ubd80\ubd84\ub9cc\uc744": 5, "\ud559\uc2b5\uc2dc\ud0b4\uc73c\ub85c\uc368": 5, "\uae30\uc874\ubcf4\ub2e4": 5, "\ube60\ub978": [5, 10], "\ubc29\uc2dd\uc744": [5, 9, 14, 28], "\uc81c\uc548": [5, 7, 11, 18, 19, 21, 22], "\ubfd0": 5, "concept\uc5d0": [5, 19], "\ud559\uc2b5\uc774": [5, 8, 16, 21, 28], "\uac00\ub2a5": [2, 5, 8, 9, 23, 28], "\ud558\ub098\uc758": [5, 8, 13, 16, 17, 19, 21, 29], "compress\ud558\ub294": 5, "\ucd5c\uadfc": [5, 11, 14, 16, 21], "\ubaa8\ub378\ub4e4\uc774": [5, 6, 17, 18, 24, 28], "\ud65c\ubc1c\ud558\uac8c": 5, "\uc5f0\uad6c": [5, 11, 14], "\ub418\uc5b4\uc9d0": 5, "\uc785\ub825\ub9cc\uc73c\ub85c": 5, "\uc0dd\uc131\ud574\ub0b4\ub294": [5, 6, 11, 14], "\uc218\uc900\uae4c\uc9c0": [5, 11], "\uc774\ub984": 5, "\uc774\ub7ec\ud55c": [5, 6, 10, 11, 15, 16, 18, 19, 21, 24, 29], "general\ud55c": 5, "\uc0dd\uc131\ud558\uc9c0\ub9cc": [5, 10, 25], "user\uac00": 5, "specif": [5, 17, 24], "concept\uc758": [5, 19], "g": [5, 7, 9, 11, 16, 17, 21, 24, 25, 28], "\ud589\ubcf5\ud55c": 5, "\uc6b0\ub9ac": [5, 19, 21], "\uac00\uc871": 5, "\uc6b0\ub9ac\uc9d1": 5, "\uac15\uc544\uc9c0": 5, "\ubf40\uc090\uac00": 5, "\ud30c\ub9ac\ub85c": 5, "\uc5ec\ud589\uc744": 5, "\ub5a0\ub098\ub294": 5, "\ub4f1": [5, 7, 9, 10, 13, 16, 19, 24], "\uacfc\uc815\uc911\uc5d0": 5, "\ub370\uc774\ud130\ub97c": [5, 6, 8, 9, 11, 15, 16, 18, 21, 25, 29], "\ubcf4\uc9c0": [5, 21, 24], "\ubabb\ud588\uae30\ub54c\ubb38\uc5d0": 5, "model\uc5d0\uac8c\ub294": 5, "\ub2f9\uc5f0\ud55c": 5, "\uba87\uc7a5\uc758": 5, "\ud3ec\ud568\ud558\ub294": [5, 19, 21, 27], "\uc774\ubbf8\uc9c0\ub9cc\uc73c\ub85c": [5, 16], "finetuning\ud558\ub294": [5, 10], "\ubc29\uc2dd": 5, "In": [5, 20], "person": [5, 6, 10, 19], "\ubaa9\ud45c": [5, 6, 19, 21], "\ud559\uc2b5\ud558\uace0\uc790\ud558\ub294": 5, "\uc0dd\uc131\ud574\ub0b4\uc57c\ud568": 5, "\ud559\uc2b5\ub418\uc5c8\ub358": 5, "finetuning\ud55c": 5, "\ud6c4\uc5d0\ub3c4": [5, 10], "customization\uc774": 5, "\uc5b4\ub824\uc6b4": [5, 19], "\uc774\uc720": [5, 11, 21], "\uc9c4\ud589\ud558\ub2e4\ubcf4\uba74": 5, "\ud559\uc2b5\ud588\ub358": 5, "\uc78a\uc5b4\ubc84\ub9ac\uac70\ub098": 5, "\uc65c\uace1\ud574\ubc84\ub9bc": 5, "draft": 5, "overfit": [5, 24], "\ub418\uc5b4\uc11c": 5, "\uacb0\uacfc\ubb3c\uc758": [5, 15], "variation\uc774": [5, 17], "\ub0ae\uc544\uc9d0": 5, "\uc880\ub354": [5, 7, 11, 16], "\ub098\uc544\uac00": 5, "\uc5b4\ub824\uc6c0": 5, "text\ub85c": 5, "\uacfc\uc815": [5, 8, 11, 19], "\uc131\ub2a5": [5, 6, 13, 18, 20, 21, 22, 23, 24, 25, 28], "\uc720\uc9c0\ub97c": 5, "real": [5, 14, 16, 25], "image\uc640": [5, 9, 21, 22], "caption\uc744": 5, "regular": [5, 8, 20, 29], "data\ub85c": 5, "tuning\ub3d9\uc548": 5, "augment": [5, 16, 18, 20], "\uc18c\uac1c": [2, 5, 6], "gan": [5, 7, 9, 14, 16, 21, 22, 28], "\ubc29\uc2dd\uc758": [5, 7, 14, 15, 23], "model\ub4e4\uc774": [5, 7], "\ubcf4\uc5ec\uc8fc\uace0\uc788\uc74c": 5, "\uac8c\ub2e4\uac00": [5, 6, 9, 15], "control\ub3c4": 5, "\uac00\ub2a5\ud568": [5, 13, 19, 22, 26], "general\ud558\uc9c0": 5, "\uc54a\uc740": [5, 6, 10, 11, 15, 17, 18, 20, 21], "\uc0dd\uc131\uc740": 5, "\ubd88\uac00\ub2a5\ud568": 5, "new": [5, 7, 10, 17, 19], "global\ud55c": 5, "distribution\uc744": [5, 16, 17, 22], "\uc774\ubbf8": [5, 21, 26], "\ud3ec\ud568\ud55c": [5, 14], "\uc18c\ub7c9\uc758": [5, 8], "\uae30\ubc95": [5, 13, 20], "learning\uc740": 5, "\uc0dd\uac01\ubcf4\ub2e4": 5, "\ud6a8\uacfc\uc801\uc774\uace0": 5, "\uc720\uc6a9\ud568": 5, "\ub300\ubd80\ubd84": [5, 11, 16, 18], "\uc2dc\uc5d0\ub294": [5, 8, 11, 22, 28], "\uc804\uccb4\ub97c": [5, 27], "\ud559\uc2b5\ud558\uac70\ub098": 5, "\ucd94\uac00\ud574": [5, 8, 14, 15], "\uc7ac\ud559\uc2b5": [5, 7, 23], "\uc704\uc5d0\uc11c": [5, 9, 10, 14, 15], "customization\uc758": 5, "\ubb38\uc81c\ub97c": [5, 13, 15, 19, 26], "\uc77c\uc73c\ud0a4\uae30": 5, "\uc26c\uc6c0": 5, "etc": [5, 14], "\uc544\uc8fc": 5, "\uc77c\ubd80\ub9cc\uc744": 5, "\ub300\uc0c1\uc73c\ub85c": [5, 21], "\ucee8\uc149\uc73c\ub85c": 5, "finetuning\uc744": [5, 10], "\ud1b5\ud55c": [5, 14, 19, 23], "\uc5f0\uad6c\ub4e4\uc774": [5, 11, 16], "\uc788\uc74c": [2, 5, 8, 11, 13, 18, 19, 21, 22, 23, 26], "textual": [5, 10, 24], "invers": [5, 6, 14, 24], "vs": [5, 7, 9, 15, 21, 22, 23, 26, 27, 28], "\ubaa8\ub378\ub4e4\uc744": [5, 27], "compress\ud560": 5, "finetuning\ud568\uc73c\ub85c\uc368": 5, "resourse\ub97c": 5, "\uc808\uc57d\ud560": 5, "backbone\uc73c\ub85c": 5, "latent": [5, 6, 10, 14, 15, 16, 17, 20, 24, 27, 28, 29], "\ucc44\ud0dd": [5, 23], "l": [5, 9, 10, 12, 20, 28, 29], "dm\uc758": 5, "equat": [5, 6, 7, 9, 11, 14, 16, 23], "x_": [5, 7, 8, 9, 16, 23, 24], "\uc2dc\uc810\uc5d0": [2, 5], "noise\uac00": [5, 8, 9, 11, 17], "\uc11e\uc778": 5, "text\ub098": 5, "\ubc14\ub85c": [5, 6, 8, 15, 17, 20], "\uc0ac\uc6a9\ud558\uc9c0\uc54a\uace0": 5, "space\ub85c": [5, 14, 19], "embedding\ub41c": 5, "\uac12\uc744": [2, 5, 7, 11, 14, 16, 22, 23, 26], "us": [5, 6, 7, 19, 21, 22, 26, 30], "\u03b5": [5, 7], "nois": [5, 7, 8, 10, 11, 14, 15, 17, 18, 20, 23, 24, 25, 28, 29], "\u03b5_": 5, "\u03b8": 5, "\ub080": 5, "\u03b5\ub97c": 5, "\uc608\uce21\ud574\ub0b4\ub294": [5, 8], "\uc989": [2, 5, 6, 8, 9, 14, 19, 20, 21, 22, 23, 26, 28], "ldm": [5, 10, 12, 15, 16], "tuning\ud560\ub54c\ub294": 5, "layer\uc5d0\ub300\ud574": 5, "update\ud558\ub294\uac8c": 5, "\uae30\ubcf8": [5, 15, 18], "\ubc29\uc2dd\uc740": [5, 10, 13, 19, 28], "resource\uac00": 5, "\ube44\ud6a8\uc728\uc801\uc73c\ub85c": 5, "\ub9ce\uc774\ub4e4\uace0": 5, "\uc774\ubbf8\uc9c0\uc5d0": [5, 7, 11, 14, 16, 18, 19, 20, 21, 22], "overfitting\ub418\uae30": 5, "\ubcc0\ud654\ub7c9\uc744": 5, "\uccb4\ud06c": 5, "delta": [2, 5, 10, 13], "while": 5, "\ubd80\ubd84\uc5d0\ube44\ud574": 5, "cross": [5, 10, 15, 16, 24, 26, 28], "attent": [5, 8, 9, 10, 12, 13, 15, 16, 22, 23, 26, 28], "\uc5f0\uc0b0\uc758": [5, 12], "wegith": 5, "\ubcc0\ud654\ub7c9\uc774": [2, 5], "\ud07c": 5, "fig": [5, 7, 11], "latent\uc5d0": 5, "\uc8fc\uc785\ud558\ub294": 5, "mechan": 5, "kei": [5, 12, 13, 14, 16], "valu": [5, 7, 13], "parameter\uc5d0": 5, "\ub2e8": [5, 10, 14, 16, 24], "\ucc28\uc9c0": 5, "\uc758\ubbf8\ud558\ub294": [5, 18, 29], "\ud3ec\ud568\ub418\ub294": 5, "k": [5, 8, 12, 13, 14, 22, 25, 28], "\ub9cc": [5, 6, 8, 13, 20, 24], "\ub098\uba38\uc9c0\ub294": [5, 16, 21], "freez": [5, 10, 13, 24, 26], "\uc2e4\uc81c\ub85c\ub294": [5, 7], "\uc4f0\uc9c0\uc54a\ub294": 5, "\ub2e8\uc5b4\ub85c": 5, "\ud615\uc2dd\uc73c\ub85c": 5, "captioning\ud55c": 5, "\ud6c4\uc5d0": [5, 23], "\ub610": [5, 11, 14, 15, 16, 20, 24, 27], "finetuning\uc911\uc5d0": 5, "\uc78a\uc5b4\ubc84\ub9ac\ub294": 5, "\ud604\uc0c1\uc774": [5, 17, 27], "\uc788\uc744\uc218\uc788\uc74c": 5, "moon": 5, "\uc0dd\uc131\ud558\uba74": [5, 21], "finetuning\ud588\ub358": 5, "moongat": 5, "\uc0dd\uc131\ud574\ubc84\ub9bc": 5, "\ubc29\uc9c0\ud558\uae30\uc704\ud574": 5, "world\uc758": 5, "image\uc5d0\uc11c": [5, 21, 22], "target": [5, 6, 8, 16, 19, 21, 24], "\uc720\uc0ac\ud55c": [5, 10, 12, 19, 21, 22, 24, 29], "200\uc7a5\uc758": [5, 16], "regul": 5, "\uc720\uc0ac\ud558\ub2e4": 5, "clip\uc5d0\uc11c": [5, 18], "\ucd94\ucd9c\ud55c": [5, 20], "space\uc0c1\uc758": 5, "vector\uac00": 5, "similar\ud558\ub2e4": 5, "joint": [5, 9, 22, 23], "trane": [5, 21], "\uac01\uac01\uc758": [2, 5, 16], "\uac16\ub294": [2, 5, 6, 9], "rare\ud55c": 5, "key\ub97c": 5, "\ubd80\uc5ec\ud574": [5, 28], "i": [2, 5, 8, 9, 12, 13, 14, 15, 17, 22, 24, 25, 26, 28, 29], "constrain": 5, "optim": [5, 7, 10, 16, 19, 22, 24, 29], "merg": [5, 13], "concept\uc73c\ub85c": 5, "\ud559\uc2b5\ub41c": [5, 18, 19, 20, 21, 23, 26], "weight\ub97c": [5, 13, 18], "w_0": [2, 5, 13], "appendix": 5, "a\uc5d0\ub294": 5, "\ub77c\uace0": [2, 5, 6, 10, 13, 19, 20, 26, 29], "\ub098\uc640\uc788\ub294\ub370": 5, "\uc624\ud0c8\uc790\uc77c": 5, "\uac00\ub2a5\uc131": 5, "c_": [5, 15, 16, 24], "reg": 5, "caption\uc758": 5, "\ubf51\uc544": [5, 23], "concat": [5, 6, 15], "\uacf1\ud55c": 5, "\uac12\uacfc\uc758": 5, "norm\uc744": [5, 21], "\uacc4\uc0b0\ud588\uc744\ub54c": 5, "n\uac1c\uc758": [5, 22], "attention\uc774": 5, "\ub3d9\uc791\ud558\ub294": [5, 12], "\ucc3e\uc544": [5, 19], "\ud558\ub098\ub9cc": 5, "\uc0ac\uc6a9\ud558\uc790": 5, "250": [5, 20], "two": [5, 6, 17, 19, 22, 24, 28], "500": [5, 18], "8": [5, 8, 11, 12, 13, 14, 17, 22, 25, 28], "10": [2, 5, 8, 10, 11, 14, 15, 18, 20, 23, 25, 28], "resiz": 5, "veri": 5, "small": [5, 19, 27, 28], "far": 5, "awai": 5, "zoom": 5, "techniqu": [5, 8, 14, 26], "qualit": [5, 10, 22, 28], "evalu": [5, 6, 7, 9, 27], "quant": [5, 19], "kid": [5, 14], "\uc5bc\ub9c8\ub098": [5, 6, 8, 14, 18, 19, 23], "\ub300\uc751\ub418\ub294": 5, "\uc0dd\uc131\ud574\ub0c8\ub294\uac00": 5, "image\uc758": [5, 9, 11, 12, 14, 16, 19], "\ud45c\ud604\ud574\ub0c8\ub294\uac00": 5, "tabl": [5, 10, 11, 13, 15, 16, 18, 20, 21, 23, 28], "\uc815\uc131\uc801": [5, 22], "\uc815\ub7c9\uc801": [5, 10, 22], "human": [5, 6, 9, 17, 19, 27], "prefer": [5, 26], "studi": [5, 10, 16], "baselin": [5, 22], "customdiffus": [5, 10], "all": [5, 6, 13], "\uc120\ud638": 5, "inversion\uc740": [5, 19], "alignment\ub294": 5, "\uc120\ud638\ub3c4\uc640": 5, "\ube44\uc2b7\ud558\uc9c0\ub9cc": [5, 18, 21], "alignment\uc218\uce58\ub97c": 5, "diffusion\uc774": 5, "\ub9e4\uc6b0": [2, 5, 6, 7, 8, 13, 19, 20, 21, 26], "\ub192\uc544": 5, "overfitting\ub41c": [5, 16], "ablat": [5, 10, 16, 23], "\u314cgen": 5, "\ub300\uc2e0": [5, 8, 10, 11, 15, 16, 19, 21, 22], "generate\ub41c": 5, "\uc218\uce58\ub294": [5, 11, 23], "regulat": 5, "world": 5, "customizing\uc774": 5, "\uac00\ub2a5\ud558\uace0": 5, "resourse\uac00": 5, "Of": 5, "category\uc758": 5, "object\uc5d0": 5, "\ub300\ud574\uc11c\ub294": [5, 9, 11, 18], "\ub3d9\uc791\ud558\uc9c0": [5, 11], "\uc54a\uc74c": [5, 8, 19, 26], "hierarch": 6, "2022": [6, 9, 12, 21, 26], "2204": 6, "06125v1": 6, "seonhoon": [2, 6, 20], "sep": [6, 26, 27], "18": [6, 24], "2022\ub144\uc5d0": 6, "\uacf5\uac1c\ub418\uc5b4": 6, "\uc138\uc0c1\uc744": 6, "\ub180\ub77c\uac8c": 6, "\ub2a5\ub825\ub3c4": 6, "\ub6f0\uc5b4\ub0ac\uace0": 6, "\uc0ac\uc6a9\uc790": 6, "\uc785\ub9db\uc5d0": 6, "\uc870\uc791\ud560": 6, "\ub418\uc5c8\uc8e0": 6, "\uc774\ub984\uc740": 6, "\uc77c\uae4c\uc694": 6, "\ucd08\ud604\uc2e4\uc8fc\uc758": 6, "\ud654\uac00": 6, "salvador": 6, "dali": 6, "wall": 6, "\ud569\uc131\uc5b4\uc785\ub2c8\ub2e4": 6, "\uc0dd\uc131\ud574\ub0b8": 6, "\uacb0\uacfc\ubb3c\uc774": [6, 15, 22], "\uacfc\uc5f0": 6, "\uc5b4\ub5bb\uae38\ub798": 6, "\uacb0\uacfc\ubb3c": [6, 22], "\uc0dd\uc804": 6, "\ubaa8\uc2b5": 6, "vibrant": 6, "portrait": [6, 16], "robot": 6, "half": [6, 20], "face": [6, 10, 17, 25], "\uc2e4\uc81c": [6, 10, 13, 14, 18, 19, 21, 24, 25], "\ubaa8\uc2b5\uc774": [6, 21], "\ubcf4\uc774\ub124\uc694": 6, "\ucd08\ud604\uc2e4\uc8fc\uc758\uc801": 6, "\uac19\uae30\ub3c4": 6, "corgi": 6, "\uc5b4\ub5a4\uac00\uc694": 6, "s": [6, 7, 16, 17, 19, 24, 25, 26, 27, 30], "head": [6, 8, 16, 23], "depict": 6, "explos": 6, "nebula": 6, "\ubaa8\uc2b5\uc744": [6, 19], "\uc131\uc6b4\uc758": 6, "\ud3ed\ubc1c\ub85c": 6, "\ubb18\uc0ac\ud574\ub2ec\ub77c\uace0": 6, "\ud588\uc744": [6, 20, 24], "\uadf8\ub9bc\uc785\ub2c8\ub2e4": [6, 29], "nasa": 6, "\ucd2c\uc601\ud55c": 6, "\ucd08\uc2e0\uc131": 6, "\ud3ed\ubc1c\uc758": 6, "\uc794\ud574\uc785\ub2c8\ub2e4": 6, "\uc815\ub9d0": 6, "\uadf8\ub7f4\ub4ef\ud558\uc9c0": 6, "\uc54a\ub098\uc694": 6, "thi": [6, 7, 8, 13, 19, 24, 30], "mosaic": 6, "one": [2, 6, 16, 21], "largest": 6, "ever": 6, "taken": 6, "hubbl": 6, "space": [6, 10, 15, 19, 22, 24, 28, 29], "telescop": 6, "crab": 6, "six": 6, "light": 6, "year": 6, "wide": 6, "expand": [6, 23, 28], "remnant": 6, "star": 6, "supernova": 6, "\uc8fc\uc758\uc0ac\ud56d": 6, "\ubcf8": [2, 6, 8, 10, 13, 18, 20, 22, 23, 26], "\ub0b4\uc6a9\uc744": [6, 10], "\ube44\uc120\ud615\uc801\uc73c\ub85c": 6, "\uc0b4\ud3b4\ubd05\ub2c8\ub2e4": 6, "\ub9c8\uce58": 6, "\uc624\ud508\uc6d4\ub4dc": 6, "\uac8c\uc784\ucc98\ub7fc": 6, "\ub9d0\uc774\uc8e0": 6, "\ud575\uc2ec\uc774": 6, "\ub418\ub294": [6, 22, 23, 24, 27, 28], "\uc9c8\ubb38\ub4e4\uc744": 6, "\ub358\uc9c0\uba70": 6, "\ud30c\ud5e4\uccd0": 6, "\uac81\ub2c8\ub2e4": 6, "\ud3ec\uc2a4\ud305\uc740": 6, "openai": 6, "blog": [6, 19], "assemblyai": 6, "youtub": [2, 6, 13, 22], "eden": 6, "meyer": 6, "\ucc38\uace0\ud588\uc2b5\ub2c8\ub2e4": 6, "\ubcf8\uaca9\uc801\uc73c\ub85c": 6, "\ud559\uc2b5\ud558\uae30": 6, "\uc804\uc5d0": [6, 24], "\uc54c\uc544\uc57c\ud560": 6, "\uac83\uc740": [6, 8, 18, 19, 20, 21, 28], "\ubaa8\ub378\uc785\ub2c8\ub2e4": [6, 12, 17], "The": [6, 17], "fundament": 6, "principl": 6, "ar": [6, 7, 8, 13, 15, 28], "quit": 6, "simpl": [6, 11, 23, 27, 28], "first": [6, 7, 13], "associ": 6, "caption": [6, 9, 20, 22, 26], "through": [6, 9, 19], "respect": [6, 23, 26], "object": [6, 7, 19, 20, 22, 24, 27], "dimension": [6, 8], "Then": [6, 13], "cosin": [6, 19, 23, 24], "similar": [6, 22, 24], "each": [6, 17, 24], "pair": [6, 13, 14, 26, 28], "comput": [6, 7, 13, 21, 23, 24, 26, 27, 28], "simultan": 6, "maxim": [6, 22], "between": [2, 6, 9, 22, 26], "n": [2, 6, 8, 9, 12, 13, 22, 28], "correct": [6, 24], "minim": 6, "incorrect": [6, 10, 24], "\ud1b5\ud569\uc2dc\ucf30\uc2b5\ub2c8\ub2e4": 6, "\ucd5c\ucd08\ub294": 6, "\uc815\ub2f5\uc740": 6, "\uc544\ub2d9\ub2c8\ub2e4": [6, 17], "22\ub144": 6, "5\uc6d4": 6, "\uc0ac\uc6a9\ud558\uc9c0": [6, 10, 20], "imagen": [6, 10, 20, 24], "\uc5d0\uac8c": [6, 20], "sota": [6, 16, 18, 22, 26, 28], "\ub0b4\uc8fc\uc5c8\uc2b5\ub2c8\ub2e4": 6, "\uc544\ud0a4\ud14d\uccd0": [6, 22, 23, 26], "\ucc0d\uba39\ud558\uae30": 6, "\ub0b4\uc758": [6, 15], "semant": [6, 10, 15, 19, 21, 28], "\ud3ec\ucc29\ud574\ub0bc": 6, "\ud45c\ud604": [6, 8], "\ub04c\uc5b4\uc62c\ub9ac\uae30": 6, "\uc704\ud574\uc11c": [6, 9, 17, 20, 21], "\uc800\uc790\ub4e4\uc740": [6, 9, 15, 18, 19, 22, 23, 26], "\ud1b5\ud569\ud55c": 6, "stage": [6, 28], "\uc774\uac83\uc774": 6, "\uc778\ub370\uc694": 6, "unclip": 6, "\ubd80\ub985\ub2c8\ub2e4": 6, "level": [6, 14, 15, 17, 28], "overview": [6, 29], "architectur": [6, 7, 10, 17, 19, 25, 26, 27, 29], "\ubcf5\uc7a1\ud574\ubcf4\uc774\ub2c8": 6, "assembl": 6, "ai": [6, 10, 21], "\ub2e8\uc21c\ud654\ub41c": 6, "\uadf8\ub9bc\uc744": [6, 15, 17, 18, 21], "\uc0b4\ud3b4\ubcfc\uac8c\uc694": 6, "www": [2, 6, 13, 22], "com": [2, 6, 13, 19, 21, 22, 24], "watch": [2, 6, 13, 22], "f1x4fhzf4mq": 6, "360": 6, "ab_channel": [2, 6], "decod": [6, 22, 24, 28, 29], "\ubaa8\ub378\uc778": [6, 14, 20, 26], "\uac19\ub124\uc694": 6, "\ucea1\uc158\uc744": 6, "\uc0c1\uc751\ud558\ub294": 6, "\uc0dd\uc131\ud569\ub2c8\ub2e4": [6, 10, 21, 28, 29], "autogregress": 6, "\ube44\uad50\ud558\ub294": [6, 10], "\uc2e4\ud5d8": [6, 10, 18, 19, 20, 23, 25], "\uc218\ud589\ud588\uc2b5\ub2c8\ub2e4": 6, "computation": [6, 28], "\ud558\uace0": [6, 12, 15, 20, 28], "\ud6c4\ubc18\ubd80\uc5d0\ub294": 6, "\uc2e4\ud5d8\ud569\ub2c8\ub2e4": 6, "\ubaa8\ub378\ub9cc": 6, "\uc774\ub791": [6, 22, 29], "\uc0ac\uc6a9\ud588\uc744\uae4c\uc694": 6, "represent": [2, 6, 12, 22], "\ud559\uc2b5\ud558\ub294\ub370": [6, 19, 24], "\ud070": [6, 8, 10, 11, 13, 15, 17, 18, 19, 20, 21, 28], "\uc131\uacf5\uc744": 6, "\uac70\ub450\uace0": 6, "shift": [6, 20], "robust": [6, 20, 28], "capabl": 6, "\ub6f0\uc5b4\ub0ac\uc2b5\ub2c8\ub2e4": 6, "vision": [6, 14, 18, 27], "task": [6, 13, 21, 22, 25, 26], "\ub418\uc5b4": [6, 10, 17, 25, 28], "\ub2ec\uc131\ud574\ub0c8\uc2b5\ub2c8\ub2e4": 6, "video": [2, 6, 21], "tak": 6, "\uac31\uc2e0\ud558\ub294": 6, "\uc911\uc774\uc5c8\uc8e0": 6, "non": [6, 11, 20, 23, 28], "determinist": [6, 7, 23], "\ub355\ubd84\uc5d0": 6, "\uc874\uc7ac\ud558\uc9c0": [6, 21], "essenti": 6, "\ubcc0\uc8fc\ud558\uba74\uc11c": 6, "\uc720\uc9c0": [6, 28], "\uc788\uc8e0": 6, "variat": [6, 8, 29], "\uc67c\ucabd\uc758": 6, "\ub4e4\uc740": [6, 26], "\ubcf4\uc874\ub429\ub2c8\ub2e4": 6, "\uadf8\ub4e4\uc774": 6, "\ud45c\ud604\ub418\ub294": 6, "\ubc29\uc2dd\uc774\ub098": 6, "\uc870\uae08\uc529": [6, 18], "\ubc14\ub01d\ub2c8\ub2e4": 6, "\uadf8\ub7fc\uc5d0\ub3c4": [6, 21], "\ud2b9\uc720\uc758": 6, "\ud654\ud48d\uc740": 6, "\uc720\uc9c0\ub418\ub294": 6, "\ubcc0\uc8fc\uace1\ucc98\ub7fc": 6, "\ub9e4\ubc88": [6, 13, 22], "\uc0c8\ub86d\uac8c": [6, 19], "\uc5f0\uc8fc": 6, "\ud574\ub0bc": 6, "\uc788\ub294\uac81\ub2c8\ub2e4": 6, "\ud30c\ud5e4\uce58\uae30": 6, "\uc774\ubc88\uc5d0\ub294": 6, "\uc0b4\ud3b4\ubcf4\uc8e0": 6, "\uc790\uccb4\uc758": 6, "\uc124\uba85": [6, 23], "\uc0ac\uc2e4": [6, 11], "\uc870\uac74\uc73c\ub85c": 6, "\ubc1b\ub294": [6, 8, 17], "\uc790\uccb4\ub3c4": 6, "\ubc1b\uc2b5\ub2c8\ub2e4": 6, "\ubb3c\ub860": [6, 18], "\ubc1b\uaca0\uc8e0": 6, "\uc11c\ub85c": [2, 6, 17, 20, 21], "1\ub3001": 6, "\ub300\uc751\ub418\uae30": 6, "duel": 6, "\ubb38\uc81c\ub420": 6, "\uc5c6\ub2e4\uace0": [6, 17], "\ubcc0\ub860\ud569\ub2c8\ub2e4": 6, "\ud004\ub9ac\ud2f0\ub97c": 6, "\ub192\uc774\uae30": [6, 11], "2\uac1c\uc758": [6, 28], "\uc8fc\uc5b4\uc9c4": [6, 9, 10, 19, 20, 22], "\ub192\uc740": [6, 8, 9, 10, 11, 13, 14, 15, 16, 18, 19, 20, 22, 23, 26], "dot": 6, "\uc0ac\uc6a9\ud588\ub2e4\uace0": [6, 9, 18, 24], "modifi": 6, "glide": [6, 27], "project": [6, 12, 13, 23, 28], "\uc8fc\uc7a5\ud569\ub2c8\ub2e4": [6, 28], "\ud1b5\ud569\uc2dc\ud0a4\ub0d0\ud558\uba74": 6, "\ucd94\uac00\ud558\uace0": [6, 8, 10], "token": [6, 13, 19, 22, 24, 28], "\ud558\ub294\uac70\uc8e0": 6, "\ubc29\ubc95\uc73c\ub85c": [6, 18, 19, 23], "\uc6d0\ubcf8": [6, 14, 18, 19, 24, 28, 29], "\uc601\uc0c1\uc744": [6, 20], "process": [2, 6, 11, 14, 19, 20, 22, 24, 25, 28], "\uc0ac\uc6a9\ud568\uc73c\ub85c\uc368": [6, 23, 27], "\uc788\ub358": 6, "photorealist": [6, 9, 18, 26], "\ud65c\uc6a9\ud560": [6, 19], "\uadf8\ub807\ub2e4\uba74": [2, 6], "\ud544\uc694\ud560\uae4c\uc694": 6, "obtain": 6, "full": [6, 8, 11, 16, 27], "we": [6, 19, 24, 28], "combin": 6, "which": [6, 19, 26], "possibl": 6, "given": 6, "\ub531\ud788": [6, 21], "\uc640\ub2ff\uc9c0\ub294": 6, "\uc54a\uc2b5\ub2c8\ub2e4": [6, 21], "\uc2e4\ub9dd\ud558\uae34": 6, "\uc774\ub985\ub2c8\ub2e4": 6, "\uc720\ubb34\uc5d0": 6, "\ud488\uc9c8\uc744": [6, 15, 18, 19], "\uc2e4\ud5d8\uc744": [6, 9, 12], "\uc218\ud589\ud588\ub2e4\uace0": [6, 9], "\ud55c\ubc88": [6, 8, 24, 29], "\uc0b4\ud3b4\ubcfc\uae4c\uc694": 6, "\uc218\ud589": [6, 19, 22, 25], "\ubaa8\ub378\ucc98\ub7fc": [6, 20], "\uc8fc\uc5b4": [6, 20, 21], "\uac16\ucd94\uace0": 6, "\ud6cc\ub96d\ud588\uc2b5\ub2c8\ub2e4": 6, "\ud2b9\ud788": [6, 15, 19, 21, 26, 28], "3\uac00\uc9c0": [6, 10, 26, 27, 28], "\uacbd\uc6b0\uc758": [2, 6, 21], "\uc544\ud0a4\ud14d\uccd0\uc5d0": 6, "sampl": [2, 6, 12, 16, 20, 23, 24, 25, 26, 28, 29], "signal": [6, 7], "same": [6, 13], "\uadf8\ub807\uc9c0\ub9cc": [6, 26], "\uc758\ubb38\uc774": [6, 18], "\ub9d0\ub054\ud788": 6, "\ud574\uc18c\ub418\uc9c0\ub294": 6, "\uc65c\ub0d0\ud558\uba74": [2, 6, 21], "95": 6, "\uc2dc\uac04": [2, 6, 8, 13, 21], "\ub3d9\uc548": [6, 15, 25], "\ubc29\uc2dd\uc73c\ub85c": [6, 8, 10, 13, 16, 23], "\ubc29\uc2dd\uc5d0": [6, 19], "\uadf8\ub300\ub85c": [6, 9, 20, 23, 28], "\uc801\uc6a9\ud574": [6, 15], "\uc2e4\ud5d8\ud588\uc2b5\ub2c8\ub2e4": 6, "\uacf5\uc815\ud55c": 6, "\uc2e4\ud5d8\uc774\ub77c\uace0": 6, "\ubcf4\uae34": 6, "\uc5b4\ub824\uc6b8": 6, "true": [6, 7, 8, 13, 25, 28], "\ud559\uc2b5\uc2dc\ucf30\uc744": 6, "\ub54c\uc758": [6, 15, 18], "\ube44\uad50": [6, 7, 11, 20, 22, 23, 26], "\uc2e4\ud5d8\uc740": 6, "\uc5c6\uc2b5\ub2c8\ub2e4": [6, 21, 28], "\uac1c\uc778\uc801\uc73c\ub85c": [6, 17, 18], "\uc800\ub294": [6, 17], "\ubcf4\uace0": [6, 8, 18], "\ubc18\ub4dc\uc2dc": [6, 14], "\uc368\uc57c\ud558\ub294": 6, "\uadfc\uac70\uc5d0": 6, "\uc124\ub4dd\ub825\uc774": 6, "\ub5a8\uc5b4\uc9c4\ub2e4\uace0": 6, "\uc0dd\uac01\ud588\uc2b5\ub2c8\ub2e4": 6, "\uc368\uc57c\ud560\uae4c\uc694": 6, "\uac1d\uccb4\ub97c": [6, 21], "\ubb18\uc0ac\ud55c": 6, "\uac1d\uccb4\uc758": 6, "\uc2dc\uac01\uc801": [6, 15, 19], "\ubc1c\ud604": 6, "\uc0ac\uc774\uc758": [2, 6, 8, 9, 21], "\uc758\ubbf8\ub860\uc801": 6, "\uad00\uacc4\ub97c": [6, 7, 11], "\ud559\uc2b5\ud588\uc2b5\ub2c8\ub2e4": 6, "\ub2a5\ub825\uc774": [6, 20], "\uc911\uc694\ud558\ub2e4\uace0": [6, 21, 24], "manipul": [6, 14, 16], "diff": 6, "appli": 6, "interpol": [6, 11], "normalis": 6, "produc": 6, "descript": [6, 24], "\ud558\ub294\uc9c0\ub294": 6, "\uace7": 6, "\uc0b4\ud3b4\ubcf4\uaca0\uc2b5\ub2c8\ub2e4": [6, 25, 28], "\uadf8\ub798\uc11c": [6, 9, 18, 21, 25], "\ubb50\uac00": 6, "\uc88b\uc740\uac00\uc694": 6, "\ud3c9\uac00\ud558\uae30": [6, 18], "\uc0dd\uc131\ubb3c\uacfc": 6, "\uc0dd\uc131\ubb3c\uc744": 6, "\uc0ac\ub78c\ub4e4\uc5d0\uac8c": 6, "\uc81c\uc2dc\ud558\uace0": 6, "photor": [6, 9, 26], "\ub300\ud574\uc11c": [6, 9, 10, 12, 17, 18, 19, 20, 21, 24, 25], "\ub9e4\uae30\ub3c4\ub85d": 6, "when": [6, 17, 25, 26], "guidanc": [6, 18, 27, 28], "both": [6, 20], "comparison": [6, 9, 13, 14, 16, 23, 24, 26], "versu": 6, "\uacb0\ub860\uc740": 6, "compar": [6, 19, 21], "\ud6e8\uc52c": [6, 9, 10, 11, 18, 20, 21], "\uac00\ub2a5\ud569\ub2c8\ub2e4": [6, 10, 24, 28], "bipartit": 6, "\uad6c\uc870": [6, 7, 17, 18, 21], "z_i": [6, 12], "x_t": [2, 6, 8, 9, 12, 20, 23, 28], "\uc778\ucf54\ub529": [6, 12], "\ud65c\uc6a9\ud574\uc11c": [6, 23, 29], "ddim": [6, 9, 18, 19, 28], "\ub41c": [6, 10, 13, 15, 18, 20, 24, 28], "\uc5bb\uc73c\uba70": 6, "\ubcf5\uc6d0\ud558\ub294\ub370": 6, "\ud544\uc694\ud55c": [6, 19, 20, 21], "\uc794\uc5ec": 6, "\uc815\ubcf4\ub4e4\uc744": [6, 28], "\uc9c0\ub2d9\ub2c8\ub2e4": 6, "\ubcc0\uc8fc\ud558\uae30": 6, "\u03b7": [6, 7], "\uc801\uc6a9\ud569\ub2c8\ub2e4": [6, 21], "\uc77c": [2, 6, 25], "\ud574\uc9c0\uace0": 6, "\ubcf5\uc6d0\ud574\ub0c5\ub2c8\ub2e4": 6, "\ucee4\uc9c8\uc218\ub85d": [6, 7, 11, 16], "\uc5d0\ub294": [6, 8, 26], "stochast": [2, 6, 7, 9, 14, 25], "\uc0dd\uae30\uace0": 6, "\uadfc\ucc98\uc5d0\uc11c": 6, "perceptu": [6, 21], "centere": 6, "\ub9cc\ub4e4\uc5b4\ub0bc": [6, 9], "\ud0a4\uc6b0\uba74": 6, "\uc6b0\ub9ac\ub294": [6, 21, 29], "\uc874\uc7ac\ud558\uace0": 6, "\uc720\uc2e4\ub418\uc5c8\ub294\uc9c0": 6, "\ud0d0\uc0c9": 6, "\ud0d0\uc0c9\ud574\ub0bc": 6, "\uc788\ub294\uac70\uc8e0": 6, "\uac83\ub3c4": [6, 8, 18, 29], "\ud574\uc11c": [6, 19, 23, 25], "\uc900\ub2e4\uba74": 6, "\ucea1\uc158\uc774": 6, "\uc8fc\uc5b4\uc838\uc788\uc744": 6, "\uc6b0\ub9ac\uac00": [6, 8, 21], "method": [6, 7, 17, 20, 22, 23], "z_t0": 6, "current": [6, 7, 8], "\uc774\uace0": [2, 6, 25, 28, 29], "z_t": [6, 12, 28], "\uc774\ub77c\uba74": 6, "embd": 6, "\uc870\uc791\ub429\ub2c8\ub2e4": 6, "typograph": 6, "attak": 6, "attack": 6, "\ub0b4": 6, "\uc0ac\ubb3c": 6, "\uc704\uc5d0": [6, 9], "\uae00\uc528\uac00": 6, "\uc4f0\uc5ec": 6, "\uacbd\uc6b0\uc785\ub2c8\ub2e4": [6, 17], "multimod": [6, 20, 26], "\ub9ce\uc774": [6, 18, 19, 20, 23, 25], "\ud65c\uc6a9\ud574": [6, 15, 20, 21, 23, 28], "\uc0ac\ubb3c\uc744": 6, "\ud310\ub2e8\ud558\ub294": 6, "ipod": 6, "\uc885\uc774\uac00": 6, "\ubd99\uc740": [6, 21], "\uc0ac\uacfc\ub97c": 6, "\ubd84\ub958\ub97c": 6, "\uc218\ud589\ud574\ubcf4\uc558\uc2b5\ub2c8\ub2e4": 6, "\uc5ed\uc2dc": [6, 29], "granni": 6, "smith": 6, "\uac70\uc758": [6, 9, 11, 21], "\uac00\uae5d\ub2e4\uace0": 6, "\ud310\ub2e8\ud588\uc2b5\ub2c8\ub2e4": 6, "\uc0ac\uacfc\uc758": 6, "\uc0ac\uc9c4\uc73c\ub85c": [6, 21], "recov": 6, "\ud574\ub0c5\ub2c8\ub2e4": 6, "\uc774\ucc98\ub7fc": [6, 29], "\ub354\uc6b1": [6, 18, 22, 26], "\ub2e8\uc810\uc740": 6, "\uc5c6\ub098\uc694": 6, "cube": 6, "\uadf8\ub4e4\uc758": [6, 19], "\uc18d\uc131": [6, 21], "color": [6, 16, 24, 27, 28], "\ub9e4\uce6d\uc2dc\ud0a4\ub294": 6, "\ub5a8\uc5b4\uc9d1\ub2c8\ub2e4": 6, "red": [6, 22], "blue": [6, 22], "\ud30c\ub780": [6, 20], "\ud050\ube0c": 6, "\ube68\uac04": [6, 18], "\ud050\ube0c\ub97c": 6, "\uadf8\ub824\ub2ec\ub77c\uace0": 6, "\ud050\ube0c\uc640": 6, "\ud050\ube0c\uc5d0": 6, "\uc0c9\uc0c1": 6, "attribut": [6, 17, 27], "\ubd80\uc5ec\ud574\uc57c\ud560\uc9c0": 6, "\ud5f7\uac08\ub824\ud569\ub2c8\ub2e4": 6, "\uc77c\uad00\uc131\uc788\uac8c": 6, "sign": 6, "sai": 6, "deep": [6, 11, 18, 26], "\ub9cc\uc758": 6, "\ubb38\uc81c\ub294": 6, "\uc5b4\ub824\uc6cc\ud558\ub294": 6, "\ubb38\uc81c\uc785\ub2c8\ub2e4": 6, "\ubcf5\uc7a1\ud55c": [6, 9, 18], "\uc0c1\ud669\uc5d0\uc11c": [6, 8], "\ub514\ud14c\uc77c\uc744": [6, 19], "\ubb18\uc0ac\ud558\ub294": 6, "show": [6, 26], "some": 6, "complex": [6, 28], "\ub124\uc628": 6, "\uc0ac\uc778\ub4e4\uc758": 6, "\ub514\ud14c\uc77c\ub4e4\uc774": 6, "\ub5a8\uc5b4\uc9c0\ub294": [6, 11, 18], "\ud655\uc778\ud558\uc2e4": 6, "\ub17c\ubb38\uc758": [6, 10, 14, 17, 18, 23, 29], "\uc5d0\uc11c\ub294": [6, 9, 11, 20, 24, 25, 26], "\uc218\ud559\uc801": 6, "justifi": 6, "\ub77c": [6, 28], "\ud569\uc2dc\ub2e4": [6, 17], "\uadf8\uc5d0": [6, 14, 29], "\uc800\uc790\uc758": 6, "\uc8fc\uc7a5": [6, 23, 26], "\uc0d8\ud50c\ub9c1\ud560": 6, "equal": 6, "hold": 6, "becaus": [2, 6], "function": [6, 19, 25, 29], "second": 6, "chain": [6, 8, 23, 24], "rule": 6, "\ud3ec\uc2a4\ud305\uc744": 6, "\ubd80\uac00": 6, "\uc774\ubbc0\ub85c": [2, 6, 23], "\uc4f8": [6, 25], "\uacf5\uc2dd\uc744": 6, "\ud480\uc5b4\uc11c": 6, "\ud574\uc124\ud574\ubcf4\uba74": 6, "\uc0ac\uc6a9\ud574": [6, 13, 15, 16, 18, 19, 26], "\uc0d8\ud50c\ub9c1\ud558\uace0": [6, 20, 29], "\uc0d8\ud50c\ub9c1\ud568\uc73c\ub85c\uc368": 6, "\uc0d8\ud50c\ub9c1\uc774": 6, "\uac00\ub2a5\ud574\uc9c0\ub294": 6, "\uc5c6\ub294\uc9c0": 6, "\uad81\uae08\ud574\uc11c": 6, "\uacf5\ubd80\ud574\ubd24\uc2b5\ub2c8\ub2e4": 6, "\uc788\ub294\uc9c0": 6, "\ud574\uc18c\ud558\uae30": 6, "\ub178\ub825\uc744": 6, "\ud558\uace0\uc788\ub294\uc9c0": 6, "\ub300\uccb4": [6, 13], "\uc815\ub7c9\uc801\uc73c\ub85c": 6, "\ud3c9\uac00\ud560": [6, 27], "\uc870\uc0ac\ud574\ubd24\uc2b5\ub2c8\ub2e4": 6, "\uacb0\uacfc\ubd80\ud130": 6, "\ub9d0\uc500\ub4dc\ub9ac\uba74": 6, "\ucc98\ub7fc": [6, 17, 21, 22, 27, 28], "\uc6f9\ud06c\ub864\ub9c1": 6, "\uc874\uc7ac\ud55c\ub2e4\uace0": 6, "\ud558\uace0\uc788\ub294\uc9c0\ubd80\ud130": 6, "preview": 6, "\ud604\uc7ac": [6, 7, 16, 18], "safeti": 6, "\ub178\ub825": 6, "\ub370\uc774\ud130\uc5d0\uc11c": [6, 18], "violent": 6, "hate": 6, "adult": 6, "\uc81c\uac70\ud568\uc73c\ub85c\uc368": 6, "\ub178\ucd9c\ub418\ub294": 6, "\uc2dc\uac04\uc744": [2, 6], "\ucd5c\uc18c\ud654\ud588\ub2e4\uace0": 6, "polici": 6, "\uc704\ubc18\ud55c": 6, "\uc790\uc815\ud558\ub294": 6, "\uc2dc\uc2a4\ud15c\uc744": 6, "\ubcf4\uc720\ud558\uace0": 6, "\uc2e0\ub8b0\ud560": 6, "\uc804\ubb38\uac00\ub4e4\uacfc": 6, "\uac80\ud1a0\ub97c": 6, "\uc9c4\ud589\ud588\ub2e4\uace0": [6, 9, 27], "eval": [6, 7], "\uc0dd\uc131\ud615": 6, "\ud3c9\uac00\ud558\ub294": [6, 18], "\uae30\ubc95\uc774": [6, 9], "2202": 6, "04053": 6, "j": [6, 8], "min": [6, 7, 13], "dallev": 6, "contribut": [6, 23], "\ucd94\ub860": [6, 20, 22], "\ub2a5\ub825": 6, "3\uac00\uc9c0\ub97c": 6, "\ub370\uc774\ud130\uc14b": [6, 13, 20, 22, 26, 27, 28], "\uc81c\uacf5\ud569\ub2c8\ub2e4": [6, 10], "\ucd5c\uadfc\uc758": [6, 21], "recognit": [6, 18], "skill": 6, "\uc0c1\ub300\uc801\uc73c\ub85c": [6, 10, 23], "\ub6f0\uc5b4\ub098\uc9c0\ub9cc": [6, 11], "count": [6, 27], "spaial": 6, "relat": [2, 6, 20, 30], "\uc774\ud574": [2, 6], "\ub2a5\ub825\uc740": [6, 20], "\ub5a8\uc5b4\uc9d0\uc744": 6, "\uc874\uc7ac\ud558\ub294": [6, 20], "gender": 6, "skin": 6, "tone": 6, "bias": 6, "\uce21\uc815\ud558\ub294": [6, 20, 24], "metric": [6, 7, 8, 14, 21, 24, 27], "\ubd84\uc11d": [6, 19, 23], "\ucd5c\ucd08\uc758": [2, 6], "\ub17c\ubb38": [6, 8, 13, 18, 21, 22, 24], "web": 6, "\ud559\uc2b5\ud588\uc74c\uc744": 6, "\ubcf4\uc5ec\uc8fc\uc5c8\uc2b5\ub2c8\ub2e4": [6, 12], "social": 6, "\uce21\uc815": [6, 19], "sec": 6, "\uc790\uc138\ud55c": [6, 18, 28], "diagnost": 6, "ex": [2, 6, 8, 21], "who": 6, "work": [6, 20], "nurs": 6, "\ucd1d": [6, 16, 17, 18, 19, 21], "252\uac1c\uc758": 6, "\uc81c\uacf5": 6, "\uc774\ubbf8\uc9c0\ub85c\ubd80\ud130": 6, "\ud0d0\uc9c0\ud569\ub2c8\ub2e4": 6, "autom": 6, "detect": 6, "verifi": 6, "reliabl": 6, "blip": 6, "\uc8fc\uba74\uc11c": 6, "\uc601\uc0c1": 6, "\uc0ac\ub78c\uc758": [6, 17], "\uc131\ubcc4\uc744": 6, "\ub9de\ucd94\uac8c": 6, "\ub2f5\ubcc0\uc744": 6, "\uce21\uc815\ud569\ub2c8\ub2e4": 6, "\uc2e0\uacbd\ub9dd\uc73c\ub85c": 6, "facial": [6, 16], "landmark": 6, "\ucd94\ucd9c\ud558\uace0": [6, 15], "illumin": 6, "\ubcf5\uc7a5\uc744": 6, "\ud0d0\uc9c0\ub41c": 6, "unbias": 6, "uniform": [6, 7, 20, 28], "\uc73c\ub85c\ubd80\ud130": 6, "skew": 6, "\ub418\uc5b4\uc788\ub294\uc9c0": 6, "result": [6, 7, 10, 11, 13, 15], "expert": 6, "per": 6, "profess": 6, "exampl": [6, 7, 17, 19, 24, 27], "averag": [6, 7], "\ud3c9\uac00\ud558\ub294\ub370\uc5d0": 6, "\uc131\uacf5\ud588\uc2b5\ub2c8\ub2e4": 6, "satbl": 6, "\uc6f9\ud06c\ub864\ub9c1\uc744": 6, "\uc874\uc7ac\ud588\uc2b5\ub2c8\ub2e4": 6, "\uce21\uc815\ud558\uae30": 6, "\ub178\ub825\uc774": 6, "\uc9c0\uc18d\ub418\uace0": 6, "\ubbf8\ub798\uc5d0\ub294": 6, "\uc548\uc804\ud558\uac8c": 6, "\ud65c\uc6a9\ub420": 6, "\uc788\uae30\ub97c": 6, "\uae30\ub300\ud569\ub2c8\ub2e4": 6, "denois": [7, 10, 13, 14, 15, 18, 28], "implicit": [7, 25, 28], "iclr": [7, 13, 29], "2021": [7, 11, 13, 22, 23], "2010": 7, "02502": 7, "april": 7, "23": [7, 21], "ddpm\uc758": [7, 9, 11, 16, 23], "\ub2e8\uc810\uc778": 7, "markov": [7, 8], "process\ub97c": [7, 8, 23], "process\ub85c": [7, 8, 11, 23], "\uc815\uc758\ud568\uc73c\ub85c\uc11c": 7, "deterministic\ud55c": 7, "sampling\uc774": [7, 23], "\ubd84\uc57c\uc5d0\uc11c": [2, 7, 10, 11, 16], "adversari": [7, 10, 14, 17, 25], "\ubcf4\uc5ec\uc8fc\uace0\uc788\ub2e4": 7, "gan\uc740": [7, 17, 21], "\uacfc\uc815\uc5d0\uc11c": [7, 8, 9, 10, 13, 14, 16, 18, 19, 21], "\ubd88\uc548\uc815\uc131\uc744": [7, 21], "\ub9ce\ub2e4": 7, "generator\uc640": 7, "discriminator\uc758": 7, "imbalanced\uc5d0": 7, "\uc758\ud55c": [7, 19], "mode": [7, 13, 28], "collaps": [7, 21], "\uadf8\ub7ec\ub358": 7, "ddpm\uacfc": [7, 9, 14], "ncsn\uac19\uc740": 7, "training\uad6c\uc870\uac00": 7, "\ub4f1\uc7a5\ud558\uc600\uace0": 7, "\uc131\uacf5\uc758": 7, "\ubcf4\uc5ec\uc8fc\uc5c8\ub2e4": [7, 16, 20], "ddpm\uc740": [7, 23], "process\uc5d0\uc11c": [7, 11, 14, 23], "\uac70\uce58\ub294\ub370": 7, "\uc774\ub54c\ubb38\uc5d0": 7, "gan\uc5d0": 7, "\ub290\ub9b0": 7, "performance\ub97c": 7, "50k": 7, "less": 7, "than": 7, "about": [7, 30], "20h": 7, "256": [7, 15, 21, 22, 25, 26], "1000h": 7, "ddim\uc740": [7, 23], "chain\uc5d0": 7, "\ub300\uccb4\ud558\uc600\uace0": 7, "\uacb0\uad6d": [7, 9, 11, 22, 23], "\ube60\ub974\uace0": [7, 10], "\ube44\uad50\uc801": [7, 11, 28], "quality\uc758": [7, 9, 11, 14, 16], "\uc0dd\uc131\ud574\ub0b4\uace0": [7, 16], "accel": 7, "ddpm\uacfc\ub294": 7, "\ub2e4\ub974\uac8c": [7, 10, 15, 18, 24], "consistency\ud55c": 7, "\ubcf4\uc5ec\uc90c\uc73c\ub85c\uc368": 7, "latent\uac04\uc758": 7, "interpolation\uc774": 7, "consist": 7, "If": 7, "equival": 7, "process\ub294": [7, 9], "\ub3d9\uc791\ud55c\ub2e4": 7, "\ubbf8\ub798": 7, "\uc2dc\uc810\uc744": [2, 7], "\uc608\uce21\ud558\uae30\uc704\ud574": 7, "\uc2dc\uc810\uc758": [2, 7, 23], "\uc774\uc6a9\ud55c\ub2e4": [7, 9], "\uc2dc\uc810\uc740": 7, "\uacfc\uac70": 7, "\uac12\uc5d0\ub294": 7, "\ub3c5\ub9bd\uc801\uc778": 7, "\uac16\ub294\ub2e4": 7, "t\ub294": 7, "ddpm\uc5d0\uc11c": [7, 9, 11, 23], "\uc88c\uc9c0\uc6b0\uc9c0\ud558\ub294": 7, "hyper": [7, 11, 13, 16], "parameter\uc774\ub2e4": 7, "\ub300\ucda9": 7, "1000": [2, 7, 8, 15, 18], "\ubc88\uc758": 7, "\uacfc\uc815\uc744": [7, 8, 9, 10, 14, 15, 19, 20, 29], "sequential\ud558\uac8c": 7, "\uac70\uccd0\uc57c\ud558\uace0": 7, "\ubcf4\ub2e4": [7, 8, 11, 13, 18, 19, 20, 21, 22, 23, 24, 26, 28], "\ud604\uc800\ud788": [7, 11], "\uc18d\ub3c4\ub97c": [7, 10], "\uc694\uc18c\uac00": 7, "\ub41c\ub2e4": [7, 20, 21, 26], "\uc815\uc758": [7, 11], "\uad6c\ud558\uae30\uc704\ud574": 7, "\uac12\uacfc": 7, "\ucc38\uc870": [7, 10], "\uac12\ub9cc\uc744": 7, "\u03c3\ub294": 7, "process\uc758": [7, 11], "stochastic\ud55c": 7, "chap": 7, "And": 7, "unifi": 7, "revers": [7, 9, 11, 14, 20, 28], "\uc2dd\uc744": [7, 23], "\uc774\uc6a9\ud574": [7, 9, 14, 18, 19, 29], "\uc0d8\ud50c\ub9c1": [7, 18, 23, 26], "\uad00\uacc4": 7, "noise\ub97c": [7, 8, 11, 14, 17], "\uacc4\uc0b0": [7, 8, 19, 20], "fix": [7, 8, 10], "t\uc2dc\uc810\uc758": 7, "\uc608\uce21\ud55c": [7, 9, 10], "\u03c3": 7, "\u03c3\uac00": 7, "\uac00\uc9c8": 7, "\uc218\uc2dd\uacfc": 7, "\ub3d9\uc77c\ud558\ub2e4": 7, "explan": 7, "acceler": [7, 23, 24], "deterministic\ud558\uae30\ub54c\ubb38\uc5d0": [7, 23], "\uacc4\uc0b0\ud560": [7, 23], "\ud544\uc694": [7, 23], "subset\uc758": [7, 23], "\uc2dc\uc810\ub9cc\uc73c\ub85c": [7, 23], "method\ub294": [7, 19, 23], "\uc57d\uac04\uc758": [7, 9, 23], "\uc800\ud558\uac00": [7, 10, 23], "\uc788\uc9c0\ub9cc": [7, 9, 21, 22, 23, 28], "efficiency\ub97c": [7, 23], "\ucda9\ubd84\ud788": [7, 10, 23], "\uc99d\uac00\uc2dc\ud0ac": [7, 23], "ddim\uc758": [7, 23], "od": 7, "encoding\uc774": 7, "\uc720\ub3c4\ud560": 7, "table1": 7, "euqat": 7, "simple\ud558\uac8c": 7, "control\ud558\uae30\uc704\ud55c": 7, "\ud69f\uc218": [7, 10], "\ub0ae\uc740": [7, 8, 10, 11, 15, 20, 22, 24], "3\uc758": [7, 22], "\u03b7\uac00": 7, "step\uc5d0": [7, 11], "figur": [7, 11, 15, 16, 18, 19, 20, 21, 24, 26], "step\uacfc": 7, "time\uc774": 7, "linear\ud55c": 7, "step\uc5d0\uc11c\ub3c4": 7, "\uc5b4\ub290\uc815\ub3c4\uc758": 7, "object\ub97c": 7, "kera": 7, "io": [7, 8, 11, 19, 20, 21, 22], "diffusionmodel": 7, "image_s": 7, "width": [7, 23], "block_depth": 7, "super": [7, 8, 17, 18, 21, 25, 27, 28, 29], "get_network": 7, "unet": [7, 8, 15, 20, 24, 28], "denorm": 7, "convert": [7, 24], "pixel": [7, 22, 28], "back": 7, "rang": [7, 22, 24, 25, 28], "mean": [7, 8, 9, 20, 24], "varianc": [7, 8, 9, 18], "tf": 7, "clip_by_valu": 7, "diffusion_schedul": 7, "diffusion_tim": 7, "angl": 7, "start_angl": 7, "aco": 7, "max_signal_r": 7, "end_angl": 7, "min_signal_r": 7, "diffusion_angl": 7, "signal_r": 7, "co": [7, 8], "noise_r": 7, "sin": [7, 8], "note": 7, "squar": [7, 24], "sum": [7, 13], "alwai": 7, "noisy_imag": 7, "exponenti": [7, 15], "move": [7, 15], "ema_network": 7, "predict": [7, 8, 10, 24, 26, 28], "compon": 7, "calcul": 7, "pred_nois": [7, 8], "pred_imag": 7, "train_step": 7, "have": 7, "standard": [2, 7, 17, 20], "deviat": 7, "like": 7, "shape": [7, 8, 17, 19, 24, 25, 27], "batch_siz": [7, 17, 20, 29], "minval": 7, "maxval": 7, "mix": [7, 18], "accordingli": 7, "gradienttap": 7, "tape": 7, "separ": [7, 17, 24], "noisi": [7, 28], "noise_loss": 7, "image_loss": 7, "trainable_weight": 7, "apply_gradi": 7, "noise_loss_track": 7, "update_st": 7, "image_loss_track": 7, "name": [7, 13], "reverse_diffus": 7, "initial_nois": 7, "diffusion_step": 7, "num_imag": 7, "step_siz": 7, "import": [7, 11, 13], "line": 7, "pure": 7, "its": 7, "assum": 7, "nonzero": 7, "next_noisy_imag": 7, "ones": 7, "remix": 7, "next": 7, "next_diffusion_tim": 7, "next_noise_r": 7, "next_signal_r": 7, "generated_imag": 7, "probabilist": [8, 13, 18], "neurip": [8, 23, 26], "2020": [8, 11], "2006": [8, 13], "11239": [8, 13], "pytorch": [8, 13, 17, 22, 25, 29], "implement": [8, 13, 16, 20, 24, 25, 29], "review": [8, 13, 19, 30], "pr": [8, 13, 24], "409": [8, 13], "beomsoo": [8, 13], "park": [8, 9, 13], "apr": [8, 13, 17, 21, 25, 29], "19": [8, 13], "sourc": [2, 8, 16, 17, 19, 21, 22, 27], "velog": [8, 21, 22], "yetsyl0705": 8, "what": 8, "inference\ub85c": 8, "\ud559\uc2b5\uc2dc\ucf1c": [8, 13], "parameter": 8, "model\uc740": [8, 9, 12, 13, 19], "markov\uac00": 8, "distribution\uc758": 8, "\ud615\ud0dc\ub97c": 8, "\ub54c\uae4c\uc9c0": 8, "\ub354\ud574\uac00\ub294": 8, "\uc5ed\uc73c\ub85c": 8, "\uac70\uce58\uba70": 8, "\uad6c\uc131\ub428": 8, "\uc815\uc758\ud558\uae30": 8, "\uc27d\uace0": 8, "\ud559\uc2b5\uc2dc\ud0a4\ub294": [8, 9, 21], "\ud3b8\ub9ac\ud568": 8, "\ud488\uc9c8\uc758": [8, 9, 21], "\uc0dd\uc131\uc774": [8, 10, 16, 22, 23, 26], "\ubcc0\ubd84\ucd94\ub860": [8, 29], "\uc0ac\ud6c4\ud655\ub960": 8, "posterior": [8, 22, 29], "\ubd84\ud3ec": [8, 22], "\ub2e4\ub8e8\uae30": [8, 29], "\uc26c\uc6b4": [8, 21, 29], "\ud655\ub960\ubd84\ud3ec": 8, "\uadfc\uc0ac": 8, "approxim": [8, 29], "\ud45c\ud604\uc2dd\uc5d0": 8, "\ud45c\ud604\ud558\ub294": [8, 19, 24], "\ubcf4\ud1b5": [8, 10, 13, 17, 18, 19, 21], "parameter\uc758": [8, 9], "\uc2dd\uc758": 8, "\ucc28\uc218\ubcf4\ub2e4": 8, "\uc218\ub85c": 8, "\uc120\ud0dd": [8, 22], "3\ucc28": 8, "\ud45c\ud604\uc2dd": 8, "2\uac1c": 8, "\ud558\ubbc0\ub85c": 8, "\ucc28\uc218\ub85c\uc758": 8, "\ud568\uc218": [8, 21, 22, 23], "3d": 8, "2d": 8, "\uc0c1\ud0dc\uc5d0\uc11c": [8, 10, 21], "\uc0c1\ud0dc\ub85c": [8, 10, 24, 28], "\ub118\uc5b4\uac08": 8, "\ub2e8\uacc4\uc758": [8, 19], "\uc0c1\ud0dc\uc5d0\ub9cc": 8, "\ud655\ub960": [2, 8, 14, 18, 29], "graphic": [8, 26], "_0": 8, "prod_": 8, "quad": 8, "sqrt": [2, 8, 9, 12, 28], "beta_t": 8, "chain\uc73c\ub85c": 8, "data\uc5d0": [8, 21], "\ucd94\uac00\ud560": 8, "schedul": [8, 11, 20, 23, 24, 28], "beta_1": 8, "\ub354\ud574\uc900\ub2e4": 8, "\uc774\uba74": [8, 14, 26], "mean\uc778": 8, "\uc774\uc804": [8, 9, 13, 17], "\uac16\uc9c0": 8, "\ub178\uc774\uc988\uac00": 8, "\uc99d\uac00\ud568": 8, "\ub2e8\uc21c\ud788": [2, 8, 14, 19, 21], "noise\ub9cc\uc744": 8, "\ub354\ud574\uc8fc\ub294\uac8c": 8, "scaling\ud558\ub294": 8, "variance\uac00": 8, "\ubc1c\uc0b0\ud558\ub294": 8, "\ub9c9\uae30": 8, "\uc704\ud568": [8, 21], "x_1": 8, "x_0": [8, 9], "\ub9cc\ub4dc\ub294": [8, 9, 10, 29], "\uc644\uc804": 8, "destroy\ub41c": 8, "\uc0c1\ud0dc": 8, "p_": [8, 9, 13, 22, 23, 25, 29], "boldsymbol": 8, "mu": [8, 17, 23, 29], "sigma": [8, 17, 23, 29], "\uac00\uc6b0\uc2dc\uc548": 8, "\ub178\uc774\uc988\ub97c": [8, 18, 20], "1994\ub144": 8, "process\uac00": [8, 14], "\uac00\uc6b0\uc2dc\uc548\uc774\uba74": 8, "process\ub3c4": 8, "\uac00\uc6b0\uc2dc\uc548\uc73c\ub85c": 8, "\uc4f0\uba74": 8, "\ub41c\ub2e4\ub77c\ub294": 8, "\uc99d\uba85\uc774": 8, "\ud568": [2, 8, 13, 18, 19, 20, 21, 22, 23, 26], "\ud574\uc57c": 8, "mu_": [8, 9], "\ubd84\uc0b0": [2, 8, 18, 23, 29], "sigma_": [8, 23, 24], "hierarach": 8, "vae\uc5d0\uc11c\uc758": 8, "\uacfc\uc815\uacfc": 8, "\ube44\uc2b7\ud568": [8, 19], "\ubaa9\uc801\uc740": 8, "\uc81c\uac70\ud560": 8, "\uac83\uc778\uac00": 8, "\uc774\ub2e4": [2, 8, 13, 20, 23], "\ub4e4\uc5b4\uc654\uc744": [8, 18], "\uc608\uce21\ud560": 8, "\uc608\uce21\uc774": 8, "\uac00\ub2a5\ud574\uc9d0": [8, 19], "mathbb": [2, 8, 12, 13, 24, 25, 28, 29], "leq": 8, "_q": [8, 12], "sum_": [8, 9, 13, 29], "geq": 8, "neg": [8, 9, 16], "likelihood\ub97c": 8, "\ucd5c\uc18c\ud654": 8, "\ubc29\ud5a5\uc73c\ub85c": [8, 14, 16, 25, 28], "\uc9c4\ud589": [8, 16, 22, 23, 26], "\uc218\uc2dd\uc744": [8, 14, 21, 23], "elbo": [8, 22], "evid": [8, 22], "lower": [8, 14, 15, 22, 26], "bound": 8, "\uc6b0\ud56d\uacfc": 8, "\uc815\ub9ac\ud558\uace0": 8, "\ud480\uc5b4\ub0b4\uba74": 8, "elbo\uc758": 8, "\uc5ed\ud560\uc740": 8, "\uad00\ucc30\ud55c": 8, "\ud798\ub4e0": 8, "\ubd84\ud3ec\ub97c": [8, 15, 18, 21, 25, 29], "\uc774\ub8e8\uace0": 8, "\uc870\uae08": 8, "\ubd84\ud3ec\uc778": 8, "\ud45c\ud604\ud558\ub824": 8, "\ucc28\uc774": [8, 14], "kl": [8, 12, 25, 29], "diverg": 8, "\ud558\uae30": [8, 10, 22, 24, 28, 29], "underbrac": 8, "d_": [8, 10, 13, 25], "_1": 8, "\ub098\uc628\ub2e4": [2, 8, 23], "term\uc73c\ub85c": 8, "\ud559\uc2b5\uc2dc\ud0b4": 8, "reconstruct": [8, 15, 19, 24, 29], "\ub9e4": [8, 28], "\ub2e8\uacc4\uc5d0\uc11c": [8, 10, 15, 19], "\uc9c0\uc6b0\ub294": 8, "\uc9c0\uc6c0": 8, "ddpm\uc5d0\uc11c\ub294": [8, 9, 11], "induct": 8, "bias\ub97c": [8, 17, 19], "\ub298\ub824": [8, 18], "stable\ud558\uace0": 8, "\uc131\ub2a5\ub3c4": [8, 18, 20, 27], "\uac1c\uc120\ud560": [8, 11], "\uc788\uc5c8\uc74c": [8, 13, 19, 21], "\ub9cc\ub098\ubcf4\uc9c0": 8, "\ubabb\ud588\ub358": [8, 24], "\uc815\ud655\ud55c": [8, 19, 20, 21], "\uc608\uce21\uc744": [8, 10], "\uac00\uc815": 8, "\ud480\ub824\ub294": 8, "\ubb38\uc81c\uc5d0": 8, "\uc801\uc6a9\ud558\ub294": [8, 13, 24, 28], "\uace0\uc815": [8, 11, 19], "\ud588\ub354\ub2c8": 8, "\uc798\ub428": 8, "02\ub85c": 8, "linear\ud558\uac8c": 8, "image\uc5d0": [8, 9, 19], "\uac00\uae4c\uc6b8\uc218\ub85d": 8, "\uc801\uac8c": [8, 22], "\uc8fc\ub294": [8, 9, 28], "\uc124\uc815": [2, 8, 13, 18, 22], "parameter\uac00": 8, "\uc5c6\uc5b4": [8, 21, 26], "\ub418\uae30": [8, 13, 21], "tild": [8, 10, 11], "beta": [8, 10], "progress": 8, "posterior\ub97c": 8, "\ub354\ud574": 8, "\ub9cc\ub4e4\uc5c8\uc744\ub54c": 8, "\ubcf5\uc6d0": 8, "simplic": 8, "sjina0722": 8, "\ub9ac\ubdf0": [8, 13], "\uc0c1\uc218\ub85c": 8, "\uac00\uc815\ud588\uace0": 8, "\ubc1b\uae30": [8, 16], "\ud559\uc2b5\uc2dc\ud0a4\uc9c0": 8, "\uc54a\uc544\ub3c4": [8, 28], "\ub41c\ub2e4\uace0": 8, "\uc0dd\uac01\ud574": 8, "term\uc744": 8, "\uc81c\uac70": [8, 10, 15], "residu": [8, 9, 10, 21, 23, 24, 26, 28], "estim": [8, 25, 29], "\uad6c\ud558\uc9c0": [8, 25], "\uc54a\uace0": [8, 13, 25, 27, 29], "epsilon_": [2, 8, 12, 23, 28], "\uad6c\ud574": 8, "\uc815\ud655\ub3c4\ub97c": [8, 18], "\ub192\uc784": 8, "d": [2, 8, 12, 13, 17, 25], "int_": 8, "delta_": 8, "sigma_1": 8, "arrai": 8, "ll": [8, 13, 24], "infti": 8, "255": 8, "case": [8, 27], "\uc0ac\uc774\ub85c": 8, "linearli": 8, "\ub2e8\uacc4\uc5d0\ub294": 8, "\ucd94\uac00\ud558\uc9c0": 8, "divergence\ub97c": 8, "\ub098\ud0c0\ub0c4": [2, 8, 18, 21], "\uc88c\ud45c": 8, "final": [8, 9], "\uc704\uc640": [8, 13, 20, 21, 23], "\ub098\ud0c0\ub09c\ub2e4": 8, "ground": [8, 21, 25], "truth": [8, 21, 25], "output\uac04": 8, "\uc904\uc774\ub294": [8, 10], "\uacfc\uc815\uc774": 8, "denoising\uacfc": 8, "\ube44\uc2b7\ud574": 8, "ddpm\uc774\ub77c\ub294": 8, "\uc774\ub984\uc774": [8, 26], "\ubd99\uc74c": 8, "objective\uc744": 8, "\uc5d0\uc11c\ubfd0\ub9cc": 8, "t\uc5d0": 8, "\ub300\ud574\uc11c\ub3c4": [8, 9, 18, 19, 21, 23], "\uac00\ub2a5\ud558\uae30": 8, "\ud6a8\uacfc\uc801": 8, "psuedo": 8, "algorithm": 8, "\ub354\ud574\ub098\uac00\ub294": 8, "epsilon": [2, 8, 9, 10, 12, 20, 23, 24, 28], "\uc5bc\ub9c8\ub9cc\ud07c": 8, "\ub354\ud574\uc84c\ub294\uc9c0\ub97c": 8, "step\uc758": [8, 9], "gaussian": [8, 13, 17, 23, 24, 28, 29], "\ucd94\uac00\ub418\uc5c8\ub294\uc9c0\ub97c": 8, "\uc608\uce21\ud558\ub3c4\ub85d": [8, 11], "\ud559\uc2b5\ub41c\ub2e4": [8, 19], "\ucf54\ub4dc\uc5d0\uc11c\ub294": [8, 13], "\ub79c\ub364": 8, "\ub178\uc774\uc988\uc640": 8, "\ub2e8\uacc4": [8, 25], "t\ub85c": [8, 9], "\uc5bb\uace0": 8, "p_loss": 8, "x_start": 8, "default": [8, 13], "lambda": [8, 21, 24], "torch": [8, 13, 20, 24, 28, 29], "randn_lik": [8, 24], "q_sampl": 8, "do": [8, 17, 19, 28], "set": [8, 13, 20, 21, 24, 26], "slow": 8, "down": [8, 28], "25": [8, 13, 15, 18, 20, 25], "seem": 8, "significantli": [8, 26], "x_self_cond": 8, "self_condit": 8, "no_grad": 8, "model_predict": 8, "pred_x_start": 8, "detach_": 8, "take": 8, "model_out": 8, "pred_x0": 8, "pred_v": 8, "predict_v": 8, "rais": [8, 20, 24], "valueerror": [8, 24], "unknown": [8, 24], "loss_fn": 8, "reduct": [8, 20, 24], "reduc": [8, 24], "extract": 8, "loss_weight": 8, "network\ub97c": [8, 17], "\ud559\uc2b5\ud558\uace0": [8, 18, 22], "\ub098\uba74": [8, 10], "noise\uc5d0\uc11c": 8, "\uc2dc\uc791\ud574\uc11c": [8, 17], "\uc21c\ucc28\uc801\uc73c\ub85c": [8, 22, 28], "markovian": [8, 23], "p_sampl": 8, "int": [8, 20, 25, 28, 29], "devic": [8, 20, 24], "batched_tim": 8, "long": [8, 24], "model_mean": 8, "model_log_vari": 8, "p_mean_vari": 8, "clip_denois": 8, "pred_img": 8, "exp": 8, "backbon": [8, 15], "u": [8, 12, 20, 24, 26, 28], "\uac01": [8, 10, 13, 15, 17, 18, 19, 21, 22, 23, 24, 27, 28], "upsampl": [8, 9, 23, 26, 28], "\ub2e8\uacc4\ub294": 8, "resnet": [8, 18, 20, 23, 28], "convnext": 8, "\ube14\ub85d": 8, "groupnorm": [8, 23], "upsampling\uc73c\ub85c": 8, "block_klass": 8, "resnetblock": 8, "group": 8, "resnet_block_group": 8, "modulelist": [8, 28], "dim_in": 8, "time_emb_dim": 8, "time_dim": 8, "prenorm": 8, "linearattent": 8, "downsampl": [8, 15, 23, 27, 28], "dim_out": 8, "is_last": 8, "conv2d": [8, 13, 28], "init_dim": 8, "out_dim": 8, "dim_mult": 8, "learned_vari": 8, "learned_sinusoidal_cond": 8, "random_fourier_featur": 8, "learned_sinusoidal_dim": 8, "determin": 8, "dimens": [8, 13, 28], "input_channel": 8, "init_conv": 8, "in_out": 8, "list": [8, 20, 28], "random_or_learned_sinusoidal_cond": 8, "sinu_pos_emb": 8, "randomorlearnedsinusoidalposemb": 8, "fourier_dim": 8, "sinusoidalposemb": 8, "time_mlp": 8, "gelu": 8, "num_resolut": 8, "len": [8, 20, 25, 28], "ind": 8, "enumer": [8, 24, 25, 28], "mid_dim": 8, "mid_block1": 8, "mid_attn": 8, "mid_block2": 8, "default_out_dim": 8, "final_res_block": 8, "final_conv": 8, "zeros_lik": 8, "clone": [8, 28], "block1": [8, 28], "block2": [8, 28], "attn": [8, 16], "pop": 8, "resolution\uc5d0": [8, 18, 21], "conv\uc5d0\uc11c": 8, "\ucc28\uc6d0\uc744": 8, "3\ubc30\ub85c": 8, "\ub298\ub9ac\uace0": 8, "v\ub85c": 8, "\ubd84\ud574": [8, 10], "dim_head": 8, "hidden_dim": 8, "to_qkv": 8, "to_out": 8, "qkv": 8, "chunk": [8, 24, 28], "rearrang": 8, "sim": [2, 8, 12, 25, 28], "einsum": 8, "softmax": [8, 12, 22], "layernorm": 8, "block\uc5d0": [8, 9, 23], "sinusoid": 8, "embedding\uc774": [8, 19], "\ucd94\uac00\ub3fc\uc11c": 8, "\uad6c\ubd84\ub428": 8, "half_dim": 8, "math": 8, "10000": 8, "arang": 8, "score": [8, 9, 21, 22, 26, 28], "is\ub85c": 8, "model\uc778\ub370\ub3c4": 8, "model\ubcf4\ub2e4": [8, 9, 14], "\uc6b0\uc6d4": 8, "codelength\uc5d0\uc11c": 8, "\ucc28\uc774\uac00": [8, 11, 18, 19], "\uc5c6\uae30": [8, 21], "overfitting\uc758": 8, "\uac00\ub2a5\uc131\ub3c4": 8, "\uc801\uc74c": 8, "incept": [8, 18, 22], "v3\uc73c\ub85c": 8, "\uacc4\uc0b0\ud55c": [8, 25], "dataset\uc5d0": [8, 9, 22], "\ud559\uc2b5\ub418\uba74": [8, 19], "label": [8, 21, 23, 26], "\ub4f1\uc758": [8, 10, 19, 21], "\uacc4\uc0b0\ud558\ub294": [8, 29], "\uc131\uc801\uc774": 8, "\uc88b\uace0": 8, "variance\ub97c": [8, 11], "\uc0ac\uc6a9\ud588\uc744": [8, 18, 19, 23], "\ub54c\uc5d0\ub3c4": 8, "\uac10\uc18c\ud558\uc9c0": 8, "icml": [9, 10, 22], "2307": 10, "06949": 10, "hyoungseo": 10, "cho": [10, 12], "\ub5a0\uc624\ub974\uace0": 10, "\uc8fc\uc81c\uc785\ub2c8\ub2e4": 10, "fidelity\uc640": 10, "identity\ub97c": 10, "\uc720\uc9c0\ud55c": [10, 24], "\ub9e5\ub77d\uacfc": 10, "\uc2a4\ud0c0\uc77c\uc744": [10, 16, 18, 21], "\ub17c\ubb38\uc740": [10, 17, 18, 21, 24], "\uc9c4\ud589\ub418\uc5c8\uae30": 10, "\ub17c\ubb38\uc744": [9, 10, 18, 23], "\uc77d\uc5b4": 10, "\ubcf4\uc2dc\uae30\ub97c": 10, "\ucd94\ucc9c\ub4dc\ub9bd\ub2c8\ub2e4": 10, "contribution\uc740": [10, 17], "\ud06c\uac8c": [10, 14, 18, 19, 20, 21, 25, 27, 28, 29], "3\uac00\uc9c0\ub85c": 10, "lighweight": 10, "dreambooth\uc758": 10, "\uc720\uc9c0\ud558\uba74\uc11c": [10, 15, 17], "\ud06c\uae30\ub97c": [10, 15, 23, 28], "\uc904\uc774\uace0": 10, "\ub192\uc77c": [10, 19], "hyperdreambooth\ub97c": 10, "\uad6c\ud604\ud588\uc9c0\ub9cc": 10, "e2": [10, 24, 27], "\uc801\uc6a9\uc774": [10, 13, 17], "\uae30\uc220\ub4e4\uc740": 10, "fidelity\uac00": [10, 16, 19, 23, 26], "\ub5a8\uc5b4\uc9c0\uac70\ub098": 10, "\ubb38\ub9e5\uc744": 10, "\uc81c\uacf5\ud558\uc9c0": 10, "\ubb38\uc81c\uac00": [10, 18, 21], "hypernetwork\ub97c": 10, "\ub3c4\uc785\ud55c": [2, 10, 15], "\uc5f0\uad6c\ub97c": 10, "via": 10, "\ub2e4\uc74c\uc73c\ub85c": 10, "personalization\uc744": 10, "finetuning\uc5d0": 10, "svdiff": 10, "lora": 10, "styledrop": 10, "dreamartist": 10, "\uc608\uc2dc\uac00": 10, "\uc18d\ub3c4": [10, 13], "\uce21\uba74\uc5d0\uc11c": [10, 17], "\ub290\ub9ac\ub2e4\ub294": 10, "\ub2e8\uc810\uc744": [10, 25, 29], "\uad00\ub828": [10, 17], "\uc5f0\uad6c\ub4e4\uc744": 10, "hyperdreambooth\ub294": 10, "\uc18d\ub3c4\uc640": 10, "\ud6a8\uc728\uc131": 10, "\ubc1c\uc804\uc744": 10, "\uc774\ub8e8\uc5c8\ub2e4\uace0": 10, "\uc774\uc804\uc5d0": [10, 19, 23], "\ub098\uc628": [9, 10, 12, 22, 23, 26, 28, 29], "dreambooth\ub294": 10, "\uc8fc\uc81c\uc758": 10, "\uc0dd\uc131\ud558\uae30": [10, 19, 21], "\ub124\ud2b8\uc6cc\ud06c\ub97c": 10, "\ud65c\uc6a9\ud588\uc2b5\ub2c8\ub2e4": 10, "hyperdreambooth\uc758": 10, "\uc601\uac10\uc6d0": 10, "\ud558\ub098\ub85c": [10, 13, 19, 28, 29], "\ud65c\uc6a9\ub418\uc5c8\uc2b5\ub2c8\ub2e4": 10, "adapt": [10, 13, 17], "lora\ub294": [10, 13], "\uac00\uc911\uce58\ub97c": [10, 15, 18], "\ub7ad\ud06c\uc758": 10, "\ud589\ub82c\ub85c": 10, "\uadfc\uc0ac\ud654\ud558\uc5ec": 10, "\ud06c\uae30\uc640": [10, 11], "\ubcf5\uc7a1\uc131\uc744": 10, "\ubc29\ubc95\uc785\ub2c8\ub2e4": [10, 17], "\uae30\uc220\uc744": [10, 15, 21, 26], "\ud6a8\uc728\uc801\uc778": 10, "personalization\uc774": 10, "\uac00\ub2a5\ud558\ub3c4\ub85d": [9, 10, 29], "\uc0b4\ud3b4": 10, "contribution\uc758": 10, "\uc0b4\ud3b4\ubcf4\ub3c4\ub85d": [10, 29], "\uae30\uc220": [10, 15, 18], "\ud558\ub098\uc778": [10, 18], "\uc904\uc5ec\uc11c": 10, "lidb\uc5d0": 10, "\uc124\uba85\ub4dc\ub9ac\uaca0\uc2b5\ub2c8\ub2e4": 10, "lidb\ub294": 10, "residuals\uc758": 10, "\uac00\uc911\uce58": [10, 21, 26], "\uacf5\uac04\uc744": 10, "\uc138\ubd84\ud654\ud558\ub294": 10, "\uc544\uc774\ub514\uc5b4\uc785\ub2c8\ub2e4": 10, "\ub0b4\uc5d0\uc11c": [10, 15, 19, 27], "orthogon": 10, "basis\ub97c": 10, "decompos": 10, "\uc811\uadfc": [10, 19], "lora\uc758": 10, "a\uc640": 10, "\ud589\ub82c\uc744": 10, "\ubd84\ud574\ud558\ub294": 10, "\uac83\uc73c\ub85c\ub3c4": 10, "\uc774\ud574\ud560": 10, "\uad6c\uccb4\uc801\uc73c\ub85c": 10, "\uc0b4\ud3b4\ubcf4\uba74": [10, 14], "\ud589\ub82c\uc740": 10, "a_": 10, "aux": [10, 16], "\ubd84\ud574\ub418\uba70": 10, "b_": [10, 11], "\ubd84\ud574\ud560": 10, "\ub808\uc774\uc5b4\ub294": 10, "\ud589\ubcc4\ub85c": 10, "\uc9c1\uad50\ud558\ub294": 10, "\ubca1\ud130\ub85c": [10, 19], "\ubb34\uc791\uc704": [10, 21], "\ucd08\uae30\ud654\ub418\uace0": 10, "\ud559\uc2b5\ub418\ub294": 10, "\uac00\uc911\uce58\uc785\ub2c8\ub2e4": 10, "\uc120\ud615": 10, "\ub808\uc774\uc5b4\uc758": 10, "residual\uc740": 10, "w_x": 10, "experiment": 10, "\ub418\uc5c8\uc73c\uba70": 10, "\uac1c\uc218\ub294": 10, "\uc57d": [9, 10, 19, 21, 22], "30k\uac1c": 10, "\uc0ac\uc774\uc988\ub294": 10, "120kb\ub85c": 10, "\uacbd\ub7c9\ud654": 10, "\ubcc0\uc218\ub9cc\uc73c\ub85c": 10, "fidel": [10, 23, 24, 26], "edit": [2, 9, 10, 14, 19, 20, 28], "\ub4f1\uc744": [10, 21], "\ud3ec\uc778\ud2b8\uc785\ub2c8\ub2e4": 10, "\ub2e4\uc74c\uc740": 10, "\uc0ac\uc804\uc5d0": [10, 24], "\ub098\ud0c0\ub0b4\uba70": 10, "\ub808\uc774\uc5b4\uc5d0": 10, "\uc544\uc774\ub514\uc5b4\ub294": 10, "x\ub97c": 10, "\uc785\ub825\uc73c\ub85c": [10, 18, 21], "\ubc1b\uace0": [10, 26], "lidb\uc758": 10, "residual\uc778": 10, "hat": [10, 12, 24], "h_": [10, 15], "eta": 10, "\ub3cc\uc785\ud558\ub294": 10, "hypernetwork\ub294": 10, "\ub3c4\uba54\uc778": 10, "\ud2b9\ud654": [10, 22], "\ub370\uc774\ud130\uc14b\uc5d0\uc11c": [10, 19, 20, 26], "\ud6c8\ub828\ub418\uba70": 10, "\ud655\uc0b0": 10, "\ub178\uc774\uc988": [10, 15, 24], "\uc190\uc2e4\uacfc": [10, 21], "\uacf5\uac04": [10, 19, 21], "\uc190\uc2e4\uc744": 10, "alpha": [10, 13, 28], "\ubaa9\ud45c\ub294": [10, 19], "pre": [10, 13, 14, 16, 19, 24, 28], "paramters\uc785\ub2c8\ub2e4": 10, "\uac00\uc911\uce58\ub294": 10, "\uad00\ub828\ub41c": [10, 19, 21], "\uc870\uc815\ub429\ub2c8\ub2e4": 10, "\ub098\ud0c0\ub0c5\ub2c8\ub2e4": 10, "supervisori": 10, "\uc870\uac74\uc774": 10, "\uc124\uc815\ub41c": 10, "\uac1c\uc778\ud654\uc5d0": 10, "\uc0c1\ub300\uc801\uc778": 10, "loss\uc758": [10, 11], "\uc81c\uc5b4\ud558\uae30": 10, "\ud56d\ubaa9\uc758": 10, "\ub370": [9, 10, 18, 21], "\uc9c0\uc6d0\ud558\uae30": 10, "\uc785\ub825\uc785\ub2c8\ub2e4": 10, "\ud504\ub86c\ud504\ud2b8\ub294": 10, "\uc9c0\uc2dc\uc0ac\ud56d": 10, "hyperdreambooth\uc5d0\uc11c\ub294": 10, "\uac1c\uc778\ud654\ub41c": [10, 19], "\ub4dc\ubb3c\uc9c0\ub9cc": 10, "\uc758\ubbf8": 10, "\uc218\uc815\uc744": 10, "\uc0bd\uc785\ud560": [10, 19], "\uc5ed\ud560\uc744": [10, 11, 28], "hyperdreambooth\uc5d0\uc11c": 10, "\uad6c\uc870\ub85c": 10, "\uad6c\uc131\ub418\uba70": [10, 15], "\uc608\uce21\ud569\ub2c8\ub2e4": [10, 21], "\uad6c\uc131": 10, "\uc694\uc18c": 10, "\ud558\ub098\uc785\ub2c8\ub2e4": 10, "\uc774\ud6c4": [9, 10, 13, 19], "\uac00\uc911\uce58\uc5d0": 10, "\ub354\ud558\uc5ec": 10, "\uac1c\uc778\ud654\ub97c": 10, "\uc2e4\ud589\ud569\ub2c8\ub2e4": 10, "iter": 10, "\ubc18\ubcf5\uc801": 10, "\uc218\ud589\ud569\ub2c8\ub2e4": 10, "hypernetwork\uac00": 10, "\ucd08\uae30": 10, "\ubc18\ubcf5\uc801\uc778": 10, "\uac1c\uc120\ud558\ub824\uace0": 10, "\uc2dc\ub3c4\ud558\ub294": 10, "\uc608\uce21\uc740": 10, "\ubc29\ud5a5\uc131\uc774": 10, "\uc62c\ubc14\ub974\uace0": 10, "\uc5bc\uad74\uacfc": [10, 15], "\ubbf8\uc138\ub9cc": 10, "\uc138\ubd80": [10, 19], "\uc7a1\uc544\ub0b4\uc9c0": 10, "\ubabb\ud560": [10, 15], "tuning\ud558\uace0": 10, "\ub098\uc740": 10, "\ub54c\uc5d0": 10, "encoding\uc740": 10, "\ubc88\ub9cc": 10, "\uc218\ud589\ub418\uba70": 10, "\ucd94\ucd9c\ub41c": [10, 20], "\ud2b9\uc9d5": [10, 14], "f\ub294": [10, 21], "\uc2e4\ud589\ud558\uace0": 10, "\uc18d\uc131\uacfc": 10, "\ubc29\ud5a5\uc131\uc5d0": 10, "\uc62c\ubc14\ub974\uac8c": 10, "\ub418\uc9c0\ub9cc": 10, "\uc138\ubd80\uc801\uc778": [10, 17, 21], "detail\uc740": 10, "\ubabb\ud569\ub2c8\ub2e4": 10, "dreambooth\ubcf4\ub2e4": 10, "\ube60\ub974\uc9c0\ub9cc": 10, "\uac15\ud55c": [10, 16], "subject": [10, 24], "diversity\ub97c": [10, 16], "\ub3d9\uc77c\ud558\uac8c": [10, 18, 27, 28], "\ucd08\uae30\ud654\ub41c": 10, "x\uc640": [9, 10, 21], "\uc9c0\uc2dc\uc5b4": 10, "c\uc5d0": 10, "\ucd5c\uc18c\ud654\ud558\ub3c4\ub85d": 10, "\uc870\uc815\ud569\ub2c8\ub2e4": 10, "\uc810\uc740": [10, 18, 19], "\uac1c\ub150\uc785\ub2c8\ub2e4": 10, "\uc8fc\ub85c": [9, 10], "\uc644\ud654\ud558\uc5ec": 10, "rank\ub85c": 10, "hypernetwork\uc758": 10, "\uc608\uce21\ub41c": 10, "\uc8fc\uccb4\uc758": 10, "\uace0\uc8fc\ud30c\uc218": 10, "\uc0ac\ud56d\uc744": 10, "\uadfc\uc0ac\ud654\ud560": 10, "\uc774\ub85c": [10, 26], "\uc778\ud574": [10, 15, 21, 26], "\uc81c\ud55c\ub41c": 10, "\uc5c5\ub370\uc774\ud2b8\ubcf4\ub2e4": 10, "\uc8fc\uc81c": 10, "\ucda9\uc2e4\ub3c4\ub97c": 10, "\ub2ec\uc131\ud560": 10, "relaxed\uc758": 10, "\uac1c\ub150\uc740": 10, "\ubc29\uc2dd\ubcf4\ub2e4": 10, "\uc6b0\uc218\ud558\uac8c": [10, 17], "\uc694\uc778\uc785\ub2c8\ub2e4": 10, "\uc5ec\uae30\uc11c\ub3c4": 10, "\uc0ac\uc6a9\ud558\ub294\ub370": 10, "\uc9c0\uc6d0\ud558\uba70": 10, "\uc5bc\uad74\uc5d0": 10, "\ud2b9\uc131\uacfc": 10, "\ucea1\ucc98\ud558\ub294": 10, "\ub3c4\uc6c0\uc774": [10, 19, 22], "\uace0\ub824\ud560": 10, "40\ubc88\uc758": 10, "\ubc18\ubcf5\uc73c\ub85c": 10, "\uc644\ub8cc\ud560": 10, "dreambooth\uc640": 10, "\ube44\uad50\ud588\uc744": [10, 14, 18], "25\ubc30": 10, "\uc18d\ub3c4\ub77c\ub294": 10, "\uad6c\ud604\ud588\uc2b5\ub2c8\ub2e4": 10, "\ubaa8\ub378\uc5d0\uc11c\ub294": 10, "5\uc758": 10, "unet\uc758": [10, 15], "\ud65c\uc6a9\ud558\uae30": 10, "\uc778\ucf54\ub354\ub3c4": 10, "\uac1c\uc778\ud654\ud558\uae30": 10, "\uc2dc\uac01\ud654\uc5d0": 10, "\uc5bc\uad74": 10, "sfhq": 10, "synthet": [10, 20], "headquart": 10, "\ub370\uc774\ud130\uc14b\uc744": [9, 10, 17, 20], "\ud6c8\ub828\uc2dc\ud0a4\uae30": [9, 10], "celeba": 10, "hq": 10, "000\uac1c\uc758": 10, "galleri": 10, "\uc624\ub978\ucabd": [10, 15, 18], "\uc544\ub798\ub85c": [10, 17, 28], "\uc778\uc2a4\ud0c0\uadf8\ub7a8": 10, "\uc140\uce74": 10, "pixar": 10, "\uce90\ub9ad\ud130": 10, "bark": 10, "skin\uc758": 10, "\ub85d": 10, "\uc2a4\ud0c0": 10, "\uc804\ubb38\uc801\uc778": 10, "\ucd2c\uc601": 10, "inversion\uc758": 10, "\ube44\uad50\ud55c": [9, 10, 27], "\ud45c\uc785\ub2c8\ub2e4": 10, "\ud3c9\uac00\ub97c": [10, 15, 21, 22, 25, 26], "dino\uc640": 10, "\uc9c0\ud45c\ub97c": [10, 18, 21], "\ud45c\ub294": 10, "\ubd80\ubd84\uc785\ub2c8\ub2e4": [10, 17, 18], "hyperparameter\ub97c": 10, "\uc870\uc815\ud558\uc5ec": 10, "\ube44\uad50\ud588\uc2b5\ub2c8\ub2e4": [10, 28], "\ud559\uc2b5\ub960\uc744": 10, "\uc99d\uac00\uc2dc\ud0a4\uace0": 10, "\ubc18\ubcf5": 10, "\uac10\uc18c\uc2dc\ud0a4\uba74": 10, "\uacb0\uacfc\uc758": [10, 19], "agg": 10, "1\uc740": [10, 21, 23], "400\ubc88\uc758": 10, "\ubc18\ubcf5\uc744": 10, "\uc2dc\ud589\ud558\uace0": 10, "2\ub294": [10, 21, 23], "1200\ubc88": 10, "\uc694\uc18c\ub85c": 10, "\ub098\ub204\uc5b4": 10, "\uc911\uc5d0\ub294": 10, "\ud558\uc774\ud37c\ub124\ud2b8\uc6cc\ud06c\ub97c": 10, "\ud558\uc774\ud37c\ub124\ud2b8\uc6cc\ud06c": 10, "\uc608\uce21\ub9cc": 10, "1\ubc88\ub9cc": 10, "\ube44\uad50\ud569\ub2c8\ub2e4": 10, "\uacb0\uacfc\uc801\uc73c\ub85c": [10, 21], "\ubc29\ubc95\uc774": [9, 10, 18, 21], "\uc2e0\ub8b0\uc131": 10, "\uc9c0\ud45c\uc5d0\uc11c": 10, "\ub2ec\uc131\ud55c\ub2e4\ub294": 10, "\ubcf4\uc5ec\uc8fc\uace0": [9, 10, 23, 24, 26], "user": [10, 16, 19], "\uc778\uc2dd": 10, "\uba54\ud2b8\ub9ad": 10, "\uc2dc\ub098\ub9ac\uc624\uc5d0\uc11c": 10, "\uc57d\ud558\ub2e4\uace0": 10, "\ub124\ud2b8\uc6cc\ud06c\uac00": 10, "\uc774\ubbf8\uc9c0\uc5d0\ub9cc": 10, "\ud6c8\ub828\ub418\uc5b4": 10, "\uc788\uace0": [9, 10, 14, 25, 27, 28, 29], "\uc2a4\ud0c0\uc77c\uc5d0\uc11c": 10, "\uc0ac\ub78c\uc744": [10, 21], "\uc778\uc2dd\ud558\ub3c4\ub85d": 10, "\uc788\uc9c0": [10, 11, 21], "\uc54a\uae30": [10, 18], "\ub54c\ubb38\uc774\ub77c\uace0": 10, "\uc8fc\uc7a5\ud558\uba70": 10, "\ubcf4\uc644\ud558\uae30": 10, "study\ub97c": 10, "inversion\uc744": 10, "\ube44\uad50\ud558\uace0": 10, "\uc0ac\uc6a9\uc790\ub4e4\uc758": 10, "\ubc1b\uc558\uc2b5\ub2c8\ub2e4": 10, "ups\uac00": 10, "\uc874\uc7ac\ud569\ub2c8\ub2e4": [10, 21, 24], "direct": 10, "error": [10, 20, 29], "\uc608\uce21\uc5d0\uc11c": 10, "\uc798\ubabb\ub41c": 10, "\uc2dc\ub9e8\ud2f1": [10, 21], "\ub098\uc62c": 10, "\uc5d0\ub7ec\uc785\ub2c8\ub2e4": 10, "\ub208": [10, 21], "\uc0c9\uae54\uc774\ub098": 10, "\ud5e4\uc5b4": 10, "\ud0c0\uc785": 10, "\uc131\ubcc4": [10, 19], "\ub4f1\uc774": [10, 18, 28], "captur": 10, "\uc624\ub958\uac00": 10, "underfit": 10, "identity\ub294": 10, "\uc9c0\ucf1c\uc9c0\ub354\ub77c\ub3c4": 10, "\uc720\uc0ac\ud558\uc9c0": 10, "\uc0d8\ud50c\uc774": [10, 21], "\uc0dd\uc131\ub420": 10, "hypernetwork\uc640": 10, "\uc2a4\ud0c0\uc77c\uc5d0": 10, "\ubb38\uc81c\uc810\uc740": 10, "\ube5b": 10, "\ud3ec\uc988": 10, "\ub4f1\uc73c\ub85c": 10, "ood\uc778": 10, "\uc0d8\ud50c\uc5d0\uc11c": 10, "\ub098\ud0c0\ub0a0": 10, "\uc5f0\uad6c\uc5d0\uc11c\ub294": [9, 10], "hyperdreambooth\ub77c\ub294": 10, "\uc18c\uac1c\ud588\uc2b5\ub2c8\ub2e4": 10, "\ubcc0\ud658\ud558\ub294": [10, 21], "\uac00\ubcbc\uc6b4": 10, "\uac1c\uc778\ud654\ud558\ub294": 10, "\ubaa9\ud45c\ub85c": [10, 19, 21, 24], "hypernetwork\ub77c\ub294": 10, "\ud30c\ub77c\ubbf8\ud130\uc778": 10, "\uc0dd\uc131\ud558\uba70": [10, 18], "\uc774\uc5b4\uc11c": 10, "\uae30\ud0c0": 10, "\ucd5c\uc801\ud654": [10, 15, 19], "\uac1c\uc778\ud654": 10, "\uc791\uc5c5\uc5d0": 10, "\uc0c1\ub2f9\ud788": [9, 10, 13, 19], "\uc904\uc774\uba74\uc11c": [10, 12], "\ubb34\uacb0\uc131\uc744": 10, "\uc2a4\ud0c0\uc77c\uacfc": [10, 19], "\uc758\ubbf8\uc801": [10, 19], "\uc218\uc815\uc774": [10, 19], "\uc801\uc6a9\ub41c": [10, 17, 18, 21], "\uc788\uc74c\uc744": [10, 11, 18, 19, 21], "\uc785\uc99d\ud558\uc600\uc2b5\ub2c8\ub2e4": 10, "2102": [11, 22], "09672": 11, "ddpm\uc744": 11, "\uc57d\uac04": 11, "quality\ub97c": [11, 19], "\uc720\uc9c0\ud558\uace0": [11, 17], "likelihood\uc218\uce58\ub3c4": 11, "\ud5a5\uc0c1\ub41c": [11, 15, 19], "sampling\uc2dc": 11, "base": [9, 11, 15, 16, 17, 19, 20, 23, 26, 27, 28, 30], "step\uc73c\ub85c": [11, 15, 23], "\ub0bc": [11, 23], "scale\uacfc": [11, 17], "quailty\uc640": 11, "\uc218\uce58\uac04\uc758": 11, "ho": [9, 11], "et": [9, 11, 21], "al": [9, 11, 21], "quality\uc5d0": 11, "\ubc18\ud574": [11, 14], "\ubaa8\ub378\uc5d0\ube44\ud574": 11, "\ub5a8\uc5b4\uc84c\ub2e4": 11, "ddpm\uc774": 11, "diversity\uac00": [11, 23], "dataset": [11, 19, 20, 21, 22, 23, 25, 26], "cifar": [11, 18, 25], "lsun": 11, "\ub3d9\uc791\ud588\uc9c0\ub9cc": 11, "dataset\uc5d0\uc11c\uc758": 11, "\ub3d9\uc791\uc740": 11, "\uc99d\uba85\ub418\uc9c0": 11, "\ubabb\ud588\ub2e4": 11, "\uc218\uce58": 11, "\uac1c\uc120": [11, 18, 23, 28], "imagenet\uac19\uc740": 11, "dataset\uc5d0\uc11c\ub3c4": 11, "\ub3d9\uc791": 11, "process\uc5d0\uc11c\uc758": 11, "\uc81c\uc548\ud558\uc600\ub2e4": 11, "\ub0b4\ub294": [11, 13, 19], "\ud655\uc778": [11, 16, 20, 22, 26], "\uc5f0\uad6c\ub4e4\uc5d0\uc11c": 11, "loglikelihood": 11, "\uc218\uce58\uc640": 11, "sample\uc758": 11, "quality\uac04\uc758": 11, "\uc5f0\uad00\uc131\uc744": 11, "\ub9ce\uc558\ub2e4": 11, "distribution\uc5d0": [11, 21], "model\uc774": [9, 11], "\uc218\uce58\ud654\ud55c": 11, "\ub290\ub08c": 11, "\uc218\uce58\uac00": 11, "\uc88b\uc544\uc9c0\uba74": 11, "quality\ub3c4": 11, "\uc99d\uac00\ud558\ub294": 11, "\uacbd\ud5a5\uc744": [11, 16, 21], "\ubcf4\uc600\ub2e4": [11, 15, 16], "ddpm\uc5d0\uc11c\ub3c4": 11, "\uc218\uce58\ub97c": [11, 14, 23], "\uac1c\uc120\ud55c\ub2e4\uba74": 11, "\uc99d\uac00\ud560": 11, "\uc54a\uc744\uae4c": 11, "angeloyeo": 11, "github": [11, 13, 19, 20, 21, 22, 24], "07": 11, "17": [11, 23], "mle": 11, "html": [11, 21], "\uc785\ud78c": [11, 16], "\ud615\ud0dc": 11, "denoising\uc5d0": 11, "parameter\ub85c": [9, 11], "noising\ud560": 11, "denoising\uc744": [11, 14], "\uc544\ub798\uc640\uac19\uc774": 11, "\uc0ac\uc6a9\ud574\ub3c4": [11, 28], "\ubcf4\uc5ec\uc11c": [11, 21], "\ubb38\uc7a5": 11, "\uc758\ubb38\uc810": 11, "\uc815": 11, "\ubc18\ub300\uc758": 11, "parameter\uc778\ub370": 11, "\ubcf4\uc600\uace0": 11, "fix\ub97c": 11, "\ud558\ub294\uac8c": 11, "\ub9de\uc744\uae4c": 11, "step\uac04": 11, "\ucc28\uc774\ub97c": [11, 24], "\ube44\uad50\ud574\ubcf4\uba74": 11, "step\uc774": [11, 23], "\ub450\uac1c\uc758": [11, 21], "\ub3d9\uc77c\ud574\uc9c4\ub2e4": 11, "2\ub97c": [11, 15, 18], "\uc131\ub2a5\uc740": [11, 18, 20], "\ucd08\ubc18\uc5d0": [11, 25], "\uacb0\uc815\ub418\ub294\ub370": 11, "\ucd08\ubc18\uc5d0\ub294": 11, "\uac12\uc758": 11, "\uacb0\uc815\ub418\ub294": 11, "\ubd80\ubd84": [11, 15, 21, 22], "\uae09\uaca9\ud558\uac8c": 11, "\ub450\uace0": 11, "\ub450\ub294\uac83\uc740": 11, "\uc124\uacc4\uc758": 11, "miss": 11, "\ud559\uc2b5\ud558\uae30\uc5d0\ub294": 11, "\ubc94\uc704\uac00": 11, "\ub108\ubb34": [2, 9, 11, 15, 16, 21, 22], "\uc791\uc544\uc11c": 11, "predict\ud558\ub3c4\ub85d": 11, "\uc124\uacc4": 11, "hybrid": [11, 23], "l_": [11, 14, 23, 28], "hyprid": 11, "\u03bbl_": 11, "vlb": 11, "\uc774\ubbf8\uc9c0\uc5d0\ub300\ud574": 11, "\ub3d9\uc791\ud558\uc9c0\ub9cc": 11, "32x32": [11, 23], "64x64": [9, 11, 18, 26, 27, 28], "\uc54a\ub294\uac83\uc744": 11, "scheduling\uc5d0\uc11c": 11, "mode\uc758": 11, "limitation\uc774": 11, "\uc9c0\uc801": 11, "\uac70\ub4ed\ub0a0\uc218\ub85d": 11, "\uc0c1\ub2e8": [11, 15], "noisy\ud574\uc9d0": 11, "skip\ud574\ub3c4": 11, "\uc131\ub2a5\uc5d0": [11, 18], "\uc601\ud5a5\uc774": 11, "\uc5c6\uc74c\uc744": 11, "mode\ub97c": 11, "\uc774\ud6c4\uc758": 11, "noise\ub294": 11, "\uc758\ubbf8\uc788\ub294": 11, "\ubbf8\uce58\uc9c0": 11, "\ubabb\ud55c\ub2e4": 11, "equation\uc744": 11, "\uc0c8\ub85c": [11, 27], "\uc2dd\uc740": 11, "\uc911\uac04": [11, 19, 23], "\ub2e8\uacc4\uc5d0\uc11c\ub294": [11, 15], "\uac15\ud558\uac8c": [11, 16], "\uc785\ud600\uc9c0\uc9c0\ub9cc": 11, "0\uacfc": 11, "\ubd80\uadfc\uc5d0\uc11c\ub294": 11, "\ub35c": [11, 23], "direct\ub85c": 11, "\ucd5c\uc801\ud654\ud558\ub3c4\ub85d": 11, "\uc124\uacc4\ud558\uba74": 11, "best": [11, 16, 22, 23], "\uc774\ubbf8\uc9c0\uc640\uac19\uc774": 11, "\uc790\uccb4\uac00": [11, 19], "unstable\ud574\uc11c": 11, "\ucd5c\uc801\ud654\uc5d0\ub294": 11, "\uc5b4\ub824\uc6c0\uc774": 11, "\uc904\uc774\uae30\uc704\ud574": 11, "\ub3c4\uc785": [2, 11], "2\uc5d0\uc11c": [11, 15], "\ub9d0\uae30\ub294": 11, "\ubcc0\ud654\uc5d0": 11, "\uc5c6\uc73c\ubbc0\ub85c": 11, "\ud655\ub960\uc801\uc73c\ub85c": [11, 17], "\ucd08\ubc18\uc758": 11, "sampling\ud574\uc11c": 11, "\ud559\uc2b5\ud558\ub3c4\ub85d": 11, "\uc801\uc6a9\ud574\ubcf8": 11, "\ubcf4\uc784": [11, 21, 22], "sampling\uc744": [9, 11, 22], "\uc801\uc6a9\ud558\uba74": 11, "\uc801\uc6a9": [11, 14, 15, 22, 28], "\uc804\ubcf4\ub2e4": 11, "\uc88b\uc9c0": [11, 15, 16, 18, 28], "\ubcf4\uc778\ub2e4": [9, 11, 16], "\ub2e4\uc18c": [11, 22], "\ucde8\uc57d\ud588\ub358": 11, "imagenet": [9, 11, 15, 19, 20], "64x64\uc640": 11, "cidar": 11, "\uae30\uc900": [11, 22, 23], "convolut": [11, 17, 21, 22, 28], "\ubaa8\ub378\uc774\ub098": 11, "\ubaa8\ub378\uc911\uc5d0\uc11c\ub294": 11, "fulli": [11, 21], "\ube44\ud574\uc11c\ub294": 11, "\ubd80\uc871\ud55c": [11, 24], "\uba74\uc774": 11, "speed\ub97c": 11, "\uba87\uba87": [9, 11], "step\ub9cc": 11, "\uac00\ub3c4": 11, "fid\uac12\uc744": 11, "metric\uc73c\ub85c": 11, "biggan": [11, 23], "big": 11, "\ubaa8\ub378\ubcf4\ub2e4": [11, 18, 22, 27], "\ud0c0\uac9f\uc5d0": 11, "\uc218\uce58\ub098": 11, "recal": [9, 11], "metric\uc5d0\uc11c": 11, "capacity\ub97c": 11, "fid\uc640": [9, 11, 18, 23], "nll": [11, 21], "\ud559\uc2b5\ub7c9": 11, "\uc5b4\ub290\uc815\ub3c4": 11, "\ube44\ub840\ud568": 11, "synthesi": [9, 12, 14, 17, 24], "2112": [9, 12], "10752": 12, "compvi": 12, "namkyeong": 12, "31": [12, 19, 24], "\uc624\ub298": [12, 17], "\uc54c\uc544\ubcfc": [12, 17, 27, 28], "model\uc785\ub2c8\ub2e4": 12, "\ub2e4\ub918\ub358": [12, 17], "\uc720\uc0ac\ud558\uac8c": [12, 15, 18, 27, 28], "\ucef4\ud4e8\ud130": 12, "\uc790\uc6d0\uc758": 12, "\uc18c\ubaa8\ub97c": 12, "\uc5bb\ub294\uac83\uc774": 12, "\ubaa9\ud45c\uc785\ub2c8\ub2e4": [12, 27], "\uc804\ubc18\uc801\uc73c\ub85c": [12, 18], "\uc8fc\uc5b4\uc84c\uc744\ub54c": 12, "\ud1b5\ud574\uc11c": [12, 17, 20], "\ub514\ucf54\ub529\uc744": 12, "\ub418\ub3c4\ub85d": [12, 16], "\ud14c\uc2a4\ud2b8\ub97c": 12, "\uc9c4\ud589\ud558\uc600\ub2e4": [9, 12], "space\uc5d0\uc11c": [12, 19], "\ubd84\uc0b0\uc774": 12, "\ucee4\uc9c0\uc9c0": 12, "\uc54a\ub3c4\ub85d": 12, "divergence\uc640": 12, "quantiz": [12, 22], "vq": 12, "\ud65c\uc6a9\ud558\uc600\ub2e4": 12, "\uc774\ubbf8\uc9c0\uc678": 12, "\ud14d\uc2a4\ud2b8\ub098": 12, "semat": 12, "map\uacfc": 12, "\uc815\ubcf4\ub294": 12, "tau_": 12, "\uc804\ub2ec\uc744": 12, "\ud558\uc600\uace0": [12, 18], "phi_i": 12, "_k": 12, "_v": 12, "\uc815\uc758\ub418\uace0": 12, "\uc911\uac04\uc758": 12, "matrix\uc774\ub2e4": 12, "attention\uc758": 12, "value\uc5d0": 12, "\ud574\ub2f9\ud558\uba70": 12, "qk": 12, "\uc9c4\ud589\ub41c\ub2e4": [12, 20], "\ud568\uc218\ub294": 12, "\uac19\uc774\ud45c\ud604\ub41c\ub2e4": 12, "\uc8fc\ubaa9\ud560\ub9cc\ud55c": 12, "model\uc5d0\uc11c": 12, "dm": [12, 23], "function\uc73c\ub85c": [12, 22, 23], "\uc9c4\ud589\uc2dc\ud0a4\ub294\ub370": 12, "\ubc14\uafb8\uba74\uc11c": 12, "\uc591\uc744": [12, 15], "\uc904\uc600\ub2e4\ub294": 12, "\uc810\uc774\ub2e4": [12, 15, 23], "\uc9c4\ud589\ud558\uc600\ub294\ub370": 12, "\uadf8\uc911": 12, "\uc77c\ubd80\ub9cc": 12, "\uc18c\uac1c\ud558\ub3c4\ub85d": 12, "\ud558\uaca0\ub2e4": 12, "dataset\uc5d0\uc11c": [12, 16, 19, 22], "\ubf51\uc740": [12, 17], "\uc0d8\ud50c\uacfc": [12, 21], "sample\ub4e4\uc785\ub2c8\ub2e4": 12, "laion": [12, 16, 20, 28], "\uc801\uc808\ud55c": [12, 14, 20], "\uc810\uc218\uc640": [9, 12], "\ud6a8\uc728\uc131\uc744": 12, "layout\uc774": 12, "\uc8fc\uc5b4\uc84c\uc744": 12, "layout": 12, "peft": 13, "effeci": 13, "\ud558\ub098": [13, 22], "\uace0\uc815\ud55c": 13, "\ucc44\ub85c": 13, "\uba87": [13, 19, 21, 25], "fc": 13, "layer\ub9cc": 13, "task\uc758": [13, 21], "\uc5f0\uc0b0\ub7c9\uc744": 13, "\uc904\uc77c": [13, 19], "gpt": 13, "3\uc744": 13, "\uae30\uc900\uc73c\ub85c": [13, 18, 20], "parameter\ub294": [13, 18], "10000\ubc30": 13, "gpu": [13, 28], "\uba54\ubaa8\ub9ac\ub294": 13, "3\ubc30\ub97c": 13, "latency\uac00": 13, "\uc5c6\uc74c": 13, "\ud29c\ub2dd\ud558\ub294": 13, "\ud30c\ub77c\ubbf8\ud130\ub9cc\uc744": 13, "\ud29c\ub2dd\ud568\uc73c\ub85c\uc368": 13, "\uc790\uc6d0\uc73c\ub85c\ub3c4": 13, "\ub192\uac8c": 13, "\uc720\uc9c0\ud558\ub294": 13, "\ubc29\ubc95\ub860": 13, "\ud558\ub294\uac83": 13, "upstream": 13, "\ud559\uc2b5\uc2dc\ud0a4\ub294\uac83": 13, "\uc694\uccad\uc758": 13, "\uc2dc\uc791\ubd80\ud130": 13, "\uc644\ub8cc\uae4c\uc9c0": 13, "\uac78\ub9ac\ub294": 13, "llm\uc740": 13, "\uc2dc\ud0b4": [13, 18], "tuning\uc5d0\uc11c": 13, "\ud559\uc2b5\uc2dc\ud0a4\uba74": [13, 20], "roberta": 13, "\ub2ec\uc774": 13, "\uac78\ub9bc": 13, "\uc5f0\uad6c\uc5d0\uc11c": 13, "over": [2, 13, 21, 26], "model\ub4e4\uc740": 13, "intrins": 13, "dimension\uc5d0": 13, "\uae30\ubc18\ud558\uace0": 13, "\uc0ac\uc2e4\uc5d0": 13, "\uc800\uc790\ub294": [13, 19], "\uacfc\uc815\uc5d0\uc11c\ub3c4": 13, "\uac16\uace0": 13, "\uac00\uc815\ud568": [13, 22], "\uace0\uc815\ud558\uace0": [13, 22], "decomposit": 13, "matrices\ub97c": 13, "\ucd5c\uc801\ud654\ud558\ub294": [13, 25], "\uc2dc\ud0a4\uae30\ub85c": 13, "decomposition\ub41c": 13, "\ub354\ud574\uc90c": 13, "\ud06c\uae30\ub294": [13, 15, 28], "\uc791\uc544": 13, "cost\ub97c": 13, "\ucd5c\ub300": [13, 22], "3\ubc30\uae4c\uc9c0": 13, "\ubc14\uafd4\uc8fc\uba74": 13, "storag": [13, 28], "requir": [2, 13], "switch": 13, "overhead\ub97c": 13, "\uc678\uc5d0\ub3c4": 13, "\uc5c6\ub2e4": [2, 13, 14, 23], "\uae30\ubc95\ub4e4\uacfc": 13, "\uac00\ub2a5\ud558\ub2e4\ub294": 13, "\uc7a5\uc810\uc774": [13, 28], "transformer\uc758": [13, 18, 22], "w_q": [13, 28], "w_k": [13, 28], "w_v": [13, 28], "w_o": 13, "module\uc758": 13, "accumulated\ub41c": 13, "\uc5f0\uad6c\uc758": 13, "convention\uc744": 13, "optimizer\ub294": 13, "adam\uc744": 13, "\uc774\uc6a9": 13, "mlp": [13, 29], "feedforward": 13, "ffn": 13, "agnostic\ud558\uc9c0\ub9cc": 13, "model\uc5d0": [9, 13, 14, 19], "\uc9d1\uc911\ud568": 13, "agnost": [13, 22], "\uad6c\uc560\ubc1b\uc9c0": 13, "\ud574\uc11d\uc774": 13, "max": [2, 13], "phi": [13, 22, 23, 28, 29], "y_t": [2, 13], "y_": [13, 23], "parameterized\ub41c": 13, "x_i": [13, 29], "y_i": 13, "target\uc30d\uc73c\ub85c": 13, "phi_0": 13, "\ub418\uace0": [13, 16, 24, 26, 29], "maximize\ud558\uae30": 13, "\uc5c5\ub370\uc774\ud2b8\ub428": 13, "\ud06c\uae30\uc758": [13, 15], "\ud559\uc2b5\ud574": [13, 19], "\uc5c4\uccad\ub09c": 13, "cost\uac00": 13, "\ubc1c\uc0dd": [13, 24], "\ubc18\uba74": [9, 13, 21, 23], "\uc804\uccb4\uac00": 13, "\uadf8\ubcf4\ub2e4": 13, "\ucc3e\uc544\ub0b4\ub294": 13, "\ubc14\ub00c\uae30": 13, "effecient\ud574\uc9d0": 13, "01": 13, "\uae4c\uc9c0": [2, 13, 20, 27], "\uc791\uc544\uc9c8": 13, "\uae30\uc874\uc5d0\ub3c4": 13, "transfer": [13, 16, 19, 24], "learning\uc5d0\uc11c": [13, 22], "effecient\ub97c": 13, "\uac00\uc9c0\uac00": 13, "perform": [13, 22, 26, 28], "\ucd94\uac00\ud558\ub294": [13, 18, 28], "hardwar": 13, "parellelism\uc774": 13, "\uc5c6\ub2e4\uba74": 13, "bottleneck": [13, 22, 28], "\ucd94\uac00\ud574\ub3c4": 13, "\uc99d\uac00\ud574": 13, "\uc0ac\uc6a9\ud558\uae30": [9, 13], "\uc5b4\ub824\uc6e0\uc74c": 13, "prefix": 13, "tuning\uc740": [13, 18, 19], "optimize\uac00": 13, "ba": 13, "\uacf1\ud574\uc9c4": 13, "vector\ub07c\ub9ac": 13, "coordin": 13, "wise\ud558\uac8c": 13, "\uc774\ub77c": 13, "scaling\ub428": 13, "rate\ucc98\ub7fc": 13, "tuning\ud574\uc11c": 13, "r\uacfc": 13, "\uc774\ub098": 13, "\uc0ac\uc6a9\ud55c\ub2e4\uace0": [13, 20], "actual": 13, "defin": 13, "lora_a": 13, "new_zero": 13, "num_embed": 13, "lora_b": 13, "embedding_dim": 13, "lora_alpha": 13, "matrix": 13, "requires_grad": [13, 25], "reset_paramet": 13, "hasattr": 13, "wai": 13, "zeros_": 13, "normal_": [13, 29], "bool": 13, "merge_weight": 13, "sure": 13, "transpos": 13, "mark": 13, "tensor": [13, 20, 25, 28], "after_a": 13, "padding_idx": 13, "max_norm": 13, "norm_typ": 13, "scale_grad_by_freq": 13, "spars": [13, 22, 28], "w_0x": 13, "bax": 13, "lora\ub97c": 13, "\uc774\uc6a9\ud558\uba74": [13, 26], "inference\uc2dc": 13, "\ud558\ub77d\uc774": 13, "\uacbd\uc6b0\uc5d4": 13, "\ucd94\uac00\ud558\uba74": [13, 18], "overhead\uac00": 13, "\ub0ae\uc74c": 13, "\ucd5c\uc18c\ud654\ud558\uae30": [13, 21], "weight\ub9cc": 13, "\uc801\uc6a9\ud558\uace0": 13, "module\uc740": 13, "\uace0\uc815\ud568": 13, "175b\ub97c": 13, "vram\uc740": 13, "2tb\uc5d0\uc11c": 13, "350gb": 13, "checkpoint": [13, 16], "size\ub294": 13, "350gb\uc5d0\uc11c": 13, "35mb\ub85c": 13, "\uc904\uc784": 13, "\ube68\ub77c\uc9d0": 13, "bert": 13, "\ub300\ubd80\ubd84\uc758": 13, "\uacbd\uc6b0\uc5d0\uc11c": 13, "\uc88b\uc74c": [13, 22], "valid": [13, 18, 20, 25], "accuraci": [13, 20], "transformer\uc5d0\uc11c": [13, 22], "matrix\uc5d0": 13, "r\uc744": 13, "\uac83\ubcf4\ub2e4": [13, 20, 26, 28], "matrices\uc5d0": 13, "\uc88b\uc558\uc74c": 13, "\ub274\ub7f4\ub124\ud2b8\uc6cc\ud06c\uc758": 13, "activation\uc744": 13, "\uc904\uc774\uae30\ub3c4\ud558\uace0": 13, "\ub298\ub9ac\uae30\ub3c4\ud558\ub294": 13, "\uc5b4\ub311\ud130\ub97c": 13, "\uc911\uac04\uc5d0": 13, "\uc0bd\uc785\ud558\ub294": 13, "lora\ubcf4\ub2e4": 13, "\uc0ac\uc6a9\ud558\uba74\uc11c": [13, 19], "\uc54c\ub824\uc838\uc788\uc73c\uba70": 13, "3\ub97c": 13, "\ud588\uc744\ub54c": 13, "\ubcf4\ub2e4\ub3c4": [13, 26], "\uc8fc\uc7a5\ud558\uace0": 13, "\ud559\uc2b5\uc2dc\uac04\ub3c4": 13, "\uc9e7\uc544": 13, "a100": [13, 24], "30\ubd84\ub9cc\uc5d0": 13, "\ud29c\ub2dd\ud560": 13, "loralib": 13, "\uc124\uce58": 13, "pip": 13, "instal": 13, "altern": [13, 25], "git": 13, "microsoft": 13, "befor": 13, "in_featur": 13, "out_featur": 13, "after": 13, "add": [13, 24], "parameter\ub9cc": 13, "bigmodel": 13, "string": 13, "lora_": 13, "mark_only_lora_as_train": 13, "loop": [13, 28], "dataload": [13, 25], "checkpoint\ub97c": [13, 16], "\uc800\uc7a5\ud560": 13, "\ub54c\uc5d4": 13, "state_dict": 13, "\uc800\uc7a5\ud558\uac8c": 13, "save": 13, "checkpoint_path": 13, "lora_state_dict": 13, "\ubd88\ub7ec\uc62c": 13, "load_state_dict": 13, "strict": 13, "load": [13, 15, 24], "ckpt_pretrain": 13, "pt": [13, 22], "ckpt_lora": 13, "llm": [13, 26], "\ud29c\ub2dd": 13, "gpu\ub85c": [13, 16], "\uac00\ub2a5\ud560\uae4c": [13, 18], "\uc18c\uac1c\ud569\ub2c8\ub2e4": [13, 18, 24, 27, 28], "da": 13, "nhctrrve": 13, "guid": [14, 27], "differenti": [9, 14, 29], "2108": 14, "01073": 14, "03": [14, 28], "\ubd84\uc57c\uc5d0\uc11c\uc758": 14, "\uc9c4\ud654": 14, "\uc18d\ub3c4\uac00": 14, "\uacc4\uc18d": 14, "\ub418\uc5b4\uc624\uace0\uc788\ub2e4": 14, "\uc0ac\uc6a9\uc790\uac00": [14, 15, 19], "\uc774\ub04c\uc5b4\ub0b4\ub824\ub294": 14, "\ubd84\uc57c\ub3c4": 14, "\ud65c\ubc1c\ud788": [14, 16], "\uc9c4\ud589\ub418\uace0\uc788\ub2e4": 14, "\ubc29\uc2dd\uc73c\ub85c\uc758": 14, "editing\uc5d0\ub294": 14, "\uba87\uac00\uc9c0": [14, 21], "\ub2e8\uc810\uc774": [9, 14, 16], "sdedit\uc740": 14, "\ubb38\uc81c\uc810\uc744": [9, 14, 16, 22, 24], "\ud574\uacb0\ud574\ub098\uc544\uac14\ub2e4\ub294": 14, "\uc810\uc744": [14, 16], "contribution\uc73c\ub85c": 14, "\uc81c\uc2dc\ud558\uc600\ub2e4": 14, "abstract\uc5d0\uc11c": 14, "\ub9d0\ud55c": 14, "editing\uc774\ub780": 14, "\uc720\uc800\uac00": [14, 19], "\uc0dd\uc131\ud558\uace0\uc790": [14, 17, 29], "guide\ub97c": [9, 14], "\uc81c\uc2dc\ud558\uba74": 14, "\uc774\ub54c": [14, 22, 24, 25, 27, 28, 29], "\ub450\uac00\uc9c0\uc758": 14, "\ud3c9\uac00\uc694\uc18c\uac00": 14, "\uc788\ub294\ub370": [2, 14, 15, 16, 20, 28], "faith": 14, "\uc720\uc800\uc758": 14, "\ub530\ub974\ub294\uc9c0": 14, "realist": [14, 27], "real\ud55c\uc9c0": 14, "\uc5f0\uad6c\ubc29\uc2dd\uc740": 14, "\ub450\uac00\uc9c0\ub85c": 14, "\ub098\ub25c\ub2e4": 14, "sota\ub97c": [9, 14, 16, 18, 22, 23], "\uc774\ub8ec": 14, "\uc774\ubbf8\uc9c0\uc5d0\uc11c": [14, 18], "edit\ub41c": 14, "\ub2e8\uc810": 14, "dataset\uc774": 14, "\ud544\uc694\ud558\uace0": 14, "condition\ub9c8\ub2e4": 14, "\uc7ac\ud559\uc2b5\uc744": 14, "\uc694\uad6c": 14, "inversion\ud55c": 14, "vactor\ub97c": 14, "\uc870\uc791\ud574": 14, "function\uc774": [14, 21], "\uc815\uc758\ub418\uc5b4\uc57c\ud558\uace0": 14, "\ud544\uc694\ud558\uc9c0": [14, 21], "\uc54a\ub2e4": 14, "function\uacfc": [14, 21], "\uc7ac\ud559\uc2b5\uc774": 14, "\ud55c\uac1c\uc758": 14, "weight\ub85c": 14, "condition\uc758": 14, "idea": 14, "\uc774\ubbf8\uc9c0\ub4e4\uc740": 14, "\ubd84\ud3ec\uc5d0\uc11c": [14, 19], "\ubd84\ud3ec\uac00": [14, 21, 29], "\ub192\uc740\uacf3\uc73c\ub85c": 14, "\ud574\ub098\uac00\uba74": 14, "\uc5bb\uc5b4\ub0bc": 14, "score\ub294": [14, 18], "\ubc00\ub3c4": 14, "\ud568\uc218\uc758": 14, "\uc21c\uac04": 14, "\uae30\uc6b8\uae30": 14, "\ubbf8\ubd84\uac12": 14, "\uc815\uc758\ud55c\ub2e4": 14, "\uc8fc\uc785\ud558\ub294\ub370": 14, "\uc8fc\uc785\ud55c\ub2e4": 14, "\ub610\ub2e4\ub978": [9, 14], "probabl": 14, "ddpm\uacfc\uc758": 14, "\ucc28\uc774\ub294": [14, 26], "\uc815\uc758\ud558\ub294": 14, "equation\uc758": 14, "\uc815\ub3c4\uc774\ub2e4": 14, "1907": 14, "05600": 14, "setup": 14, "level\uc744": 14, "\uc774\ubbf8\uc9c0\uc704\uc5d0": 14, "patch\ub97c": 14, "stroke\ub97c": 14, "coarse\ud55c": 14, "stroke\uc758": 14, "procedur": 14, "\ub2ec\ub9ac": [9, 14, 18, 21, 25, 29], "sde\uc758": 14, "\uc644\uc804\ud788": [14, 21], "noise\ud654\ub41c": 14, "noise\ub85c\ubd80\ud130": 14, "\uc9c4\ud589\ud560": [14, 22], "\ud544\uc694\uac00": [14, 21, 24], "t_": [2, 14], "\uc9c0\uc815\ud55c": [14, 26], "\uc815\uc758\ud574\uc57c\ud558\ub294\ub370": 14, "realistic\ud558\uc9c0\ub9cc": 14, "\ud558\uc9c0\uc54a\uc740": 14, "faithful\ud558\uc9c0\ub9cc": 14, "artistic\ud55c": 14, "\uc5bb\uac8c\ub41c\ub2e4": 14, "sdedit\uc758": 14, "\uacfc\uc815\uc774\ub2e4": 14, "better": [14, 26], "\uc885\ud569\uc801\uc778": 14, "\uc9c0\ud45c\ub85c": [14, 18], "survey\ub97c": 14, "\ubc29\uc2dd\ub4e4\uacfc": 14, "stylegan": 14, "ada": 14, "sdedit\uc774": 14, "\uc790\uc5f0\uc2a4\ub7fd\uace0": [14, 16], "\ub530\ub974\ub294": [2, 14, 28], "origin": [14, 15], "blend": 14, "\uc804\ud1b5\uc801\uc778": 14, "\uae30\ubc95\uacfc": 14, "\ube44\uad50\ud574\ub3c4": 14, "sdxl\uc740": 15, "diffusion\uacfc": 15, "\ube44\uad50\ud558\uba74": 15, "\ubc30": [15, 22], "\uaddc\ubaa8\uc758": [15, 18, 19], "unet\uc744": 15, "\ube14\ub85d\uacfc": 15, "sdxl\uc5d0\uc11c": 15, "encoder\ub85c": 15, "\uc0ac\uc6a9\ub418\uba74\uc11c": 15, "\ud30c\ub77c\ubbf8\ud130\uac00": 15, "\uc99d\uac00\ud588\ub2e4": 15, "\ub2e4\uc218\uc758": 15, "\ubc29\ubc95\uacfc": [9, 15, 18], "\ube44\uc728\uc5d0": 15, "sdxl\uc744": 15, "\ud559\uc2b5\ud560": [9, 15, 18, 21, 25], "\uc124\uacc4\ud588\ub2e4": 15, "sdxl\uc758": 15, "\uc0d8\ud50c\uc758": [15, 18], "\uc2dc\uac01\uc801\uc778": [15, 19], "fidelity\ub97c": 15, "\ud5a5\uc0c1\uc2dc\ud0a8": 15, "\ub300\ud3ed": 15, "\uc8fc\uc694": 15, "\uae30\ub2a5\uc774\ub77c": 15, "3\ubc30": 15, "\ud615\ud0dc\uc758": [15, 19, 24], "\uac10\ub3c5": 15, "supervis": [15, 21], "\uac04\ub2e8\ud558\uba74\uc11c\ub3c4": 15, "\ud6a8\uacfc\uc801\uc778": 15, "\ucd94\uac00\uc758": 15, "\ud5a5\uc0c1\ud558\ub294": 15, "latent\ub97c": 15, "\ubcc4\uac1c\uc758": 15, "img": [15, 25], "\uadf8\ub9bc": [2, 15, 19, 21], "1\uc5d0\uc11c": 15, "\ub192\uc778": 15, "sdxl\uc774": 15, "sd\ubcf4\ub2e4": 15, "\uc2dc\uac01\ud654\ud588\ub294\ub370": 15, "128x128": 15, "\ud65c\uc6a9\ud558\uace0": 15, "sdedit\uc744": 15, "\uc801\uc6a9\ud55c\ub2e4": 15, "sdxl\uacfc": 15, "autoencoder\ub97c": 15, "sd\uc640": 15, "\ube14\ub85d\uc758": 15, "heterogen": 15, "\uc0ac\uc6a9\ud588\ub2e4\ub294": [15, 23], "\ud14c\uc774\ube14": [15, 23], "1\uc744": 15, "\ucc38\uace0\ud558\uba74": [15, 23], "highest": 15, "level\uc5d0\uc11c": 15, "\ube14\ub7ed\uc744": 15, "level\uc5d0\uc11c\ub294": 15, "unet\uc5d0\uc11c": 15, "lowest": 15, "8x": 15, "conditioning\uc744": [9, 15], "encoder\ub97c": 15, "l\uacfc": 15, "openclip": [15, 20], "bigg\ub97c": 15, "\ucc44\ub110": 15, "\ucd95\uc5d0": 15, "encoder\uc758": [15, 18, 19], "\uc8fc\uae30": [9, 15], "\ub808\uc774\uc5b4\ub97c": 15, "\uc0ac\uc6a9\ud588\uc73c\uba70": 15, "openclip\ub85c\ubd80\ud130": 15, "pool": 15, "embedding\uc744": [15, 19, 23], "condition\uc73c\ub85c": [9, 15, 23], "\ucd94\uac00\ud588\ub2e4": 15, "\ubcc0\ud654\ub294": 15, "\ud30c\ub77c\ubbf8\ud130": [15, 24, 29], "\uc0ac\uc774\uc988\uac00": 15, "6b\ub85c": 15, "encoder\ub294": [15, 18, 29], "817m": 15, "\ud53d\uc140": [15, 26], "\uc774\ud558": [15, 21], "\uc2dc\ud0a4\uac70\ub098": 15, "upscale\ud558\uc5ec": 15, "\ucd5c\uc18c": 15, "\ud06c\uae30\uac00": 15, "\uc815\ud574\uc9c0\ub294": 15, "\ubb38\uc81c\uc810\uc774": 15, "\ubc1c\uc0dd\ud55c\ub2e4": 15, "\uc800\ud558\uc2dc\ud0a4\uac70\ub098": 15, "\uc77c\ubc18\ud654\ub97c": 15, "\uc14b\uc758": 15, "\uc2dc\uac01\ud654\ud574\uc8fc\ub294": 15, "\uadf8\ub9bc\uc774\ub2e4": 15, "\uc81c\uc548\ub41c": 15, "conditiong": 15, "\ud06c\uae30": 15, "\ubbf8\ub9cc\uc758": 15, "39": 15, "\ub098": [15, 24, 26], "\ub2ec\ud55c\ub2e4": 15, "upscal": 15, "blur": 15, "\uac00\uc838\uc640": 15, "\uc544\ud2f0\ud329\ud2b8\uac00": 15, "\uc0dd\uae34\ub2e4": [15, 26], "\uc6d0\ub798\uc758": 15, "\ud574\uc0c1\ub3c4\uc5d0\uc11c": 15, "\uc8fc\uc5c8\ub2e4": [15, 19], "\uc5b4\ub5a0\ud55c": [15, 17, 24], "rescal": [15, 23], "\ud06c\uae30\uc778": 15, "w_": [2, 15, 24], "\uc81c\uacf5\ud574": 15, "\uc904": [9, 15, 21, 23, 26], "\ucd94\uac00\ub41c\ub2e4": 15, "\ud574\uc0c1\ub3c4\ub97c": [15, 17, 27], "\uc815\ud560": 15, "\ud574\uc0c1\ub3c4\uc5d0": 15, "\uc758\uc874\uc801\uc778": 15, "\uc5f0\uad00\uc2dc\ud0a4\ub3c4\ub85d": 15, "imagenet\uc73c\ub85c": 15, "\uc9c4\ud589\ud574": 15, "conditiong\uc5d0": 15, "\uc6b0\uc218\uc131\uc744": 15, "\uc785\uc99d\ud588\ub2e4": 15, "cin": 15, "\uc2dc\ucf30\uace0": 15, "70k": 15, "\uc7a5": 15, "nocond": 15, "\ud45c": [15, 21], "\ubcf4\ub2e4\uc2dc\ud53c": 15, "IS": [9, 15, 22], "4\uc5d0\uc11c": [15, 21], "\uace0\uc591\uc774": [15, 21], "\uba38\ub9ac\uac00": [15, 17], "\uc798\ub824\uc9c4": 15, "cropping\uc73c\ub85c": 15, "\uc0dd\uc131\ub418\uc5c8\uae30": 15, "\ub54c\ubb38\uc774\ub2e4": 15, "\uc81c\uc548\ud55c\ub2e4": [15, 16, 19, 23], "\uade0\ub4f1\ud558\uac8c": 15, "\ub192\uc774": 15, "\ub108\ube44": 15, "\ucd95\uc744": 15, "\ubaa8\uc11c\ub9ac\uc5d0\uc11c": 15, "\ud53d\uc140\uc758": 15, "\uc9c0\uc815\ud558\ub294": 15, "\uc815\uc218": [2, 15], "\uc0d8\ud50c\ub9c1\ud55c\ub2e4": [15, 19], "fourier": 15, "\uc784\ubca0\ub529\uc744": 15, "\ud30c\ub77c\ubbf8\ud130\ub85c\uc368": 15, "\uc785\ub825\ud55c\ub2e4": 15, "conditioning\uacfc": 15, "\uc784\ubca0\ub529": [15, 19], "\ud30c\ub77c\ubbf8\ud130\ub85c": 15, "\ubfd0\ub9cc": [15, 21], "dm\uc5d0\uc11c\ub3c4": 15, "\uc0ac\uc6a9\ub420": [15, 18, 19, 21], "\uac15\uc870\ud55c\ub2e4": 15, "conditioning\uc740": 15, "\uc27d\uac8c": [2, 15, 17, 19], "\uacb0\ud569\ub420": 15, "\ud0c0\uc784\uc2a4\ud15d": 15, "\uc784\ubca0\ub529\uc5d0": 15, "\ucd94\uac00\ud55c\ub2e4": 15, "512x512": [15, 20, 28], "1024x1024": [15, 17, 18, 27], "\ud604\uc2e4": 15, "\uc138\uacc4\uc5d0\uc11c": 15, "\ubd80\uc790\uc5f0\uc2a4\ub7fd\ub2e4": 15, "\uc138\uacc4\uc5d0\uc11c\ub294": 15, "\ube44\uc728\uc744": 15, "\ub9ce\uace0": [15, 18], "\ud48d\uacbd": 15, "\ube44\uc728\uc758": 15, "\uc9c0\ub2c8\uace0": [15, 25, 27], "\ub2e4\ub8f0\uc218": 15, "\ud30c\uc778\ud29c\ub2dd\ud588\ub2e4": 15, "\ud53d\uc140\uc218\ub97c": 15, "\ub9cc\ud07c": [2, 15, 20, 22], "64\uc758": 15, "\ubc30\uc218\ub97c": 15, "\uc9c0\ub2c8\ub3c4\ub85d": 15, "ratio": 15, "\ubc30\uce58\ub294": 15, "\ubc84\ud0b7": 15, "\uc2a4\ud15d\ub9c8\ub2e4": 15, "\ubc88\uac08\uc544": [15, 25], "\uac00\uba70": 15, "\ud0c0\uac9f": 15, "conditioning\uc73c\ub85c": 15, "\uc8fc\uc5c8\uc73c\uba70": 15, "\uacf5\uac04\uc5d0": 15, "\uc784\ubca0\ub529\ub418\ub294": 15, "tgt": [15, 16], "\ud615\ud0dc\ub85c": [15, 23, 28], "\ud45c\ud604\ub41c\ub2e4": 15, "\uace0\uc815\ub41c": [15, 19, 25], "\ube44\uc728\ubc0f": 15, "\ud574\uc0c1\ub3c4\uc758": 15, "pretraining\uc774": 15, "\ub9c8\uce5c": 15, "\ud30c\uc778\ud29c\ub2dd": [15, 18], "\ud559\uc2b5\ud588\uace0": 15, "\ucd95\uc73c\ub85c": 15, "2\uc808\uc5d0\uc11c": 15, "\uc18c\uac1c\ud55c": 15, "\uae30\uc220\uacfc": 15, "\uacb0\ud569\ud588\ub2e4": 15, "16\uc5d0\uc11c": 15, "sd\ub294": 15, "\ud558\ub098\uc774\uace0": 15, "autoencoder\uc758": 15, "space\ub97c": [15, 19], "composition\uc740": 15, "ldm\uc73c\ub85c\ubd80\ud130": 15, "\ud45c\ud604\ub418\uc9c0\ub9cc": 15, "local": [15, 16], "frequenc": [15, 22], "\ub514\ud14c\uc77c\ud55c": 15, "\ud5a5\uc0c1\ud558\uace0\uc790": 15, "\ud5a5\uc0c1\ud588\ub2e4": 15, "\ub05d\uc73c\ub85c": 15, "sd\ub97c": 15, "\uc544\ud0a4\ud14d\ucc98\uc5d0\uc11c": 15, "\ubc30\uce58\uc0ac\uc774\uc988": 15, "average\ub97c": 15, "\uba54\ud2b8\ub9ad\uc5d0": 15, "\uc815\ub9ac\ud574\uc8fc\ub294": 15, "\uc808\uc785\ub2c8\ub2e4": 15, "step\uc740": [15, 18], "step\uc744": [15, 19], "model\ub97c": [15, 19], "\ub0b4\ubd80": 15, "\uc14b\uc73c\ub85c": 15, "2\uc5d0": 15, "\ub098\uc640\uc788\ub294": 15, "\ubd84\ud3ec\uc5d0": [15, 19], "600": 15, "000": [15, 20], "\uc0ac\uc774\uc988\ub85c": 15, "2048\ub85c": 15, "\ud559\uc2b5\uc2dc\ucf30\uace0": 15, "\ub9c8\uce68\ub0b4": 15, "offset": 15, "11": [15, 26], "\uc218\uc900\uacfc": 15, "\ub2e4\uc911": 15, "\ube44\uc728": 15, "\uc601\uc5ed\uc758": 15, "\ube44\uc728\ub85c": 15, "\uacbd\ud5d8\uc801\uc73c\ub85c": 15, "6\ucc98\ub7fc": 15, "\ucc3e\uc558\ub2e4": 15, "\uadf8\ub9bc\uc774": [15, 18], "stage\ub97c": 15, "\ud2b9\ud654\ub41c": 15, "\ubcc4\ub3c4\uc758": [9, 15], "ldm\uc744": [15, 19], "sdedit\uc5d0\uc11c": 15, "ediff": 15, "\ub530\ub790\uc73c\uba70": 15, "\uc2a4\ucf00\uc77c\uc5d0": 15, "inference\uc5d0\uc11c": 15, "diffuse\uc640": 15, "denoise\ub97c": 15, "\ub123\uc5c8\ub2e4": 15, "\uc2a4\ud15d\uc740": 15, "\uc120\ud0dd\uc774\uc9c0\ub9cc": 15, "\ubc30\uacbd": [15, 19], "\uc0ac\ub78c": [15, 21, 26], "\ub514\ud14c\uc77c\uc5d0\uc11c": 15, "13": [15, 21, 26], "\uc788\uc5c8\ub2e4": [9, 15, 16, 20], "your": [16, 20], "One": [16, 19], "2303": [16, 20], "03231": 16, "sty": 16, "lize": 16, "ne": 16, "\ud55c\uc7a5\uc758": 16, "\uc785\ud788\uace0\uc790\ud558\ub294": 16, "\uc9c4\ud589\uc911\uc774\ub2e4": 16, "\uc774\uc804\uae4c\uc9c0\uc758": 16, "\uc5f0\uad6c\ub4e4\uc740": 16, "\ud55c\uc7a5\uc529\uc744": 16, "\ud65c\uc6a9\ud558\ub824\ub294": 16, "\uc2dd\uc774": 16, "\uc8fc\ub97c": 16, "\uc774\ub8e8\uc5c8\ub2e4": 16, "\ubc29\uc2dd\uc5d0\ub294": 16, "face\ub97c": 16, "\uc758\uc874\ub3c4\uac00": 16, "\ucee4\uc11c": [16, 21], "style\uc744": [16, 19], "\uc785\ud788\uae30": 16, "\ud798\ub4e4\ub2e4": 16, "space\uc548\uc5d0\uc11c": 16, "content": [16, 27], "\uc815\ubcf4\uc640": 16, "entangl": [16, 17, 24], "\ub418\uc5b4\uc788\ub2e4": 16, "styo\ub294": 16, "\ud3ec\uc6a9\ud558\ub294": 16, "base\ubaa8\ub378\ub85c": 16, "\ucc44\uc6a9\ud55c\ub2e4": 16, "stage\ub85c": 16, "\uad6c\uc131\ub418\ub294\ub370": 16, "disentangl": 16, "learner": 16, "idl": 16, "\ubd84\ub9ac": 16, "grain": 16, "fcc": 16, "idl\ub85c\ubd80\ud130": 16, "\ubd84\ub9ac\ub41c": 16, "content\uc640": 16, "\uc6d0\ud558\ub294\ub300\ub85c": 16, "\uc7ac\uc870\ud569": 16, "src": 16, "detail\ud55c": 16, "\uc720\uc9c0\ud558\uae30\uc704\ud574": 16, "map\uc744": 16, "\uc7ac\uc0ac\uc6a9\ud558\ub294": 16, "trick\uc744": 16, "\uc81c\uc548\ud588\ub2e4": 16, "gan\uc774": [16, 19], "\ubd84\uc57c\ub97c": 16, "\uc7a5\uc545\ud558\ub358": 16, "\ub4f1\uc7a5\uc73c\ub85c": [16, 18], "\uc8fc\ubaa9\uc744": [16, 26], "\uc2dc\uc791\ud588\ub2e4": 16, "prompt\ub97c": [9, 16, 19], "\uac00\ub2a5\ud574\uc84c\uc9c0\ub9cc": 16, "\ubd80\ubd84\uae4c\uc9c0": 16, "control\ud558\uae30\uc5d0\ub294": 16, "fine\ud55c": 16, "\uc815\ubcf4\uae4c\uc9c0": 16, "model\uc774\ub2e4": 16, "\ubcf4\uc774\uba74\uc11c": 16, "stylegan\uc744": 16, "\ubca0\uc774\uc2a4\ub85c": 16, "dataset\uc744": [16, 26], "\uc758\uc874\uc131\uc774": 16, "\ucee4": 16, "artist": 16, "\uc785\ud788\ub294\ub370": 16, "\ud55c\uacc4\ub97c": [2, 16, 21], "\uac1c\uc120\ud55c": 16, "\uac04\uc758": [9, 16, 21], "transfer\ub97c": 16, "disentagl": 16, "\ubd84\ub9ac\ud558\ub294": 16, "s_": 16, "\ubc18\ub300": [16, 21], "\uc548\uc5d0": [16, 26], "a\uc758": [16, 17], "conext": 16, "\ubc30\uc81c\ud568\uacfc": 16, "\ud3ec\ud568\ud558\uae30\uc704\ud574": 16, "\uc55e\uc5d0": [16, 21, 23], "negat": 16, "\ubd80\uc815\uc758": 16, "\uc758\ubbf8\ub97c": [9, 16, 18], "\ub2e8\uc5b4": [16, 19], "except": 16, "auxiliari": [16, 26], "\uc14b\uc744": [16, 19], "\uad6c\uc131\ud574": 16, "ffhq": [16, 17], "\uc784\uc758\ub85c": 16, "\ud6a8\uacfc": [16, 21, 26], "\ud559\uc2b5\ud568\uc73c\ub85c\uc368": [16, 19, 25], "prompt\uac04": 16, "disentanglement\ub97c": 16, "\ud5a5\uc0c1": [16, 18, 22], "\uc774\ubbf8\uc9c0\uc5d0\ub294": 16, "\uc774\ubbf8\uc9c0\ub9cc\uc758": 16, "\uc8fc\uc785": 16, "style\uacfc": [16, 17], "\uad6c\ubcc4\ud558\ub294\ub370": 16, "\ub3c4\uc6c0\uc744": 16, "\uc90c": 16, "idl\uc758": 16, "\ud559\uc2b5\ub9cc\uc73c\ub85c": 16, "transfer\uac00": 16, "\uc774\ubbf8\uc9c0\ucc98\ub7fc": 16, "\uc783\uc5b4\ubc84\ub9ac\ub294": 16, "\uac1c\uc120\ud558\uae30\uc704\ud574": 16, "\ub3c4\uc785\ud558\uc600\ub2e4": 16, "idl\ub85c": 16, "\uc870\ud569": 16, "recombin": 16, "\uc720\uc9c0\ud558\ub3c4\ub85d": 16, "trick": 16, "ldm\uc740": [16, 19], "\uc8fc\uc785\ud558\uae30\uc704\ud574": 16, "mechanism\uc744": 16, "promt": 16, "paper\uc5d0\uc11c": 16, "m\uc758": 16, "layout\uc5d0": 16, "\ubbf8\uce5c\ub2e4": 16, "mask\ub97c": 16, "\uacfc\uc815\uc5d0": 16, "\uc8fc\uc785\ud569\uc73c\ub85c\uc368": 16, "\uc720\ub3c4": [16, 23], "map\uc758": 16, "replace\ud558\uc9c0\uc54a\uace0": 16, "content\uc5d0": 16, "index\ub9cc": 16, "\uc120\ud0dd\uc801\uc73c\ub85c": 16, "replac": 16, "index": [16, 19], "time\uc5d0\uc11c": 16, "n\ubc88": 16, "\uc0ac\uc6a9\ud568\uc73c\ub85c\uc11c": 16, "n_": 16, "\uc2e4\ud5d8\uc0c1": 16, "\uc774\ud558\uc758": [16, 24], "\ucd94\ucc9c": 16, "5b": [16, 20], "ak47": 16, "m4a1": 16, "adam": [16, 18, 28], "400": 16, "ldm\uacfc": 16, "\ub3d9\uc77c": [16, 21], "styo\uac00": 16, "identity\uc640": 16, "\uc720\uc9c0\ud568\uacfc": 16, "\uc790\uc5f0\uc2a4\ub7fd\uac8c": [9, 16], "\uacb0\uacfc\ubb3c\uc744": 16, "\uc0dd\uc131\ud574\ub0b8\ub2e4": [16, 19], "study\ub3c4": 16, "\ubaa8\ub378\ub4e4\uc5d0": [16, 18], "effect": [16, 17, 27, 28], "contrast": [9, 16, 22, 26], "templat": 16, "\ub123\uace0": 16, "\ud559\uc2b5\ud560\uacbd\uc6b0": 16, "overfitting\uc774": 16, "\uc2ec\ud558\uace0": 16, "\uc815\ubcf4\uc758": 16, "\ubd84\ub9ac\uc5d0": 16, "\uc5b4\ub824\uc6c0\uc744": [9, 16, 22], "detail\uc744": [16, 19], "set\uc758": 16, "trick\ub3c4": 16, "\uc801\uc6a9\ud558\ub294\uac83\uc774": 16, "\uc0dd\uc131\ud574\ub0c8\ub2e4": 16, "inference\ud560": 16, "\ubcf4\uc774\uc9c0\ub9cc": 16, "fcc\ub97c": 16, "\ud3ec\ud568\ud560": 16, "\ub192\uc544\uc838": 16, "significant\ud55c": 16, "\uc0dd\uc131\ub418\ub294\uac83\uc744": 16, "photorealistic\uc5d0\uc11c": 16, "artistic\ud558\uac8c": 16, "\ubc14\ub00c\uace0": 16, "\ub9c8\ucc2c\uac00\uc9c0\ub85c": [16, 18, 19], "\ub098\uc624\ub294": [16, 18, 22, 23, 24], "idl\uacfc": 16, "gan\uc744": [16, 17, 21], "\ubaa8\ub378\ub4e4\ubcf4\ub2e4": [16, 28], "\uc0dd\uc131\ud574\ub0bc": 16, "singl": [16, 19, 28], "10\ubd84\uc774": 16, "\uac78\ub9ac\ubbc0\ub85c": 16, "efficiency\uac00": 16, "\ubabb\ud558\ub2e4\ub294": 16, "2019": 17, "1812": 17, "04948": 17, "huangzh13": 17, "12": [17, 21, 25, 29], "stylegan\uc785\ub2c8\ub2e4": 17, "gan\uacfc": 17, "\ubcc0\uacbd\ud568\uc73c\ub85c\uc368": 17, "\uc62c\ub9ac\uace0": 17, "feature\uc758": 17, "control\uc774": 17, "loss\ub098": 17, "discrimin": [17, 20, 21, 25], "\uac1c\uc120\uc5d0": 17, "\ubcf4\ub3c4\ub85d": 17, "\ud558\uc8e0": 17, "\uc81c\uc548\ud558\uc5ec": 17, "\ub192\uc774\uba74\uc11c": 17, "\uac00\ub2a5\ud574\uc84c\uc2b5\ub2c8\ub2e4": 17, "\uc81c\uc548\ud588\uc2b5\ub2c8\ub2e4": 17, "\uc911\uc5d0\uc11c": [17, 22], "contribution\uc744": [17, 23], "abstract\uc5d0\ub294": 17, "\ubb38\uc7a5\uc774": 17, "lead": 17, "automat": [17, 27], "unsupervis": [17, 21], "ident": [17, 21, 24], "freckl": 17, "enabl": [17, 19], "intuit": 17, "\uc81c\uc548\ud55c": [17, 18], "\uad6c\uc870\uac00": 17, "\uc77c\uc744": 17, "\uc124\uba85\ud558\ub294": [17, 18, 19, 21], "\ubcf4\uc2dc\uba74": 17, "attribute\uc758": 17, "separation\uc774": 17, "\uc598\uae30\ud558\uace0": 17, "\ubd80\ubd84\uc774": [17, 18], "stylegan\uc758": 17, "\ud2b9\uc9d5\uc774\ub77c\uace0": 17, "\uc0ac\uc6a9\uc790\ub294": [17, 19], "\ubaa9\uc801\uc744": 17, "\uc790\uc2e0\uc774": 17, "\ub9cc\ub4e4\uace0\uc790": 17, "\ud488\uc9c8\uc774": 17, "\uc88b\ub354\ub77c\ub3c4": 17, "\uc0ac\uc6a9\uc790\uc758": 17, "\uc758\ub3c4\uc640": 17, "\uc0c1\uad00\uc5c6\ub294": 17, "\ub79c\ub364\ud55c": [17, 18], "\ub0b4\ubc49\uc5b4\uc900\ub2e4\uba74": 17, "\uc2e4\uc6a9\uc131\uc774": 17, "\uc88b\ub2e4\uace0": [17, 18, 28], "\uc5c6\uc744": [17, 26, 27], "\uadfc\ub798\uc5d0": 17, "\uc778\uae30\ub97c": 17, "\uc5bb\uc5c8\ub358": 17, "\uc774\uc720\ub3c4": 17, "\ub204\uad6c\ub098": 17, "\uc810\ub3c4": 17, "\ud55c\ubaab\ud588\ub2e4\uace0": 17, "stylegan\uc740": 17, "controllability\ub97c": 17, "\ubaa8\ub378\uc774\ub77c\ub294": 17, "\uc758\ubbf8\uc788\ub2e4\uace0": 17, "network\ub294": 17, "4x4\uc5d0\uc11c": 17, "1024x1024\uae4c\uc9c0": 17, "\ub192\uc5ec\uc90d\ub2c8\ub2e4": 17, "\uac16\uac8c\ub429\ub2c8\ub2e4": 17, "gan\ud558\uace0": 17, "\ube44\uad50\ud574\uc11c": [17, 22], "\ud2b9\uc774\ud55c": 17, "\uc810\uc774": [17, 18], "z\ub97c": 17, "noise\uc640": 17, "\uc0dd\uac01\ud574\ubcf4\uba74": 17, "\uac70\uccd0\uc11c": 17, "\uad6c\uc870\uc785\ub2c8\ub2e4": 17, "z\ub294": 17, "distribution\uc5d0\uc11c": [17, 23], "\uc0d8\ud50c\ub9c1\uc73c\ub85c": 17, "\uc5bb\uc2b5\ub2c8\ub2e4": 17, "distribution\uc73c\ub85c": 17, "\ubcf4\ub0b4\ub294": 17, "\ubc30\uc6b0\uac8c": 17, "\ub420": [2, 17, 18, 19], "\uac83\uc774\uace0": 17, "\ubd84\ud3ec\ub294": 17, "\uc0dd\uae30\uac8c": 17, "\uc8fc\uc5b4\uc838\uc11c": 17, "\uc5c6\uac70\ub098": 17, "\uc801\uc744": 17, "\uc608\ub97c": [17, 20, 21, 27], "\ub4e4\uc5b4": [17, 20, 21], "\ud53c\ubd80\uac00": 17, "\ud76c\uba74\uc11c": 17, "\uae34": 17, "\uc0d8\ud50c\ub4e4\uc774": 17, "\ud574\ubd05\uc2dc\ub2e4": 17, "\ud53c\ubd80\uc0c9\uacfc": 17, "\uba38\ub9ac": 17, "\uae38\uc774\ub77c\ub294": 17, "feature\ub294": 17, "\uc5bd\ud788\uac8c": 17, "\ud558\ub098\ub97c": [17, 26], "\ubc14\uafc0": [17, 21], "\ud558\ub098\ub3c4": [17, 19], "\ubc14\ub00c\ub294": 17, "\uc77c\uc5b4\ub098\uac8c": 17, "\uc644\ud654\ud558\uae30": 17, "gaussian\uc5d0\uc11c": 17, "learnabl": [9, 17, 24, 28], "w\ub97c": 17, "\uc0ac\uc6a9\ud569\ub2c8\ub2e4": [17, 24], "instanc": [17, 21, 24], "normalization\uc740": 17, "\ucc44\ub110\ub9c8\ub2e4": 17, "\ucde8\ud574\uc8fc\ub294": 17, "normalization\uc5d0": 17, "scale\uc744": [17, 23], "\uacf1\ud574\uc8fc\uace0": 17, "\ub354\ud574\uc8fc\ub294": 17, "vector\uc758": 17, "transformation\uc73c\ub85c": 17, "\uc8fc\uc5b4\uc9c0\ub294": 17, "w\ub294": 17, "\ubcf4\ub0b4\uc9c0\uac8c": 17, "adain\uc758": 17, "\uc218\uc2dd\uc740": 17, "adain\uc740": 17, "\ube14\ub85d\ub9c8\ub2e4": 17, "\uac1c\uc529": 17, "\ub4e4\uc5b4\uac00\uc11c": [17, 19], "style\uc740": 17, "\uc5f4\uc5ec\ub35f": 17, "\ubc88": [17, 20], "adain\uc744": 17, "generator\uc5d0": [17, 19], "\ub4e4\uc5b4\uac00\uac8c": [17, 27], "localization\uc774\ub77c\ub294": 17, "\ud2b9\uc9d5\uacfc\ub3c4": 17, "\uc5f0\uad00\uc774": 17, "\ub9d0\ud558\ub294": 17, "localization\uc774\ub780": 17, "\uc77c\ubd80\ub97c": 17, "\ubc14\uafc8\uc73c\ub85c\uc368": 17, "\ud2b9\uc9d5\ub4e4\uc744": 17, "\uc758\ubbf8\uc785\ub2c8\ub2e4": 17, "\ub2e4\uc74c\uc5d0": 17, "map\ub4e4\uc740": 17, "normalization\ub418\uace0": 17, "style\uc5d0": 17, "\uc758\ud574": [2, 17, 21, 24], "statistics\ub97c": 17, "\uac00\uc9c0\uac8c": 17, "convolution\uc5d0": 17, "\uc801\uc6a9\ub418\uace0": 17, "convolution\uc5d0\uc11c": 17, "normalization\uc774": 17, "\uc218\ud589\ub418\uae30": 17, "layer\uc5d0": 17, "style\uc774": 17, "\ubd84\ub9ac\ub418\uac8c": 17, "\ud559\uc2b5\ub420": [17, 18], "\ucf54\ub4dc": [17, 20], "stylemod": 17, "latent_s": [17, 20], "use_wscal": 17, "lin": 17, "equalizedlinear": 17, "gain": 17, "n_channel": 17, "view": [17, 20, 24, 25, 29], "layerepilogu": 17, "thing": 17, "dlatent_s": 17, "use_nois": 17, "use_pixel_norm": 17, "use_instance_norm": 17, "use_styl": 17, "activation_lay": 17, "noiselay": 17, "activ": 17, "pixel_norm": 17, "pixelnormlay": 17, "instance_norm": 17, "instancenorm2d": 17, "top_epi": 17, "ordereddict": 17, "style_mod": 17, "dlatents_in_slic": 17, "assert": [17, 20], "b\uc758": 17, "style\ub85c": 17, "\ubcc0\uacbd\ud574\uc11c": 17, "\uc774\ubbf8\uc9c0\ub4e4\uc785\ub2c8\ub2e4": 17, "18\uacf3\uc5d0\uc11c": 17, "\uc0ac\uc6a9\ub418\ub294\ub370": 17, "\ucc98\uc74c": [9, 17], "4\uacf3": 17, "coars": 17, "\uadf8\ub2e4\uc74c": 17, "middl": [17, 27, 28], "10\uacf3": 17, "64": [17, 18, 26, 28], "1024": [17, 22, 25, 26], "\uc815\uc758\ud558\uc600\uc2b5\ub2c8\ub2e4": 17, "\uc717": [17, 21], "\ubd80\ubd84\uc5d0\uc11c\ub294": 17, "\ud3ec\uc988\ub098": 17, "\uc2a4\ud0c0\uc77c\uac19\uc774": 17, "\uac08\uc218\ub85d": 17, "\ud2c0\uc744": 17, "\ubd80\ubd84\ub4e4\uc744": 17, "b\uc5d0\uc11c": [17, 26], "\uac00\uc838\uc654\uc74c\uc744": 17, "\uc548\uc5d0\ub294": 17, "\ubc14\ub014": 17, "\uc8fc\uadfc\uae68": 17, "\uba38\ub9bf\uacb0": 17, "\ud53c\ubd80": 17, "\ubaa8\ub378\ub9c1\ud558\uae30": 17, "\ub354\ud574\uc9d1\ub2c8\ub2e4": 17, "\uc548\uc5d0\uc11c\ub3c4": 17, "\ub514\ud14c\uc77c\ub4e4\uc740": 17, "\ub2ec\ub77c\uc9c8": 17, "deviation\uc744": 17, "\uad6c\ud574\ubd24\uc744": 17, "\uc5bc\uad74\ud615\uacfc": 17, "attribute\ub294": 17, "\ubcc0\ud558\uc9c0\uc54a\uc9c0\ub9cc": 17, "noise\uc5d0": 17, "\uc758\ud574\uc11c": [2, 17], "\uba38\ub9ac\uce74\ub77d\uacfc": 17, "\uc0dd\uae40\uc744": 17, "\uc900": [9, 17, 23], "\uc8fc\uc9c0": 17, "\uc5d0\ub9cc": [17, 28], "\uba38\ub9ac\uce74\ub77d\uac19\uc740": 17, "\ub514\ud14c\uc77c\uc774": 17, "\uc81c\ub300\ub85c": 17, "\uc0b4\uc544\uc788\uc9c0": 17, "layers\uc5d0": 17, "\ub4e4\uc5b4\uac04": 17, "\uba38\ub9ac\uce74\ub77d\uc758": 17, "\uc138\ubc00\ud55c": [17, 28], "\ubd80\ubd84\uc5d0": [17, 18, 29], "\ub07c\uce5c\ub2e4\ub294": 17, "localization\uc774": 17, "\ub418\uac8c\ud558\uae30": 17, "mixing\uc774\ub77c\ub294": 17, "\uc55e": 17, "\ucabd": 17, "layer\uc5d0\ub294": 17, "\ub4a4": 17, "generator\uac00": [17, 21], "\uc778\uc811\ud55c": 17, "style\ub07c\ub9ac": 17, "correlated\ub418\uc5b4\uc788\ub2e4\uace0": 17, "\ub9c9\uc544\uc11c": 17, "localization\uc744": 17, "\ub418\uac8c": 17, "\ubaa9\uc801\uc785\ub2c8\ub2e4": [17, 29], "\uc800\uc790\ub4e4\uc774": [17, 18, 26], "\ubc29\ubc95\ub4e4\uc774": [9, 17], "\ud6a8\uacfc\uac00": [17, 28], "\uc788\uc5c8\ub294\uc9c0": 17, "\ud655\uc778\ud574\ubd05\uc2dc\ub2e4": 17, "\ud45c\uc640": 17, "\uc2e4\ud5d8\uc801\uc73c\ub85c": [17, 26], "\ubcf4\uc558\uc744": [17, 21], "\ubc29\ubc95\ub4e4\uc744": 17, "fid\uac00": [17, 18, 23], "variou": [17, 23, 28, 30], "design": [17, 25, 29], "2304": 18, "08466": 18, "jeonghwa": 18, "yoo": 18, "\uc774\ubc88\uc5d0": 18, "\ub9ac\ubdf0\ud560": 18, "\uad6c\uae00": [18, 26], "\ub9ac\uc11c\uce58": 18, "\uadf8\ub8f9\uc5d0\uc11c": 18, "tmlr": 18, "transact": 18, "machin": 18, "research": [18, 27], "2023\uc5d0": 18, "\uc81c\ucd9c\ud55c": 18, "\ub17c\ubb38\uc778": 18, "\uc18d\ub3c4\ub85c": 18, "\ubc1c\uc804\ud558\uace0": 18, "\uc788\ub294\ub370\uc694": [18, 28], "\uc218\uc900\uc774": 18, "\uc5bc\ub9cc\ud07c": 18, "\uc654\ub294\uc9c0": 18, "\ub370\uc774\ud130\uc778": 18, "\ucda9\ubd84\ud55c": 18, "\uc815\ub3c4\uac00": 18, "\ub418\uc5c8\ub294\uc9c0": 18, "augment\ub41c": 18, "\uc815\ub3c4\uae4c\uc9c0": 18, "\uc654\ub294\uc9c0\uc5d0": 18, "\uc2e4\ud5d8\uacfc": 18, "\ub2f5\uc744": 18, "\uc81c\uc2dc\ud569\ub2c8\ub2e4": [18, 21, 27], "\uae00\uc758": 18, "\ubaa9\ucc28\ub294": 18, "\ub0b4\uc6a9\uacfc": 18, "\uad6c\uc131\ud558\uc600\uc2b5\ub2c8\ub2e4": 18, "\uc694\uc57d": [18, 20], "task\uc5d0\uc11c": [18, 21], "augmentation\uc73c\ub85c": 18, "\ubd84\ub958": [18, 21], "tuning\ud558\uc5ec": 18, "\ub2ec\uc131": [18, 19, 22], "imagenet\uc5d0": 18, "tuning\ub41c": 18, "\uc0ac\uc6a9\ud568": [18, 19, 21, 22, 26], "\ud569\uc131": 18, "\uc0ac\uc6a9\ud558\uc600\uc744": 18, "\ub428": [2, 18, 20, 22], "\uae30\uc220\uc801\uc73c\ub85c": 18, "\uc5c4\uccad": 18, "\ub0b4\uc6a9\uc740": 18, "\uc5c6\ub294\ub370\uc694": 18, "\ub2e4\ub9cc": [18, 20], "\uc0ac\uc6a9\ud558\ub358": 18, "\ubc29\ubc95\ub4e4\uacfc\ub294": 18, "imagen\uc744": 18, "\ud588\ub2e4\ub294": 18, "\uc0c8\ub86d\uc2b5\ub2c8\ub2e4": 18, "\uae30\uc220\uc774": [9, 18], "\ubc1c\uc804\ub418\uc5c8\uc2b5\ub2c8\ub2e4": 18, "\ub9cc\ud07c\uc758": 18, "\uc790\uc5f0\uc2a4\ub7ec\uc6b4": 18, "\uc9c8\ubb38\uc774": 18, "\ub2f9\uc5f0\ud558\uace0": 18, "\ucc3e\uace0\uc790": 18, "\uc9c8\ubb38\uc5d0": 18, "\uc774\uc57c\uae30": 18, "imagen\uc774": [18, 26], "ca": 18, "\ud558\uc600\ub2e4": 18, "\ub370\uc774\ud130\uc640": [18, 24, 25, 29], "\uacb0\ud569\ud558\uc5ec": 18, "\ub370\uc774\ud130\uc758": [18, 21], "\uc2dc\uac04\uc774": [18, 23], "\uae38\uc218\ub85d": 18, "\ud5a5\uc0c1\ub418\uc5c8\ub2e4": 18, "\uc788\ub4ef\uc774": [18, 21], "\ub370\uc774\ud130\ub85c\ub9cc": 18, "\uc815\ud655\ub3c4\uc640": 18, "\uc801\ub2e4\ub294": 18, "\uc54c": [9, 18, 21, 26], "\ub354\ud574\uc11c": 18, "\ud559\uc2b5\ud588\uc744": 18, "\ubaa8\ub378\uacfc": 18, "\ubaa8\ub378\ub4e4\uc5d0\uc11c": 18, "\ub54c\ubcf4\ub2e4": 18, "\ud5a5\uc0c1\uc774": 18, "augmentation\uc744": 18, "\ud558\ub824\uace0": 18, "\ud588\ub358": 18, "\ubc29\ubc95\ub4e4\uc5d0": 18, "\uc9e7\uac8c": 18, "\ud590\ub824\uace0": 18, "\ucd5c\uadfc\uc5d0\ub294": 18, "\ubcf4\uac15\ud558\ub294\ub370": 18, "\uc0ac\uc6a9\ub418\uae30": 18, "\uc2dc\uc791\ud588\uc2b5\ub2c8\ub2e4": 18, "\uc608\ub85c": 18, "Is": 18, "readi": 18, "\ub17c\ubb38\uc774": 18, "glide\ub85c": 18, "shot\uacfc": 18, "few": [18, 22], "\uc2dc\ucf30\uc73c\uba70": 18, "glide\ub97c": 18, "\uc138\ud2b8\uac00": [18, 21], "100\uc758": 18, "\uc2dc\ucf30\ub2e4\uace0": 18, "\ud3ec\ud568\ud574\uc11c": 18, "\ub17c\ubb38\ub4e4\uc740": 18, "\uc774\uc6a9\ud574\uc11c": [18, 19], "\ud558\uc5ec\ub3c4": 18, "\uc2dc\ud0a4\uc9c0": 18, "\ubabb\ud588\uc2b5\ub2c8\ub2e4": 18, "\ud558\uc9c0": [18, 19, 20], "\uc54a\uc558\uc2b5\ub2c8\ub2e4": 18, "\ub17c\ubb38\ub4e4\uacfc\ub294": 18, "\ub3d9\uc791\ud558\uace0": 18, "\uc2dc\ud0ac": [18, 20], "\uc6cc\ub099": 18, "\uc4f0\uc5ec\uc11c": 18, "\uc124\uba85\uc740": 18, "\uc0dd\ub7b5\ud558\uace0": 18, "cas\uc5d0": 18, "\uc368\uc838": 18, "\ub0b4\uc6a9\uc73c\ub85c": 18, "\uc18c\uac1c\ud558\uaca0\uc2b5\ub2c8\ub2e4": 18, "cas\ub294": 18, "score\uc640": [9, 18], "\ub9cc\ub4e4\uc5b4\ub0b8": 18, "\uc9c0\ud45c\uc785\ub2c8\ub2e4": 18, "\ub85c\ub9cc": 18, "\ub9cc\ub4e4\uc5b4\ub0c5\ub2c8\ub2e4": 18, "\ub370\uc774\ud130\ub9cc\uc744": 18, "\uc774\uc6a9\ud558\uc5ec": [9, 18, 22], "50\uc744": 18, "\uc2dc\ud0a4\uace0": 18, "cas\uac00": 18, "\ub9cc\uc57d": [2, 18], "imagenet\uacfc": 18, "\ube44\uc2b7\ud558\ub2e4\uba74": 18, "\ubcf4\uc77c": 18, "\uac00\uc815\uc744": [18, 23, 29], "\uc9c0\ud45c\ub77c\uace0": 18, "\uc774\ud574\ud558\uba74": 18, "\uc800\uc790\uc5d0": 18, "\uc758\ud558\uba74": 18, "\uadf8\ub3d9\uc548": 18, "\uc0dd\uc131\ubaa8\ub378\uc758": [9, 18], "\uc54a\uc558\ub2e4\uace0": 18, "\uc0d8\ud50c\ub85c\ub9cc": 18, "\ub5a8\uc5b4\uc84c\uace0": 18, "\ub2f9\uc5f0\ud574\ubcf4\uc785\ub2c8\ub2e4": 18, "\ub5a8\uc5b4\uc84c\ub2e4\uace0": 18, "\uc544\ub9c8\ub3c4": 18, "\ud488\uc9c8": 18, "\ub2e4\uc591\uc131": 18, "\uac83\uc774\ub77c\uace0": 18, "\uc5ec\uae30\uc11c\ub294": [18, 19], "\ud558\uc600\ub294\uc9c0\uc5d0": 18, "\uc124\uba85\uc744": [18, 19], "\ubaa8\ub378\ub85c\ub294": 18, "\uc0ac\uc6a9\ud558\uc600\uc2b5\ub2c8\ub2e4": 18, "\ud074\ub798\uc2a4\uc640": 18, "\uc9c0\uc5d0": 18, "\uace0\ubbfc\uc774": 18, "\ud544\uc694\ud588\ub2e4\uace0": 18, "\uc9e7\uc740": 18, "\ud558\uc600\ub294\ub370": 18, "imagen\uc5d0\uc11c": 18, "\ub2e4\uc591\uc131\uc774": 18, "\uc800\ud558": 18, "\ub418\uba74\uc11c": 18, "\ud604\uc0c1\uc77c": 18, "\ub450\ub2e8\uc5b4": 18, "\ud074\ub798\uc2a4": [18, 20], "\uc774\ub984\uc73c\ub85c": 18, "\uc218\uc815\ud558\uace0": 18, "\ud588\ub2e4\uace0": [9, 18, 23], "tuning\uc774": 18, "\uc774\ubbf8\uc9c0\uace0": 18, "\uc624\ub978\ucabd\uc774": 18, "\uc801\uc6a9\ub418\uc9c0": 18, "imagen\uc785\ub2c8\ub2e4": 18, "\uc544\ub798\uc5d0\uc11c": [18, 21], "\ud074\ub798\uc2a4\uc778": 18, "schipperke\ub97c": 18, "\uc2a4\ud0a4\ud37c\ud0a4\ub77c\ub294": 18, "\uac1c": [18, 20, 21], "\ud488\uc885\uc744": 18, "\uc758\ubbf8\ud558\ub294\ub370": 18, "imagen\uc758": 18, "\uacbd\uc6b0\ub294": [18, 21], "\uaf43\uacfc": 18, "\uc804\ud600": [18, 21], "\uc5c9\ub6b1\ud55c": 18, "\ub9cc\ub4e4\uace0": [18, 19], "\ud588\ub294\uc9c0\ub97c": 18, "\uad6c\uc870\uc5d0\uc11c": 18, "\uc6d0\uc73c\ub85c": 18, "\ud45c\uc2dc\ub41c": 18, "\ub300\ud574\uc11c\ub9cc": [18, 20], "frozen": [18, 26], "\uc6d0\ub798": [18, 21, 23], "imagen\uc5d0\uc11c\ub3c4": 18, "\ubd80\ubd84\uc774\ub77c": 18, "\uc54a\uc558\uace0": 18, "\ucd9c\ub825\uc73c\ub85c": 18, "\uace0\ud574\uc0c1\ub3c4\uc758": 18, "\uc801\uc5b4\uc11c": 18, "210k": 18, "\ud559\uc2b5\ud558\uc600\uace0": 18, "optimizer\uc758": 18, "\uc0ac\uc6a9\ud558\uc600\ub358": 18, "adafactor": 18, "optimizer\ub97c": 18, "\uc0ac\uc6a9\ud558\uc600\ub2e4\uace0": [9, 18], "490k": 18, "\ucd5c\uc801\uc758": 18, "\uc120\ud0dd\uc758": 18, "\uae30\uc900\uc73c\ub85c\ub294": 18, "sampler\uc640": 18, "1k": 18, "10k\uac1c\uc758": 18, "\uc0d8\ud50c\ub4e4\uc5d0": 18, "score\ub97c": [9, 18], "\uacc4\uc0b0\ud588\uc744": 18, "\uc120\ud0dd\ud588\ub2e4\uace0": 18, "\uc815\ud588\ub294\uc9c0\ub97c": 18, "\uc0d8\ud50c\ub9c1\uc758": 18, "\uc18d\ub3c4\ub294": 18, "\ub514\ud4e8\uc804": 18, "\uc2a4\ud15d": 18, "free": [18, 27, 28], "coeffici": 18, "\ub4f1\uc5d0": 18, "\ubc1b\ub294\ub2e4\uace0": 18, "\uac04\ub2e8\ud558\uac8c": [18, 21], "\uc124\uba85\ud558\uba74": 18, "\ud655\ub960\uc801\uc778": 18, "\ub3c4\uc785\ud558\uc5ec": 18, "\ub2e4\uc591\uc131\uc744": 18, "\uc99d\uac00\uc2dc\ud0a4\ub294": 18, "\uc77c\ubc18\uc801\uc73c\ub85c": [18, 19], "\uc7a0\uc7ac": 18, "\uacf5\uac04\uc758": 18, "\ubcf4\uc774\uac8c": 18, "\ub9cc\ub4e4\uba70": 18, "understand": [18, 22, 26], "\ucc38\uace0\ud574\uc8fc\uc138\uc694": 18, "\ubd84\ub958\uae30\ub098": 18, "\uc9c0\ud45c": [18, 23], "\uc678\ubd80": 18, "\uc0ac\uc6a9\ud55c\ub2e4\ub294": 18, "\ubc18\uc601\ud560\uc9c0\ub97c": 18, "\uc758\ubbf8\ud560": 18, "\uc870\uc808\ud558\uc5ec": 18, "\ud2b9\uc131\uc774\ub098": 18, "\uc0dd\uc131\ud558\ub3c4\ub85d": [9, 18, 19], "\ubd84\ud3ec\uc758": 18, "\ubcc0\ub3d9\uc131\uc744": 18, "\uacc4\uc218\ub97c": 18, "\ud3c9\uade0\uacfc": [18, 26], "\ubd84\uc0b0\uc744": [18, 26], "\uc870\uc808\ud568\uc73c\ub85c\uc368": 18, "\ub85c\uadf8": 18, "\ud63c\ud569": 18, "\uacc4\uc218\ub294": 18, "\uc0ac\uc6a9\ub418\uba70": 18, "\uc758\ubbf8\ud558\uace0": 18, "\uc758\ubbf8\ud568": 18, "\uc0dd\uc131\uc758": 18, "\uc124\uc815\ubc95\uc5d0": 18, "\uc124\uba85\ud558\uaca0\uc2b5\ub2c8\ub2e4": [18, 23], "\uc804\ubc18\uc801\uc778": [18, 19, 24], "\ud2b9\uc9d5\uacfc": 18, "\ub2e4\uc591\uc131\uc758": 18, "\uc8fc\uac8c": [18, 21], "1\ucc28": [2, 18], "sweep\uc73c\ub85c": 18, "ddpm": [9, 18, 26], "\uc0d8\ud50c\ub7ec\ub97c": 18, "50k\uc5d0": 18, "\ud558\uc774\ud37c\ud30c\ub77c\ubbf8\ud130\ub97c": 18, "\ucc3e\uc2b5\ub2c8\ub2e4": 18, "sweep\uc758": 18, "\ud558\uc774\ud37c\ud30c\ub77c\ubbf8\ud130\uc758": 18, "\ubc94\uc704\ub294": 18, "75": 18, "128": [18, 21, 25], "sweep": 18, "fid\ub294": 18, "variance\ub294": 18, "1000\uc774\uc5c8\uc744": 18, "\ub54c\ub77c\uace0": 18, "sweep\uc774": 18, "\ub05d\ub09c": 18, "\ud6c4\uc5d0\ub294": 18, "weight\uc5d0": 18, "sweep\uc744": 18, "\ub54c\uc5d0\ub294": [18, 22], "2m": 18, "guidacn": 18, "cas\ub97c": 18, "\uce21\uc815\ud588\ub2e4\uace0": 18, "\ud558\uc774\ud37c\ud30c\ub77c\ubbf8\ud130\uc5d0": 18, "sweep\uc5d0": 18, "\uacb0\uacfc\uace0": 18, "\uac00\uc6b4\ub370\uc640": 18, "2\ucc28": 18, "\uacb0\uacfc\ub85c": 18, "\ub098\ud0c0\ub0b8": 18, "\uc774\uc81c": 18, "\ub2e4\uc74c\uc73c\ub85c\ub294": 18, "\uc120\ud0dd\ud558\ub294": [18, 22], "range\ub294": 18, "30": [18, 27], "denos": 18, "129": 18, "\uadf8\ub798\ud504\ub294": 18, "\uc124\uc815\ud558\uace0": [18, 24], "\ubcc0\uacbd\ud588\uc744": 18, "cas\uc758": 18, "\uadf8\ub798\ud504\ub97c": 18, "\uadf8\ub798\ud504\uc785\ub2c8\ub2e4": 18, "logvar": [18, 29], "coeff\uac00": 18, "3\uc77c": 18, "\ubcf4\uc600\uc73c\uba70": 18, "\uacbd\uc6b0\ub3c4": [18, 21], "\ubcf4\uc778": [18, 22], "\ubd84\uc11d\ud574\ubcf4\uc790\uba74": 18, "\uc0c1\uad00\uad00\uacc4\uac00": 18, "weight\uac00": 18, "\ub192\uc544\uc9c0\uc9c0\ub9cc": 18, "score\uc5d0\ub294": 18, "\ubd80\uc815\uc801\uc778": 18, "\uc8fc\uba70": [18, 26], "augmentation\uc774": 18, "0\uc77c": 18, "\ud558\uc774\ud37c\ud30c\ub77c\ubbf8\ud130": 18, "\uc124\uc815\ud55c": 18, "\uac19\ub2e4\uace0": 18, "\ubca0\uc774\uc2a4": 18, "\ub098\uba38\uc9c0": [18, 19, 27], "sampler": 18, "\ud569\uc131\uc740": 18, "\ud504\ub85c\ud1a0\ucf5c\uc744": 18, "\ub530\ub790\ub294\uc9c0\uc5d0": 18, "balance\ub97c": 18, "\uc720\uc9c0\ud558\uba70": 18, "\ud569\uc131\ud588\uc73c\uba70": 18, "\ud569\uc131\ub41c": 18, "\uaddc\ubaa8\ub294": 18, "1\ubc30\uc778": 18, "10\ubc30\uc778": 18, "12m": 18, "\ubc94\uc704\ub97c": 18, "\uac00\uc9c0\ub3c4\ub85d": 18, "\ud569\uc131\ud588\ub2e4\uace0": 18, "\ud0dc\uc2a4\ud06c\uc5d0\uc11c": 18, "\uc9c0\ud45c\uc778": 18, "is\uc758": 18, "\uad00\uc810\uc73c\ub85c": 18, "\ubd05\ub2c8\ub2e4": 18, "\ud45c\uc5d0\uc11c": 18, "\ud30c\uc778": 18, "\ud29c\ub2dd\ub41c": 18, "\ubca0\uc774\uc2a4\ubaa8\ub378\ub4e4": 18, "is\uac00": 18, "resolution\uacfc": 18, "resolution\uc5d0\uc11c": 18, "\ud574\ub2f9\ub418\uc5c8\uc2b5\ub2c8\ub2e4": 18, "\ud655\uc778\ud558\ub294": 18, "5\uc5d0\uc11c": [18, 19, 21], "\ud30c\ub780\uc0c9": [2, 18], "\uc131\ub2a5\uc774\uace0": 18, "\ube68\uac04\uc0c9": 18, "\uc131\ub2a5\uc785\ub2c8\ub2e4": 18, "\ubca0\uc774\uc2a4\ub77c\uc778": 18, "cdm": 18, "\uadf8\ub9bc\uc774\uba70": 18, "\uac00\uc6b4\ub370\ub294": 18, "\uc624\ub978\ucabd\uc740": 18, "\ubd80\ubd84\ubcf4\ub2e4": 18, "\uc704\ucabd\uc5d0": 18, "\uc704\uce58\ud558\uba74": 18, "\ud574\uc11d\ud560": 18, "\ubca0\uc774\uc2a4\ub77c\uc778\ubcf4\ub2e4": 18, "\ubcf4\uc778\ub2e4\ub294": 18, "2\uc5d0\uc11c\ub3c4": 18, "\uc8fc\ubaa9\ud560": 18, "\ub9cc\ud55c": [18, 20], "resnet50\uc774": 18, "256x256\uc73c\ub85c": [9, 18], "\ub2e4\uc6b4\uc0d8\ud50c\ub9c1": 18, "\ud568\uc5d0\ub3c4": 18, "\uc88b\ub2e4\ub294": [18, 27], "our": [18, 19, 22, 30], "resolution\ubcf4\ub2e4": 18, "resolution\uc758": 18, "\uc6d4\ub4f1\ud788": [18, 27], "\ub192\uc74c": [18, 26], "\uc885\ub958\uc758": 18, "\uc2dc\ucf30\uc744": 18, "cas\uc640": 18, "cas\uc5d0\uc11c\ub294": 18, "resnet50": 18, "\ud655\uc778\ud588\uc9c0\ub9cc": [18, 28], "\uc774\uc678\uc5d0": 18, "\ubaa8\ub378\ub85c\ub3c4": 18, "\ubcf8\ub2e4\ub294": 18, "\ucc28\uc774\uc810\uc774": 18, "\uc0b4\ud3b4\ubcf8": 18, "\ub0ae\uc558\uc9c0\ub9cc": 18, "\ud569\uccd0\uc11c": 18, "\ub370\uc774\ud130\ub9cc": 18, "\uc99d\uac00\ud55c": [2, 18], "onvnet\uae30\ubc18": 18, "\uc591\uc0c1\uc744": 18, "\ubcf4\uc600\uc2b5\ub2c8\ub2e4": 18, "\uaddc\ubaa8\uc5d0": 18, "50\uc758": 18, "\ubd84\uc11d\ud55c": 18, "\uc99d\uac00\ud568\uc5d0": 18, "\uc9c0\uc18d\uc801\uc73c\ub85c": 18, "8m": 18, "\uaddc\ubaa8\uac00": [9, 18], "\ub54c\uae4c\uc9c0\ub294": 18, "\uc88b\uc558\uc73c\ub098": 18, "\uc774\uc0c1\uc758": 18, "\ub418\uc5c8\uc744": 18, "\uc624\ud788\ub824": 18, "\uacb0\ub860": [18, 20], "\ubcf4\uc790\uba74": 18, "sclae": 18, "\ud30c\uc778\ud29c\ub2dd\ud558\uc5ec": 18, "\uc9c0\ud45c\uc5d0": 18, "\ub2ec\uc131\ud588\uc2b5\ub2c8\ub2e4": 18, "76": 18, "239": 18, "96": 18, "69": 18, "\uadf8\ub807\uac8c": [18, 19], "resnet\uacfc": 18, "accuracy\ub97c": 18, "\uc2dc\ucf30\uc2b5\ub2c8\ub2e4": 18, "\uacb0\uacfc\uc5d0": [18, 20, 21], "\uc0dd\uac01\ud574\ubcfc\ub9cc\ud55c": 18, "\uac70\ub9ac\ub4e4\uc774": 18, "\uc788\uc5c8\ub294\ub370": 18, "\ud558\ub098\ub294": 18, "\uce21\uc815\ud560": 18, "\uc785\ub825\uc744": 18, "256x256\ubcf4\ub2e4": 18, "1024x1024\uc758": 18, "\ub2e4\uc6b4\uc0d8\ud50c\ub9c1\uc744": 18, "\ud558\ub354\ub77c\ub3c4": 18, "resolution\uc774": 18, "\ud074": 18, "\ub2f4\ub294\ub2e4\ub294": 18, "\uac83\uc77c": 18, "\uc815\ud655\ub3c4\uac00": 18, "\uc99d\uac00\ud588\uc9c0\ub9cc": 18, "\ub370\uc774\ud130\uc5d0\uc11c\ub294": 18, "\uadf8\ub807\uc9c0": [18, 20], "\uc54a\uc558\ub358": 18, "\uace0\ud574\uc0c1\ub3c4\uc5d0": 18, "\uc815\uad50\ud55c": 18, "\ud544\uc694\ud560": [18, 21], "\uc2dc\uc0ac\ud558\uace0": 18, "\ub9ac\ubdf0\ub97c": 18, "\ub9c8\uce58\uaca0\uc2b5\ub2c8\ub2e4": 18, "\ub290\ub080": 18, "\uc0b0\uc5c5\uc5d0\uc11c\ub294": 18, "shortage\ub098": 18, "imbal": 18, "\ubc1c\uc0dd\ud558\ub294\ub370": 18, "\ud574\uacb0\ubc95": 18, "\ud558\ub098\uac00": [18, 23], "\uac19\ub2e4\ub294": 18, "\ub4e4\uc5c8\uc2b5\ub2c8\ub2e4": 18, "\ud30c\uc778\ud29c\ub2dd\uc774": 18, "\ub418\uc9c0": [18, 20], "\uc0b0\uc5c5\uc5d0\uc11c\ub9cc": 18, "\ud14d\uc2a4\ud2b8\uac00": 18, "\uc788\uc744\uae4c": 18, "\ud569\uc131\ud558\uace0\uc790": 18, "\ub370\uc774\ud130\uc14b\uc5d0": [18, 21], "\ud30c\uc778\ud29c\ub2dd\uc744": 18, "\ud574\uc57c\ud558\ub294": 18, "\uaf64\ub098": 18, "\ubd88\ud3b8\ud560": 18, "\uac19\uc544\uc11c": 18, "\uac16\ub294\uc9c0": 18, "\uc788\uc5c8\uc73c\uba74": 18, "\uc88b\uc558\uc744": 18, "\uac1c\uc778\uc801\uc778": 18, "\uc720\ucd94\ud574\ubcfc": 18, "\uc21c": 18, "\uc788\uc9c0\ub9cc\uc694": 18, "worth": 19, "2208": [19, 24], "01618": 19, "devocean": 19, "techboarddetail": 19, "page": 19, "id": 19, "164320": 19, "boardtyp": 19, "writer": 19, "searchdata": 19, "sam56903": 19, "subindex": 19, "idlist": 19, "pnwriterid": 19, "kwang": 19, "su": 19, "mun": [19, 21], "5\uc7a5\uc73c\ub85c": 19, "\uac1c\ub150": [19, 24], "\ucf58\uc149\ud2b8": 19, "concept": [19, 26], "\ubf51\uc544\ub0b4\ub294": 19, "\uc790\uc5f0\uc5b4\ub97c": 19, "creation\uc5d0": 19, "\uc804\ub840\uc5c6\ub294": 19, "\uc790\uc720\ub3c4\ub97c": 19, "contept\ub97c": 19, "\uadf8\uac83\uc758": 19, "\ubc14\uafb8\uac70\ub098": 19, "\uc5ed\ud560\uc774": 19, "\uc8fc\uc5b4\uc9c0\uac70\ub098": 19, "\ucc38\uc2e0\ud55c": 19, "\uc7a5\uba74\uc774": 19, "\uadf8\ub824\uc9c0\ub294\uac74": 19, "\ubd88\ubd84\uba85\ud558\ub2e4": 19, "\uc774\uac83\uc744": 19, "\uadf8\ub824\uc918": 19, "\ub9d0\ud560": 19, "\uc774\uac83": 19, "\uac83\uc774\ub0d0\ub294": 19, "\ubb3c\uc74c\uc5d0\ub294": 19, "\uac19\ub2e4": [9, 19, 23, 26], "5\uac1c\ub9cc\uc73c\ub85c": 19, "\uc0ac\ubb3c\uc774\ub098": 19, "\uc790\uc5f0\uc5b4": 19, "\ubb38\uc7a5\uc5d0": [19, 21], "\ub179\uc544\ub4e4\uc5b4\uac00": 19, "\uc9c1\uad00\uc801\uc778": 19, "\uc774\ub04c\uc5b4": 19, "\ub3c5\uc790\uc801\uc774\uba74\uc11c": 19, "\ucf58\uc149\ud2b8\ub97c": 19, "capture\ud558\uae30": 19, "\uc704\ud574\uc11c\ub294": [2, 19, 20, 22, 28], "\ucda9\ubd84\ud558\ub2e4\ub294": 19, "\uc54c\uac8c": 19, "\ub418\uc5c8\ub2e4": 19, "\ub300\uaddc\ubaa8": [9, 19], "\uac1c\ub150\uc744": 19, "\ub3c4\uc785\ud558\ub294": [19, 21], "\uc77c\uc740": 19, "\uc77c\uc774\ub2e4": 19, "\uac1c\ub150\uc5d0": 19, "\ud655\uc7a5\ub41c": 19, "retraining\ud558\ub294": 19, "\uc5c4\uccad\ub098\uac8c": 19, "\ube44\uc6a9\uc774": 19, "\ub4e4\uace0": 19, "\uc608\uc81c\uc5d0": 19, "\uce58\uba85\uc801\uc778": 19, "\ub9dd\uac01\uc744": 19, "\ucd08\ub798\ud55c\ub2e4": 19, "\uacf5\uac04\uc5d0\uc11c": 19, "\ub2e8\uc5b4\ub97c": 19, "\uadf9\ubcf5\ud560": 19, "figure\uc5d0\uc11c": 19, "\uc9c0\ub098\uba74\uc11c": 19, "508": 19, "701": 19, "set\uc73c\ub85c": [19, 22], "\ubcc0\ud658\ub418\uace0": 19, "\ud1a0\ud070\uc740": [19, 22], "\uc790\uccb4": 19, "\ubca1\ud130\ub294": 19, "\ub2e4\uc6b4\uc2a4\ud2b8\ub9bc": 19, "\uc81c\uacf5\ub428": 19, "concept\ub97c": 19, "\ub098\ud0c0\ub0b4\ub294": [19, 21], "word\uc778": 19, "\ub098\ud0c0\ub0b8\ub2e4": [2, 19], "vector\ub294": 19, "\ub2e8\uc5b4\uc640": 19, "\ucc98\ub9ac\ub418\uba70": 19, "query\ub97c": 19, "\uad6c\uc131\ud558\ub294\ub370": 19, "query\ub294": 19, "\uc758\ub3c4\ud55c\ubc14\uc640": 19, "\uc77c\uce58\ud558\ub3c4\ub85d": 19, "\uadf8\ub9bc\uc774\ub77c\uace0": 19, "\uc0dd\uc131\ubaa8\ub378": 19, "ldm\uc774": 19, "\uc4f0\uc784": 19, "untouched\ub418\uc5b4": 19, "\ub530\ub85c": [19, 21], "\ub4e4\uc5b4\uac00\uc9c0": 19, "\uc54a\ub294\ub4ef\ud568": 19, "\ud568\uc73c\ub85c\uc368": [2, 19, 27], "\uc190\uc2e4\ub418\ub294": 19, "text\uc5d0": [19, 22], "\uc774\ud574\ub3c4\ub098": 19, "generalization\uc744": 19, "\uc720\uc0ac\ub2e8\uc5b4": 19, "\ucc3e\uae30": 19, "inversion\uc2dc\ucf1c": 19, "\ud504\ub808\uc784\ud654": 19, "5\uac1c\uc758": 19, "set\uc774": [19, 21], "\uc8fc\uc5b4\uc9c4\ub2e4": 19, "\ubb38\uc7a5\uc744": 19, "\uc124\uc815\ud574": [19, 20], "\uc7ac\uad6c\uc131": 19, "\uc774\uc5b4\uc9c0\ub294": 19, "\ucc3e\ub294": [19, 21], "concept\uc778": 19, "\ud55c\ub2e4\uace0": 19, "found": 19, "palavra": 19, "\ubc14\uafb8\ub294\ub370": 19, "\ucd94\uc815": 19, "object\uc758": 19, "\ubcf5\uad6c": 19, "segmentation\uc744": 19, "palavra\ub294": 19, "\uac1c\uccb4\ub97c": 19, "\ucc38\uc870\ud558\ub294": 19, "clip\uc758": 19, "word\ub97c": 19, "\uc2dd\ubcc4\ud568": 19, "\uac80\uc0c9\uc744": 19, "\uc124\uba85\ud558\uac70\ub098": 19, "\uc7a5\uba74\uc5d0\uc11c": 19, "\ubd84\ud560\ud558\uae30": 19, "\uc0ac\uc6a9\ub428": 19, "\ubcf4\ub4ef\uc774": 19, "\uadf8\ub7f4\ub4ef\ud55c": 19, "\ud569\uc131\uc5d0": [19, 26], "\ucea1\ucc98\ud558\uc9c0": 19, "goal": 19, "specifi": 19, "\uc758\uc5ed": 19, "\uc758\ub3c4\ud55c": 19, "\ucd08\ucca8\uc744": 19, "\ub9de\ucd98": 19, "embedding\uc73c\ub85c": 19, "\uac00\uc774\ub4dc\ud574\uc11c": 19, "\uad1c\ucc2e\uc740": 19, "\uc131\uacfc\ubb3c\uc744": 19, "representation\uc73c\ub85c": 19, "\uc778\ucf54\ub529\ud558\ub294\ub370": 19, "\ucd08\uc810\uc744": [19, 21, 29], "\ub9de\ucda4": 19, "model\uc5d0\uc11c\ub294": 19, "representation\uc5d0": 19, "\ud6c4\ubcf4\uad70\uc744": 19, "\ucc3e\ub294\ub2e4": 19, "\uadf8\ub7ec\ub098": 19, "depth": [19, 21, 23, 28], "visual": [19, 26, 28], "understanding\uc744": 19, "\ud544\uc694\ub85c": 19, "\uc54a\ub294\ub2e4": [19, 23], "\uc0dd\uc131\uc790\uac00": 19, "\uadf8\ub9b0\ub2e4": 19, "inversion\uc5d0\uc11c": 19, "\uc601\uac10\uc744": 19, "\uc81c\uc2dc": [19, 20, 23], "\ucd9c\ucc98": [19, 21], "hyoseok": 19, "tistori": [2, 19], "entri": 19, "vector\ub97c": 19, "vector\ub85c\ubd80\ud130": 19, "\uc774\uc758": 19, "\uc5ed\uacfc\uc815\uc73c\ub85c\uc368": 19, "gan\uc758": [19, 21], "inverting\uc2dc\ucf1c": 19, "\uc54c\uc544\uac00\ub294": 19, "\uc0dd\uc131\ubaa8\ub378\ub85c\uc11c": 19, "\ub9d0\ud588\ub4ef\uc774": 19, "\uac74\ub4e4\uc9c0": 19, "\uc785\ub825\ub41c": [19, 27], "\ubb38\uc790\uc5f4\uc758": 19, "\ud558\uc704": 19, "\ub2e8\uc5b4\ub294": 19, "\ud1b5\uacfc\ud558\uba70": 19, "\uc815\uc758\ub41c": 19, "dictionary\uc5d0\uc11c": 19, "token\uc73c\ub85c": [9, 19], "\ubcc0\ud658\ud568": 19, "\ucc3e\uc744": [19, 20], "\uace0\uc720\ud55c": 19, "\ubca1\ud130\uc5d0": 19, "\uc5f0\uacb0\ub428": 19, "index\uc5d0": 19, "encoder\uc778": 19, "c_\u03b8\uc758": 19, "\uc77c\ubd80\ub85c": 19, "target\uc73c\ub85c": 19, "\uc0bc\uc558\uc74c": 19, "\ub098\ud0c0\ub0b4\uae30": 19, "\uc790\ub9ac\ud45c\uc2dc\uc790": 19, "\ubb38\uc790\uc5f4\uc778": 19, "\uc9c0\uc815\ud568": 19, "palavra\ub97c": 19, "\ucd94\uc815\ud568": 19, "process\uc5d0": 19, "\uac1c\uc785\ud574\uc11c": 19, "tokenize\ub41c": 19, "\ubb38\uc790\uc5f4\uacfc": 19, "\ub300\uccb4\ud558\uc5ec": 19, "\ubcf8\uc9c8\uc801\uc73c\ub85c": 19, "\uc5b4\ud718": 19, "\uc8fc\uc785\ud568": 19, "5\uc7a5": 19, "\ud3ec\uc988\uc640": 19, "\uc124\uc815\uc5d0": 19, "\uac78\uccd0": 19, "\ubb18\uc0ac\ud568": 19, "\ucd5c\uc18c\ud654\ud558\ub294": [19, 25, 28], "v\ub97c": 19, "\ucd5c\uc801\ud654\ud568": 19, "\uace0\uc815\ud558\uae30": 19, "\ud15c\ud50c\ub9bf\uc5d0\uc11c": 19, "\ud30c\uc0dd\ub41c": 19, "\uc911\ub9bd": 19, "\ucee8\ud14d\uc2a4\ud2b8": 19, "\uc5ec\uae30\uc5d0\ub294": 19, "rendit": [19, 24], "\ud615\uc2dd": 19, "\ud504\ub86c\ud504\ud2b8\uac00": 19, "\ud3ec\ud568\ub41c\ub2e4": 19, "\uc544\ub9c8": [19, 26], "\uc6d0\ubcf8\uacfc": 19, "\ube44\uad50\ud558\uae30": 19, "\ubaa9\uc801\uc774": 19, "\uc544\ub2d0\uae4c": 19, "\uc2f6\uc74c": 19, "\ubaa9\ud45c\uc2dd\uc740": 19, "\uac19\uc74c": [19, 21], "loss\ud568\uc218\uc640": 19, "\uc720\uc0ac\ud568": 19, "c\u03b8\uc640": 19, "e\u03b8\ub294": 19, "\ubbf8\uc138\ud55c": 19, "\ud3ec\ucc29\ud560": 19, "\uc788\uc744\uac83\uc73c\ub85c": 19, "\uae30\ub300\ud568": 19, "\ud3ec\ucc29\ud558\ub294": 19, "\uc720\uc0ac\ud558\uba74\uc11c\ub3c4": 19, "guide\uc5d0": 19, "\ub9de\ucdb0\uc11c": 19, "\uc9c4\ud589\ud568": 19, "\uc8fc\uc81c\uc5d0": 19, "\uc815\ud655\ud558\uac8c": 19, "\ubcf4\uc874\ud558\uace0": 19, "\uc784\ubca0\ub529\uacfc": 19, "\ucea1\uc158\ub4e4\uc5d0": 19, "\ucd94\ub860\uc774": 19, "\uac00\ub2a5\ud588\uc74c": 19, "\ub370\uc774\ud130\uc14b\uc73c\ub85c\ub3c4": 19, "\ubcf4\uc874\ud558\uba74\uc11c": 19, "\ud45c\ud604\ud55c": 19, "\uc0ac\uc9c4\uc5d0\uc11c\uc640": 19, "\uc758\uc0ac": 19, "\ubc31\uc778": 19, "\ub0a8\uc131": 19, "\uc758\uc0ac\ub97c": 19, "\uadf8\ub824\ub0c8\uc74c": 19, "\ub9ce\uc558\uc74c\uc744": 19, "imageset\uc5d0\uc11c": 19, "\uc778\uc885\uc801": 19, "\ub2e4\uc591\uc131\uc5d0": 19, "\uc778\uc2dd\uc744": 19, "embedding\uc758": 19, "y\ucd95": 19, "\ubcf5\uc81c\ud558\ub294\uc9c0": 19, "\ubcc0\ud615\uc744": [9, 19], "\uc0dd\uc131\ud558\ubbc0\ub85c": 19, "\uac70\ub9ac\ub97c": 19, "\uace0\ub824\ud558\uc5ec": 19, "\uc720\uc0ac\uc131\uc744": 19, "\ucee8\uc149\uc5d0": 19, "64\uac1c\uc758": 19, "x\ucd95": 19, "\ub09c\uc774\ub3c4\uc640": 19, "\uc124\uc815\uc758": 19, "\uc77c\ub828\uc758": 19, "\ubcc4\ub85c": 19, "prompt\uc758": 19, "embedding\uc5d0\uc11c": 19, "similarity\ub97c": 19, "\uc2a4\ucf54\uc5b4\ub294": 19, "capability\uc640": 19, "\uc2e0\ub8b0\ub3c4\ub97c": 19, "\ubcf4\uc5ec\uc90c": [19, 21, 26], "\ud658\uacbd": 19, "\ub530\ub984": 19, "\uc0dd\ub7b5": 19, "evaluation1": 19, "baseline\uacfc": 19, "quality\ub294": 19, "set\uc5d0\uc11c": 19, "\uc784\uc758\uc758": [9, 19], "\uc0d8\ud50c\ub9c1\ud558\ub294": 19, "\uc5c6\uc5c8\ub2e4": [19, 21], "\ub2ec\uc131\ud558\uace0": 19, "baseline\uc5d0\uc11c": 19, "editablity\uc744": 19, "space\uc758": 19, "\uc778\uc0c1\uc801\uc778": [19, 21, 28], "\uc720\uc5f0\uc131\uc744": 19, "\ub098\ud0c0\ub0b4\uace0": 19, "\ub2e8\uc77c": [9, 19, 21], "word\ub9cc": 19, "\uc815\ud655\ub3c4\ub85c": 19, "\ucea1\ucc98\ud558\ub294\ub370": 19, "distort": 19, "tradeoff": 19, "\uace1\uc120\uc758": 19, "outline\uc744": 19, "\uadf8\ub9ac\uba70": 19, "\uc218\uc815\ub420": 19, "target\uc758": 19, "\ucea1\ucc98\ud558\uc9c0\ub294": 19, "\ubc18\ub300\ub85c": 19, "\uba40\ub9ac": 19, "\ubc97\uc5b4\ub098\uba74": 19, "editability\uac00": 19, "\uac10\uc18c\ud558\ub294": 19, "reconstruction\uc774": 19, "rate\ub97c": [19, 21], "\ubcc0\uacbd\ud574": 19, "\uace1\uc120\uc744": 19, "\uc774\ub3d9\ud560": 19, "\uc788\uc73c\ubbc0\ub85c": 19, "\uc0ac\uc6a9\uc790\uc5d0\uac8c": 19, "tradeoff\uc5d0": 19, "\uc815\ub3c4\uc758": 19, "\uc81c\uc5b4\ub97c": 19, "\uc81c\uacf5\ud568": 19, "description\uc744": 19, "\ud3ec\ucc29\ud558\uc9c0": 19, "\ubabb\ud558\uba74\uc11c\ub3c4": 19, "\uac10\uc18c\ud568": 19, "\uc124\ubb38\uc9c0": 19, "\uc81c\uacf5\ubc1b\uc558\uace0": 19, "\uc774\ubbf8\uc9c0\uc640\uc758": [19, 24], "\uc720\uc0ac\uc131\uc5d0": 19, "\uc21c\uc704\ub97c": 19, "\ub9e4\uae40": 19, "context\ub97c": [9, 19], "\uc9c8\ubb38\ubcc4\ub85c": 19, "600\uac1c\uc529": 19, "200\uac1c\uc758": 19, "\uc751\ub2f5\uc744": 19, "\uc218\uc9d1": [19, 27], "\uc81c\uacf5\ud558\uc9c0\ub9cc": 19, "\uc758\ubbf8\ub860\uc801\uc778": 19, "\ubcf8\uc9c8\uc744": 19, "\ud30c\uc545\ud558\uac70\ub098": 19, "shape\ub97c": 19, "\ud55c\uacc4": 19, "\ucd5c\uc801\ud654\uac00": 19, "\uc624\ub798": 19, "\uac78\ub9b0\ub2e4": [19, 20], "2\uc2dc\uac04\uc774": 19, "\uc18c\uc694\ub428": 19, "\uc124\uc815\uacfc": 19, "\ud65c\uc6a9\ud558\ub294": 19, "\uac1c\uc778\ud654\ub418\uba70": 19, "generation\uc744": 19, "\uc18c\uac1c\ud568": 19, "word\ub85c": 19, "inverse\ud558\uc5ec": 19, "\uc791\ub3d9\ud568": 19, "word\ub294": 19, "\uc7a5\uba74\uc5d0": 19, "\uac04\ub2e8\ud558\uace0": 19, "\uc758\ubbf8\uc5d0\uc11c": 19, "\ud3b8\uc9d1\ud558\uae30": 19, "\uc27d\ub3c4\ub85d": 19, "interpace\ub97c": 19, "\uc0ac\uc6a9\ud558\uc9c0\ub9cc": 19, "\uc790\uc5f0": 19, "\uc5b8\uc5b4\uc758": 19, "\ud55c\uacc4\uc5d0": 19, "\uc811\uadfc\ud560": 19, "\ub2e8\uc11c\ub97c": 19, "\uacf5\uac1c\uc801\uc73c\ub85c": 19, "\uc0ac\uc6a9\uac00\ub2a5\ud55c": 19, "model\uc778": 19, "\uad6c\ud604\ub428": 19, "\uc544\ud0a4\ud14d\ucc98": 19, "\uc815\ubcf4\uc5d0": 19, "\uc758\uc874\ud558\uc9c0": 19, "\uc801\uc6a9\ud560": 19, "\uc0dd\uac01": 19, "\uac70\uae30\uc5d0\uc11c": 19, "preserav": 19, "\ud5a5\uc0c1\ub420": 19, "unpair": 21, "translat": [2, 21], "iccv": [20, 21], "2017": 21, "1703": 21, "10593": 21, "tensorflow": 21, "tutori": 21, "\ub17c\ubb38\ub9ac\ubdf0": 21, "cyclegan\uc744": 21, "\uc0ac\ub78c\uc774": [21, 26, 27], "\ud55c\uad6d\uc778\uc774\ub77c\uace0": 21, "\ub72f\uc5b4\ubcf4\uae30": 21, "kwangsu": 21, "\ub3c4\uba54\uc778\uc744": 21, "\ub3c4\uba54\uc778\uc73c\ub85c": 21, "\ubcc0\ud658\uc2dc\ud0a4\ub294": 21, "vision\uc758": 21, "translation\uc740": 21, "input\uacfc": 21, "\uc9dd\uc774": 21, "\uc9c0\uc5b4\uc9c4": 21, "\uc5bb\ub294": [21, 26], "\uc5b4\ub835\uc2b5\ub2c8\ub2e4": 21, "\uc9dd\uc9c0\uc5b4\uc9c4": 21, "x\ub77c\ub294": 21, "domain\uc73c\ub85c\ubd80\ud130": 21, "\uc5bb\uc740": 21, "domain": 21, "y\ub85c": 21, "\ubc14\uafb8\ub294": [21, 24], "\uc5f0\uad6c\ub294": 21, "\ubd84\ud3ec\uc640": 21, "y\ub85c\ubd80\ud130\uc758": 21, "\uad6c\ubd84\uc774": 21, "\ubd88\uac00\ub2a5\ud558\ub3c4\ub85d": 21, "y\ub85c\uc758": 21, "mapping\uc5d0": 21, "\uc81c\uc57d\uc744": 21, "\uac00\ud574\uc11c": 21, "\uac15\uc81c\ud558\uae30": 21, "\uc5ed\ubc29\ud5a5": 21, "\ub9e4\ud551\uc744": 21, "\uc9c4\ud589\ud558\uace0": 21, "\uc720\uc0ac\ud574\uc9c0\ub3c4\ub85d": 21, "\uac15\uc81c\ud558\ub294": 21, "\ub3c4\uc785\ud588\uc2b5\ub2c8\ub2e4": 21, "pair\uac00": 21, "\ubcf4\uc5ec\uc92c\ub2e4\uace0": 21, "image\ub85c": [9, 21], "\uadf8\ub9bc\uc73c\ub85c": 21, "\ubcc0\ud658\ud55c\ub2e4\uac70\ub098": 21, "\ub0ae\uc5d0": 21, "\ucc0d\uc740": 21, "\ubc24\uc5d0": 21, "\ud754\ud788": 21, "output\uc73c\ub85c": 21, "\ubc14\ud0d5\uc73c\ub85c": 21, "\uc774\ub8e8\uc5b4\uc838": [2, 9, 21, 29], "\uc788\uc5c8\ub294\ub370\uc694": 21, "\uc5b4\ub835\uace0": 21, "\ube44\uc2fc": 21, "\uc77c\uc774": 21, "image\uac00": [9, 21], "\uc77c\ub300\uc77c\ub85c": 21, "\uc9dd\uc9c0\uc5b4\uc9c0\uc9c0": 21, "\ubaa8\uc74c\uc758": 21, "\ucea1\uccd0\ud558\uace0": 21, "\ubaa8\uc74c\uc73c\ub85c": 21, "\ubcc0\ud658\ud560": 21, "x\uc5d0": 21, "\uc138\ud2b8": 21, "y\uc5d0": 21, "\uc81c\uacf5\ub418\uace0": 21, "output\uacfc": [21, 23], "y\uac00": 21, "discriminator\uc5d0": 21, "\uad6c\ubcc4\ud560": 21, "\uc5c6\ub3c4\ub85d": 21, "y\ub97c": [21, 23], "\ud559\uc2b5\ud569\ub2c8\ub2e4": [21, 28], "\uc774\uac8c": 21, "\uac1c\ubcc4": 21, "\ubb34\uc870\uac74": 21, "\uc720\uc758\ubbf8\ud558\uac8c": 21, "\uc30d\uc744": 21, "\uc774\ub8ec\ub2e4\ub294": 21, "\ub73b\ud558\uc9c0\ub294": 21, "g\uac00": 21, "image\uc5d0\ub294": 21, "\ubb34\ud55c\ud55c": 21, "\uc218\uac00": [2, 21, 22], "\ub54c\ubb38": [20, 21], "collapse\uac00": 21, "\uc77c\uc5b4\ub098\uae30\ub3c4": 21, "dl": 21, "blogspot": 21, "08": [2, 21], "problem": 21, "image\ub4e0": 21, "\ub9e4\ud551\ud558\uba74\uc11c": 21, "\ucd5c\uc801\ud654\uc5d0": 21, "\uc2e4\ud328\ud558\ub294": 21, "\ud604\uc0c1": [2, 21, 24], "\ud604\uc0c1\uc740": 21, "\uc785\uc7a5\uc5d0\uc11c": 21, "discriminator\uac00": 21, "\uc0ac\uc9c4\uc774": [21, 23], "\uc9c4\uc9dc": 21, "y\uc778\uc9c0": 21, "\uac00\uc9dc\uc778": 21, "\uc778\uc9c0": 21, "\uad6c\ubcc4\ud558\ub294": 21, "\uc18d\uc774\uae30\ub9cc": 21, "\uc6b0\ub9ac\uc758": 21, "\ubaa9\uc801\uacfc": 21, "\uc0c1\uad00\uc774": 21, "\ub9cc\ub4e4\ub354\ub77c\ub3c4": 21, "\uc0dd\uae30\uc9c0": 21, "\uc54a\uc544\uc11c": 21, "\ubc1c\uc0dd\ud568": 21, "\uc774\uc288\ub85c": 21, "\ud544\uc694\ud574": 21, "\uc84c\uc2b5\ub2c8\ub2e4": 21, "task\ub294": 21, "\uc601\uc5b4": 21, "\ud504\ub791\uc2a4\uc5b4": 21, "\uc601\uc5b4\ub85c": 21, "\ubc88\uc5ed\ud588\uc744": 21, "\ub3c4\ub2ec\ud558\ub294": 21, "\uac83\ucc98\ub7fc": 21, "\ub3cc\uc544\uac00\ub294": 21, "\uac19\uc544\uc57c": 21, "\ud55c\ub2e4\ub294": 21, "\uc758\ubbf8\uc758": 21, "cyclic": 21, "consistency\uc774\ub77c\ub294": 21, "\uc18d\uc131\uc744": 21, "\uc774\uc6a9\ud569\ub2c8\ub2e4": 21, "\ubaa9\uc801\uc2dd\uc744": 21, "\uc815\ub9ac\ud558\uba74": 21, "\uc815\ubc29\ud5a5": 21, "\ub17c\ubb38\uacfc": 21, "\uc5f0\uad6c\uc5d0": 21, "\ub0b4\uc6a9\uc774\uc5c8\uc74c": 21, "\uac1c\ub150\ub4e4\uc740": 21, "introduction\uc5d0\uc11c": 21, "\uc124\uba85\ud588\uace0": 21, "\uc2a4\ud130\ub514\uc640\ub294": 21, "\uad00\ub828\uc774": 21, "\uc2a4\ud0b5\ud588\uc74c": 21, "\ub3c4\uc2dd\ud654": 21, "\uc790\ub8cc": 21, "mapping\ud558\ub294": 21, "function\uc744": [21, 22], "\uc6a9\uc5b4": 21, "\uc815\ub9ac": 21, "pdata": 21, "\ud45c\uc2dc": 21, "dx": [21, 25], "dy\ub294": 21, "dx\ub294": 21, "\uad6c\ubd84": 21, "y\uc640": 21, "\ubaa9\uc801\uc2dd\uc740": 21, "\ub450\uac1c": 21, "domain\uc758": 21, "distribution\uacfc": 21, "\uc77c\uce58\uc2dc\ud0a4\uae30": 21, "g\uc640": 21, "f\uac00": 21, "\ubaa8\uc21c\ub418\ub294": 21, "\ubc29\uc9c0\ud558\uae30": 21, "dy\uc5d0": 21, "l_gan": 21, "gan\uc5d0\uc11c": 21, "\ub300\uc2e0\uc5d0": 21, "\uac08": [21, 23], "x\ub85c": 21, "\uc218\uc2dd\uc774": 21, "\ub098\uc624\uba70": 21, "dx\uc5d0": 21, "dx\ub97c": 21, "\ub123\uc740": 21, "\uc55e\uc11c": 21, "\ub9d0\ud588\ub4ef": 21, "\uc81c\ud55c\uc744": 21, "\ub450\uc5b4": 21, "\uc218\uc2dd\uc73c\ub85c\uc11c": 21, "\uc608\ube44": 21, "\uc2e4\ud5d8\uc5d0\uc11c": [9, 21], "l1": [20, 21], "loss\ub85c": 21, "\ub300\uccb4\ud574\ubd24\ub294\ub370": 21, "\ud5a5\uc0c1\uc744": [21, 23], "\uad00\ucc30\ud560": 21, "\uc5c6\uc5c8\uc74c": 21, "\uc720\ub3c4\ub41c": 21, "loss\uc640\uc758": 21, "\uc0c1\ub300\uc801": 21, "\uc911\uc694\ub3c4\uc5d0": 21, "\uacb0\uc815\ub428": 21, "architecture\ub85c\uc11c": 21, "transfer\uc640": 21, "\ubcf4\uc5ec\uc900": [21, 28], "\ucc44\ud0dd\ud568": [21, 23], "3\uac1c\uc758": 21, "sever": 21, "block": [21, 23, 25, 28], "fraction": 21, "stride": 21, "feature\ub97c": 21, "rgb\ub85c": 21, "\ub9e4\ud551\ud558\ub294": 21, "\uc548\uc815\ud654\uc2dc\ud0a4\uae30": 21, "\ud14c\ud06c\ub2c9\uc744": 21, "function\uc5d0\uc11c": 21, "\ubcc0\uacbd": 21, "50\uac1c\ub97c": 21, "\uc800\uc7a5\ud574": 21, "\ud55c\uaebc\ubc88\uc5d0": 21, "\uc9c4\ub3d9\uc744": 21, "sjinu": 21, "ysbsb": 21, "02": 21, "lsgan": 21, "generator\uc758": 21, "\uc5c5\ub370\uc774\ud2b8\ub97c": 21, "lsgan\uc744": 21, "\uc774\ud574\ub294": 21, "\ubabb\ud588\uace0": 21, "\uc774\ub7f0\uac8c": 21, "\uc788\uad6c\ub098": 21, "\uc815\ub3c4\ub85c\ub9cc": 21, "discriminator\ub294": 21, "\uc774\ubcf4\ub2e4": 21, "\uace0\ucc28\uc6d0\uc774\uc9c0\ub9cc": 21, "\uac04\ub7b5\ud788": 21, "2\ucc28\uc6d0\uc744": 21, "\ud45c\ubc29\ud558\uba74": 21, "\uacb0\uc815\uacbd\uacc4\ub97c": 21, "\ub098\ud0c0\ub0bc": [2, 21], "\ucabd\uc774": 21, "\uac00\uc9dc": [21, 25], "\uc601\uc5ed": 21, "\uc601\uc5ed\uc785\ub2c8\ub2e4": 21, "\uc544\ub798\uc5d0": 21, "\uac70\ub9ac\uac00": 21, "\uba3c": 21, "\uc0ac\uc6a9\ud55c\ub2e4\uba74": 21, "\uc785\uc7a5\uc5d0\uc11c\ub294": 21, "discriminator\ub97c": 21, "\uc18d\uc774\uace0": 21, "vanish": [21, 25], "\uc77c\uc5b4\ub098\uae30": 21, "\uc18d\uc778\ub2e4\ub294": 21, "\uc774\uc720\ub9cc\uc73c\ub85c": 21, "\ud328\ub110\ud2f0\ub97c": 21, "\uc5c6\uac8c": 21, "ls": 21, "generator\ub294": 21, "\uc18d\uc774\ub294": 21, "\ub118\uc5b4\uc11c": 21, "\uac00\uc9c0\uac8c\ub054": 21, "\ud574\uc57c\ud569\ub2c8\ub2e4": 21, "\ub78c\ub2e4\ub97c": 21, "10\uc73c\ub85c": 21, "\uc544\ub2f4\uc744": 21, "\ub124\ud2b8\uc6cc\ud06c\ub294": 21, "\uc5d0\ud3ec\ud06c": 21, "\ub3d9\uc548\uc5d0\ub294": 21, "ln\uc744": 21, "\uc5d0\ud3ec\ud06c\ub9c8\ub2e4": 21, "\uc870\uae08\uc2dd": 21, "\uc218\ub834\ud558\uac8c": 21, "amt": 21, "\ucc38\uac00\uc790\ub4e4\uc740": 21, "\uc0ac\uc9c4\uc774\ubbf8\uc9c0": 21, "\uc9c0\ub3c4": 21, "\uac00\uc9dc\uc774\ubbf8\uc9c0\uc5d0": 21, "\ub178\ucd9c\ub41c": 21, "\uc9c4\uc9dc\ub77c\uace0": 21, "\uc0dd\uac01\ub418\ub294": 21, "\uc120\ud0dd\ud558\uac8c": 21, "1\ubc88": 21, "study\uac00": 21, "\ud14c\uc2a4\ud2b8\uc5d0": 21, "\uc788\uc5b4": [9, 21], "\uae30\uc900\uc784\uc5d0\ub3c4": 21, "\uc2e4\ud5d8\uc774": 21, "\uc591\uc801\uc778": 21, "\uae30\uc900\uc744": 21, "\ucc3e\uc558\ub294\ub370": 21, "score\uc784": 21, "fcn\uc740": 21, "\uc0ac\uc9c4\uc5d0": 21, "\ub808\uc774\ube14": 21, "\ub9f5\uc744": 21, "\ub9f5\uc740": 21, "\ubd84\ud560": 21, "\uba54\ud2b8\ub9ad\uc744": 21, "label\uacfc": 21, "\ube44\uad50\ud560": 21, "\ub3c4\ub85c": 21, "\uc0c1\uc758": 21, "\uc790\ub3d9\ucc28": 21, "label\uc5d0\uc11c": 21, "fcn\uc774": 21, "\uac10\uc9c0\ud558\uba74": 21, "\uc131\uacf5\ud55c": 21, "\ub77c\ubca8\ub9c1": 21, "pixel\ub2f9": 21, "\uc815\ud655\ub3c4": 21, "\ub2f9": 21, "iou": 21, "intersect": 21, "union": 21, "cityscap": 21, "benchmark\uc758": 21, "cogan": 21, "simgan": 21, "pix2pix": 21, "aginst": 21, "6\uc5d0\uc11c": 21, "baseline\uc5d0\uc11c\ub3c4": 21, "\uac15\ub825\ud55c": [9, 21], "\ubc18\uba74\uc5d0": [21, 25], "cyclegan\uc740": 21, "supervise\uc778": 21, "pix2pix\uc640": 21, "translation\uc744": 21, "realism": 21, "\uc9c0\ub3c4\uc5d0\uc11c": 21, "\ud56d\uacf5": 21, "\uc0ac\uc9c4\uc5d0\uc11c": 21, "\ubaa8\ub450\uc5d0\uc11c": 21, "4\uc758": 21, "\ucc38\uac00\uc790\ub97c": 21, "\uc18d\uc77c": 21, "baseline\uc740": 21, "\ub3c4\uc2dc": 21, "\ud48d\uacbd\uc5d0": 21, "\ud3c9\uac00\ud558\uace0": 21, "3\uc740": 21, "\ud3c9\uac00\ud568": [21, 26], "cyclegan\uc774": 21, "baseline\ub4e4\uc758": 21, "\ub2a5\uac00\ud55c\ub2e4": 21, "consistency\uc758": 21, "\ubcf4\uc5ec\uc8fc\ub294": [21, 22, 24, 28], "\uc5c6\uc560\uba74": 21, "cycle\uc744": 21, "\uc81c\uac70\ud558\ub294": 21, "\uc800\ud558\ub428": 21, "\uacb0\ub860\uc744": 21, "\ub0b4\ub9b4": 21, "\ubc29\ud5a5\uc5d0\uc11c\ub9cc": 21, "\uba54\uc18c\ub4dc\ub97c": 21, "cycle\ub9cc": 21, "\ub3cc\ub838\uc744": 21, "backward": [21, 24, 25], "\uc774\ub530\uae08\uc529": 21, "\ubcf4\uc774\uace0": [21, 22], "collapse\ub97c": 21, "\uc720\ubc1c\ud558\ub294": 21, "\ubc1c\uacac\ud568": 21, "\uc81c\uac70\ub41c": [9, 21], "\ub9e4\ud551\uc758": 21, "\ubc29\ud5a5\uc5d0": 21, "7\uc744": 21, "\uc787\uc5c8\uc74c": 21, "\uc7ac\uad6c\uc131\ub41c": 21, "\uc0ac\uc9c4\uacfc": 21, "\ub3c4\uba54\uc778\uc774": 21, "\uacbd\uc6b0\uc5d0\ub3c4": 21, "\ud14c\uc2a4\ud2b8": 21, "\ub9ce\uc558\uc74c": 21, "8\uc740": 21, "cmp": 21, "fa\u00e7ad": 21, "database\uc758": 21, "\uac74\ucd95": 21, "ut": 21, "zapoos50k": 21, "dataset\uc758": 21, "\uc2e0\ubc1c\uacfc": 21, "pix2pix\uc5d0": 21, "cyclegan\uc758": 21, "\ud488\uc9c8\uc740": 21, "\ub300\uc758": 21, "\uc9f1\uc774\ub2e4": 21, "\ub9ce\uc544": 21, "\uc0dd\ub7b5\ud558\uaca0\uc2b5\ub2c8\ub2e4": 21, "\u3160": 21, "data\uac00": 21, "data\uc5d0\uc11c": 21, "transslation\uc774": 21, "\ud55c\uac83\ubcf4\ub2e4": 21, "\ub9e4\ub825\uc801\uc774\ub2e4": 21, "application\uc740": 21, "\uc6f9\uc0ac\uc774\ud2b8\uc5d0": 21, "\uc2e0\uacbd": 21, "\uc804\ub2ec": 21, "\uc791\uc5c5\uacfc": 21, "\uc120\ud0dd\ud55c": 21, "\uc608\uc220": 21, "\uc791\ud488\uc758": 21, "\uc804\ub2ec\ud558\ub294": 21, "\uc791\ud488": 21, "\uceec\ub809\uc158\uc758": 21, "\ubaa8\ubc29\ud558\ub294": 21, "\ubcc4\uc774": 21, "\ube5b\ub098\ub294": 21, "\uadf8\ub9ac\ub294": 21, "\ubc18": 21, "\uace0\ud750": 21, "\ub530\ub77c\ud558\ub294": 21, "\ub290\ub08c\uc744": 21, "\ub530\ub77c\ud55c\ub2e4": 21, "turmukhambetov": 21, "\ubc94\uc8fc\uc758": 21, "\uac1d\uccb4\ub85c": 21, "\uc81c\uc548\ud558\ub294": 21, "\uc2dc\uac01\uc801\uc73c\ub85c": [21, 22], "\ubc94\uc8fc": 21, "\ubcc0\ud615\uc5d0": 21, "\uc911\uc810\uc744": [9, 21, 22], "\ub461\ub2c8\ub2e4": [21, 29], "turn": 21, "hors": 21, "zebra": 21, "\uac04": 21, "\uc0c9": 21, "\uad6c\uc131\uc744": 21, "\ubcf4\uc874\ud558\uae30": 21, "\uc720\uc6a9\ud558\ub2e4\ub294": 21, "\ubc1c\uacac\ud560": 21, "taigman": 21, "49": 21, "\ucc44\ud0dd\ud558\uc5ec": [21, 23], "\uc81c\ub108\ub808\uc774\ud130\uac00": 21, "\ub3c4\uba54\uc778\uc758": 21, "\uc81c\uacf5\ubc1b\uc744": 21, "\uadfc\ucc98\uc5d0": 21, "\uc815\uaddc\ud654\ud569\ub2c8\ub2e4": 21, "lident": 21, "ey_pdata": 21, "lidentity\uac00": 21, "\uc5c6\uc73c\uba74": 21, "\uc0dd\uc131\uc790": 21, "\uad73\uc774": 21, "\uc54a\uc744": [9, 21], "\uc0c9\uc870\ub97c": 21, "\uc790\uc720\ub86d\uac8c": 21, "\ubcc0\uacbd\ud560": 21, "monet\uc758": 21, "flickr": 21, "\uc0dd\uc131\uc790\ub294": 21, "\uadf8\ub9b0": 21, "\uc77c\ubab0": 21, "\uc2dc\uac04\uc5d0": 21, "\ub9e4\ud551\ud569\ub2c8\ub2e4": 21, "\uc801\ub300\uc801": 21, "\uc0ac\uc774\ud074": 21, "\uc77c\uad00\uc131": 21, "\uc190\uc2e4": [21, 22], "\ub9e4\ud551\uc774": 21, "\ub3d9\ub4f1\ud558\uac8c": 21, "\uc720\ud6a8\ud560": 21, "\uc190\uc2e4\uc758": 21, "\ud6a8\uacfc\ub294": 21, "9\uc5d0\uc11c": 21, "\ubcf4\uc5ec\uc9d1\ub2c8\ub2e4": 21, "9\ub294": 21, "\ud3ec\ud568\ub418\uc5b4": 21, "set\uc740": 21, "set\uc73c\ub85c\ubd80\ud130": 21, "\uadf8\ub824\uc9c4": 21, "datqa\ub97c": 21, "\uadf8\ub9bc\uc5d0": 21, "\ud0c0\ub2f9\ud55c": 21, "\uc544\ub2c8\ub2e4": [20, 21], "monet\uc774": 21, "\uc0c8": [21, 22], "\uadf8\ub9b4": 21, "generalization\uc740": 21, "press": 21, "\uc595\uc740": 21, "\uae4a\uc774\uc758": 21, "flickr\uc5d0\uc11c": 21, "\ub2e4\uc6b4\ub85c\ub4dc\ud55c": 21, "\uaf43": 21, "\ud6c8\ub828\ud569\ub2c8\ub2e4": 21, "\uc18c\uc2a4": 21, "\ub3c4\uba54\uc778\uc740": 21, "\uc2a4\ub9c8\ud2b8\ud3f0\uc73c\ub85c": 21, "\ucc0d\ud78c": 21, "\uad6c\uc131\ub418\uc5b4": [21, 28], "\uc870\ub9ac\uac1c\ub85c": 21, "\uae4a\uc740": 21, "dof": 21, "\ucd08\uc810": 21, "\uae4a\uc774": 21, "\ub300\uc0c1\uc740": 21, "\uc870\ub9ac\uac1c\uac00": 21, "dslr\ub85c": 21, "\ucd2c\uc601\ub41c": 21, "\ud3ec\ud568\ud569\ub2c8\ub2e4": 21, "\uc0ac\uc9c4\uc73c\ub85c\ubd80\ud130": 21, "\uc131\uacf5\uc801\uc73c\ub85c": 21, "shallow": 21, "field": [21, 27], "\ucd08\uc810\uc774": 21, "\ub9de\uc740": 21, "\ubc30\uacbd\uc774": 21, "\ud750\ub9bf\ud558\uac8c": 21, "\ud65c\uc6a9": [21, 22, 23], "\uad6c\ubaa9\ud558\uace0\uc790": 21, "\uac15\uc870\ud558\uae30": 21, "domain\uc740": 21, "\uc2a4\ub9c8\ud2b8\ud3f0\uc758": 21, "target\uc740": 21, "discuss": 21, "\ud765\ubbf8\ub85c\uc6b4": [21, 24], "\uade0\uc77c\ud558\uac8c": 21, "\uc544\ub2c8\uc5c8\uc2b5\ub2c8\ub2e4": 21, "\ud574\uc11d": 21, "task\uc640": 21, "\ucd5c\uc18c\ud55c\uc758": 21, "\ubcc0\ud654\ub9cc": 21, "\ubcc0\ud654\uac00": [21, 26], "\uc548\ub418\ub294": 21, "\uc788\uc5c8\uace0": 21, "\ud615\uccb4\uac00": 21, "\uc560\ub9e4\ud574\uc9c4": 21, "\uc774\ub7f0\uac78": 21, "geometri": 21, "\ud45c\ud604\uc744": 21, "\ubcf4\uc544": 21, "\ucf54": 21, "\uc785\uc5d0": 21, "\uad6c\ud604\ud558\ub294\ub370": 21, "\ub9d0": 21, "\uc5bc\ub8e9\ub9d0": 21, "\uc608\uc81c\uc758": 21, "\ub9d0\uc740": [2, 21], "\ud0c0\ub294": 21, "\ub9ce\uc558\ub294\ub370": 21, "\uc5bc\ub8e9\ub9d0\uc758": 21, "\uc5c6\ub2e4\ubcf4\ub2c8": 21, "\ubc30\uacbd\ub3c4": 21, "\uc5bc\ub8e9": 21, "\uadf8\ub9ac\uac70\ub098": 21, "\uc5bc\ub8e9\ub9d0\uc5d0\uc11c": 21, "\ub178\ub797\uac8c": 21, "\uce60\ud55c": 21, "\uc0dd\uae40": 21, "\ub54c\ub54c\ub85c": 21, "\ub098\ubb34\uc640": 21, "\uac74\ubb3c\uc758": 21, "label\uc744": 21, "\ubaa8\ud638\uc131\uc744": 21, "\ud574\uacb0\ud558\ub824\uba74": 21, "weak": 21, "supervision\uc774": 21, "\ub9c8\ubb34\ub9ac": 21, "\ud48d\ubd80\ud558\uac8c": 21, "\uc81c\uacf5\ub418\uba70": 21, "\ud65c\uc6a9\ud574\uc57c": 21, "setting\uc5d0\uc11c": 21, "\uac83\uc758": 21, "\ub298\ub9ac\ub294\ub370": 21, "\uae30\uc5ec\ud569\ub2c8\ub2e4": 21, "12092": 22, "unoffici": 22, "donggeun": [22, 23, 26], "sean": [22, 23, 26], "ko": [22, 23, 26], "june": 22, "22": 22, "\ubaa8\ub378\uc774\uba70": 22, "120\uc5b5\uac1c": 22, "\uc218\uc640": 22, "5\uc5b5": 22, "\ubaa8\ub378\ub9c1\uc744": 22, "\ud1b5\ud558\uc5ec": 22, "2021\ub144": 22, "diverse\ud55c": 22, "3\uc640": 22, "vae\ub97c": 22, "transformer\uc744": 22, "architecture\uc744": [22, 23], "\uad6c\ucd95": [22, 28], "model\uba70": 22, "learning\uc744": [9, 22], "\ub0c4": 22, "\uc218\ub294": 22, "shot\uc744": 22, "\ubd80\ubd84\ub9cc": [22, 23], "1750\uc5b5": 22, "\uac1c\uc218\uc758": 22, "2005": 22, "14165": 22, "jalammar": 22, "how": 22, "gpt3": 22, "encoder\uc5d0\uc11c": 22, "output\uc740": 22, "discret": [2, 22], "categor": 22, "\uac16\ub294\ub2e4\uace0": 22, "cnn": 22, "\uac70\uce5c": [9, 22, 27], "d\ucc28\uc6d0\uc758": 22, "\uc704\uce58\uc5d0": 22, "\uadf8\ub9ac\ub4dc\ub85c": 22, "\ub098\ub204\uace0": 22, "\ud835\udc52_1": 22, "\ud835\udc52_\ud835\udc58": 22, "code\ub85c": 22, "\ubcc0\ud658": 22, "z_": [22, 28], "e_j": 22, "\ucc3e\uc544\uc11c": 22, "\ubd80\uc5ec\ud568": 22, "p2yeong": 22, "explain": 22, "issu": 22, "pixel\uc744": 22, "\uc9c1\uc811\uc801\uc73c\ub85c": 22, "token\uc744": [9, 22], "\uace0\ud654\uc9c8": [22, 23], "\uc774\ubbf8\uc9c0\uc77c\uc218\ub85d": 22, "\uba54\ubaa8\ub9ac\ub7c9\uc774": 22, "\ud544\uc694\ud574\uc11c": 22, "\ube44\ud6a8\uc728\uc801": 22, "short": 22, "depend": [22, 24], "model\ub4e4": 22, "likelihood": [22, 23, 25, 29], "dependency\ub97c": 22, "\uac83\uc774\uba70": 22, "detail\uc5d0": 22, "\uc9d1\uc911\ud558\uac8c": 22, "recognizable\ud574\uc11c": 22, "2\uac00\uc9c0": [22, 26], "\uadf9\ubcf5\ud558\uace0\uc790": 22, "textbf": 22, "rgb": 22, "rightarrow": 22, "\uc555\ucd95": 22, "192\uac1c\uc758": 22, "\uac12": [2, 22], "\uc911\uc5d0": 22, "\ubc30\uc815": 22, "size\ub97c": 22, "bpe": 22, "\ub4e4\uacfc": [22, 26], "\uc5f0\uc18d\uc801\uc73c\ub85c": 22, "\uc785\ub825\ud568": 22, "concaten": [22, 27], "token\uacfc": [9, 22], "\ub4e4\uc758": 22, "\uacb0\ud569": [2, 22], "\ubaa8\ub378\ub9c1\ud558\uc5ec": [22, 23], "\uc2dc\uac01\ud654": [22, 23], "jiho": 22, "ml": [22, 29], "weekli": 22, "nlp": 22, "40": 22, "\ud30c\uc774\ud504\ub77c\uc778": 22, "cqom0r2kmvi": 22, "1729": 22, "\ud835\udc5e": 22, "\u03c6": 22, "dvae": 22, "token\ub97c": 22, "\ud835\udc5d": 22, "\ud835\udf03": 22, "token\uc5d0\uc11c": 22, "decoder\uc5d0\uc11c": 22, "\u03c8": 22, "purpl": 22, "\ubaa8\ub378\ub9c1\ud55c": [2, 22], "text\uc640": 22, "token\ub4e4\uc758": 22, "\ud835\udc5e_\u03c6": 22, "\ud835\udc5d_\ud835\udf03": 22, "\ud559\uc2b5\ud568": 22, "elb": 22, "bound\ub97c": 22, "192": 22, "elb\ub97c": 22, "continuous\ub97c": 22, "\ubc14\uafd4\uc57c": 22, "\ud559\uc2b5\uc2dc\uc5d0\ub294": 22, "argmax\ub97c": 22, "\uc778\ub371\uc2a4\ub97c": 22, "\uc120\ud0dd\ud558\uc5ec": 22, "\uacc4\uc0b0\ud558\uba74": 22, "reparameter": 22, "gradient\ub97c": [9, 22, 23], "\uc5f0\uc0b0": 22, "argmax": 22, "gumbel": 22, "\ud574\uacb0": 22, "underset": 22, "g_i": 22, "e_i": 22, "relaxation\ub97c": 22, "q_": [22, 29], "tau": [22, 28], "temperatur": 22, "relaxation\uc744": 22, "tight\ud558\uac8c": 22, "\uc7a1\uc544\uc90c": 22, "psi": 22, "120\uc5b5\uac1c\uc758": 22, "token\uc740": 22, "logit\uc5d0\uc11c": 22, "\uc18c\ubb38\uc790\ud654": 22, "384": 22, "vocabulary\ub97c": 22, "\ud55c\ubc88\uc5d0": 22, "causal": 22, "row": 22, "column": 22, "\ub300\ud558\uc5ec": [2, 22], "n\uac1c\ub294": 22, "n\uac1c": 22, "\uace8\ub77c\uc11c": 22, "\uace0\ub974\uae30": 22, "\ubc88\uc9f8\ub85c": 22, "\uc120\ud0dd\ud568": 22, "best\ub97c": 22, "\uace0\ub97c\ub54c": 22, "\uc99d\uac00\ud560\uc218\ub85d": 22, "prompt\ub791": 22, "\ub098\uc634": [22, 23], "\uc54c\uace0\ub9ac\uc998\uc744": 22, "score\uc774": 22, "\uc81c\uc77c": [22, 23, 28], "\ubf51\uc74c": 22, "\uc54c\ub9de\uc740": 22, "\uac1c\uc218\uc5d0": [22, 24], "df": 22, "five": 22, "vote": 22, "gan\ubcf4\ub2e4": [22, 23], "\uc555\ub3c4\uc801\uc778": [9, 22], "\ucc28\uc774\ub85c": 22, "\ud22c\ud45c": 22, "\ubc1b\uc558\uc74c": 22, "frechet": 22, "distanc": 22, "\ub0ae\uc744\uc218\ub85d": [22, 23], "\uc88b\uc73c\uba70": 22, "\ub192\uc744\uc218\ub85d": [22, 23], "\ub791": 22, "cub": 22, "coco\uc5d0\uc11c\ub294": 22, "\ubcf4\uc5ec\uc92c\uc74c": 22, "cub\uc5d0\uc11c\ub294": 22, "\ucc0d\uc9c0": 22, "\ubabb\ud558\uc600\uace0": 22, "score\uc5d0\uc11c\ub294": 22, "\uae30\ub85d\ud568": 22, "cub\uc5d0": 22, "\uacc4\uc120\uc744": 22, "\uc0dd\uac01\ud568": 22, "\uacb0\uacfc\uac12": 22, "\ud655\uc7a5": 22, "parameter\uacfc": 22, "\ub6f0\uc5b4\ub098\uac8c": 22, "\ud574\uacb0\ud568": 22, "\ud6cc\ub96d\ud55c": [20, 22, 26], "\uc77c\ubc18\ud654": 22, "\ud3c9\uac00\uc5d0\uc11c": 22, "\uc900\uc218\ud55c": 22, "\uc2f6\uc740": 22, "\uac1d\uccb4\uac00": 22, "\ud3ec\ud568\ub418\uba74": 22, "\uacaa\uc74c": 22, "\uace0\uc2b4\ub3c4\uce58\uac00": 22, "2\ub9c8\ub9ac\uac70\ub098": 22, "\uac15\uc544\uc9c0\uc640": 22, "\uace0\uc2b4\ub3c4\uce58": 22, "\ub458\ub2e4": 22, "\ud06c\ub9ac\uc2a4\ub9c8\uc2a4": 22, "\uc2a4\uc6e8\ud130\ub97c": 22, "\uc785\uace0": 22, "\uc544\uc26c\uc6b4": 22, "\ub370\uc774\ud130\uc14b\uc774": [22, 27], "tuning\uc73c\ub85c": 22, "limitation\uc744": 22, "2105": 23, "05233": 23, "\ubaa8\ub378\ub4e4\uc758": 23, "\ub6f0\uc5b4\ub118\uc74c": 23, "\ubd80\ubd84\uc5d0\uc11c\ub3c4": 23, "\ubcf4\uc5ec\uc900\ub2e4\uace0": [23, 24], "\uc8fc\uc7a5\ud568": 23, "diversity\uc640": 23, "fidelity\uc758": 23, "trade": [9, 23], "off\uc5d0": 23, "model\ub4e4\uc774\uba70": 23, "\uc0dd\uc131\ud574\ub0b4\ub294\ub370\uc5d0": 23, "\uc131\uacf5": 23, "\ud588\uc74c": 23, "deep\uc5d0": 23, "\ub0ae\uc73c\uba70": 23, "\uac1c\uc120\uc0ac\ud56d\uc774": 23, "\ud544\uc694\ud568": 23, "\ub450\uac00\uc9c0": 23, "model\ub4e4\uc758": 23, "\ub04c\uc5b4\uc62c\ub9ac\uba70": 23, "\ub0ae\ucd94\uaca0\ub2e4\uace0": 23, "\uc124\uba85\ub418\uc788\uc73c\ubbc0\ub85c": 23, "\ub17c\ubb38\ub4e4\uc758": 23, "\uadfc\uc0ac\uac12\uc774\ub77c\uace0": 23, "\uac00\uc815\ud558\uba70": 23, "\uacc4\uc0b0\ud55c\ub2e4": 23, "approx": [23, 25, 29], "\ub9cc\ub4e0\ub2e4": [20, 23], "\uc608\uce21\ud55c\ub2e4": [9, 23], "\uacf5\ubd84\uc0b0": 23, "\ubd88\uac00\ub2a5\ud55c": 23, "\ub9e4\uac1c\ubcc0\uc218\ub85c": 23, "\uc124\uc815\ub418\uba70": 23, "\uac00\uc9c4\ub2e4": 23, "pipelin": [23, 26], "ddpm\uc5d0\uc120": 23, "\uc9c0\ud45c\uac00": 23, "\ub0ae\uc558\ub2e4": 23, "scheduling\uc744": 23, "\uc0ac\uc6a9\ud588\uc9c0\ub9cc": 23, "\uc8fc\uc7a5\ud588\ub2e4": 23, "\ud559\uc2b5\uc5d0\ub3c4": 23, "\ub04a\uace0": 23, "\ubc14\uafc8": 23, "iteration\uc73c\ub85c": 23, "\ucc44\ud0dd\ud588\uc9c0\ub9cc": 23, "parameter\uc744": 23, "\ubcc0\uacbd\ud558\uc5ec": 23, "\uc77c\uc815\ud558\uac8c": 23, "\uac00\uc838\uac00\uba74\uc11c": 23, "\uc99d\uac00": 23, "\ubcf4\uae30": 23, "\uc2dc\ucf1c\ubcf4\uae30": 23, "head\uc5d0": 23, "8x8": 23, "16x16": 23, "\ud574\ubcf4\uae30": 23, "\uc77c\ubc18": 23, "block\uc774": 23, "biggan\uc758": 23, "block\uc744": [9, 23], "connection\uc744": 23, "chang": 23, "32\uc77c\ub54c": 23, "\ub0ae\ub2e4": 23, "160": 23, "resolution\uc744": [9, 23], "block\ub9c8\ub2e4": 23, "\uc904\uc774\uae30": 23, "\ud29c\ub2dd\uc744": 23, "adain\uc774\ub791": 23, "\uc5f0\uc0b0\ud558\ub294": 23, "adagn": 23, "\uc18c\uac1c\ud588\ub2e4": 23, "\ubc29\ubc95\ub860\uc778\uc9c0\ub294": 23, "\ubaa8\ub974\uaca0\ub2e4": 23, "normalization\uc744": 23, "adpative\ud558\uac8c": 23, "embedding\uacfc": 23, "adain": 23, "\uacf1\ud558\uace0": 23, "\ub354\ud568": 23, "y_b": 23, "where": [23, 28], "adagn\uc758": 23, "adagn\uacfc": 23, "additon": 23, "normalization\ubcf4\ub2e4": 23, "addit": [20, 23], "layer\uc744": 23, "\uc0ac\uc6a9\ud588\ub294\ub370": 23, "\ub0ae\uac8c": 23, "\uc8fc": 23, "de": 23, "\uc90c\uc73c\ub85c\uc368": 23, "zp_": 23, "normalizing\uc744": 23, "\uc0c1\uc218": 23, "log_": 23, "\uace1\ub960\uc774": 23, "\ubb34\ud55c\uc73c\ub85c": 23, "rightarrow0": 23, "\ud14c\uc77c\ub7ec": 23, "\uae09\uc218\ub97c": 23, "\uc7ac\uc804\uac1c": 23, "classifier\uc758": [9, 23, 26], "\uc2dd": 23, "\uc720\ub3c4\ub294": 23, "\ubcf8\ubb38\uc758": 23, "\ubc88\uc2dd\uc774\ubbc0\ub85c": 23, "\ubc29\ubc95\uc774\ub2e4": 23, "\ub611\uac19\uc774": 23, "sample\ud55c\ub2e4": 23, "ddim\uc5d0\uc11c": [23, 26], "gradient\uc758": 23, "\ube7c": 23, "score\uc744": 23, "\uad6c\ud55c\ub2e4": [20, 23], "scaling\uc758": 23, "\uc601\ud5a5": 23, "\uac12\uc5d0": 23, "classifier\uac00": 23, "scaling\uc774": 23, "\ub2e4\ub974\ub2e4": 23, "\uc8fc\uba74": 23, "\uc6f0\uc2dc\ucf54\uae30\ub77c\ub294": 23, "\uc6f0\uc2dc\ucf54\uae30\uc2a4\ub7ec\uc6b4": 23, "\uac15\uc544\uc9c0\uac00": 23, "\ub418\uc9c0\ub294": 23, "\uc6f0\uc2dc\ucf54\uae30": 23, "class\ub77c\ub294": 23, "\ubd84\uc704\uae30\uc758": 23, "\uac15\uc544\uc9c0\uc758": 23, "epsilon\uc774\ub77c\ub294": 23, "scale\uc5d0": 23, "\ubc1b\ub294\uc9c0": 23, "sampling\ud560": 23, "off": [9, 23], "scale\uc774": 23, "recall\uc740": 23, "\ub0ae\uc9c0\ub9cc": 23, "precision\uc740": 23, "\ub192\ub2e4": 23, "\uc0dd\uae30\ub294\ub370": 23, "recall\uc774": 23, "diveristy\uac00": 23, "\ub0ae\ub2e4\ub294": [23, 29], "\uc758\ubbf8\uc774\uace0": 23, "precision\uc774": 23, "\ub192\ub2e4\ub294": 23, "\ub73b\uc774\ub2e4": 23, "\ub192\uc77c\uc218\ub85d": 23, "label\ucabd\uc73c\ub85c": 23, "guide\uac00": 23, "\uc0dd\uae30\ubbc0\ub85c": 23, "\uc77c\uc815\ud55c": 23, "sfid\ub294": 23, "off\ub85c": 23, "\ub3c4\ucd9c\ub418\ub294": 23, "\uac12\uc774\ubbc0\ub85c": 23, "\ucd5c\uace0\uc758": [20, 23], "\uc9c0\uc810\uc5d0\uc11c": 23, "\ub098\uc654\ub2e4": 23, "adm\uc740": 23, "\uc57d\uc790\uc774\uba70": 23, "adm": [9, 23], "g\ub294": 23, "guidance\uc758": 23, "\uc57d\uc790\uc774\ub2e4": 23, "\uc8fc\uc5c8\uc744": 23, "fid\uac12\uc774": [23, 26], "\ub098\uc654\uc73c\uba70": 23, "vice": 23, "versa": 23, "center": 23, "\ub450\ubc88\uca30": 23, "\ud50c\ub77c\ubc0d\uace0": 23, "\ubcfc\ub54c": 23, "biggan\uc740": 23, "\uc774\ubbf8\uc9c0\uac04\ub4e4\uc758": 23, "\ud50c\ub77c\ubc0d\uace0\uac00": 23, "\ub2e4\uc218": 23, "\ub290\ub08c\uc758": 23, "\ubf51\uc544\ub0b8\ub2e4": 23, "\ub2e4\ucc44\ub85c\uc6b4": 23, "\ud55c\ub9c8\ub9ac\ub9cc": 23, "\uc0ac\uc9c4\ub3c4": 23, "\ub290\ub9ac\ub2e4": 23, "distil": 23, "\ubc95\uc744": 23, "\uace0\ub824": [2, 23], "guidance\ub294": 23, "classif": [9, 20, 23, 25], "function\uc758": 23, "label\uc774": 23, "data\uc5d0\ub294": 23, "\ud655\uc7a5\uc774": 23, "\ubd88\uac00\ub2a5\ud558\ub2e4": 23, "unlabel": 23, "sample\uc744": [9, 23], "cluster": 23, "\ubc29\ubc95\ub860\uc744": 23, "\ud558\ub824": 23, "driven": [9, 24], "12242": 24, "huggingfac": [24, 28], "\ucd5c\uadfc\uc5d0": [24, 25, 26], "\ub4f1\uc7a5\ud558\uc600\uc9c0\ub9cc": 24, "\ubd80\ubd84\uc5d0\uc11c": 24, "\uba74\ub4e4\uc744": 24, "\uac1c\uc120\ud558\uae30": 24, "\uae30\ubc95\uc73c\ub85c": [9, 24, 28, 29], "\uc18c\uac1c\ub418\uc5c8\uace0": 24, "5\uc7a5\uc758": 24, "\ub418\uba70": [24, 27], "nvidia": [24, 28], "5\ubd84": [2, 24], "\uc815\ub3c4\ubc16\uc5d0": 24, "\uc18c\uc694\ub418\uc9c0": 24, "\uc54a\ub294\ub2e4\uace0": 24, "\ubb34\uc5c7\uc778\uc9c0": [24, 29], "\uc54c\uc544\ubcf4\uae30": 24, "\uc815\ub9ac\ub97c": 24, "\ud574\ubcfc": 24, "gamma": 24, "\uc785\ub825\ubc1b\uc544\uc11c": 24, "gen": 24, "\uc218\uc2dd\uc801\uc73c\ub85c": [24, 29], "\ud45c\ud604\ud558\uba74": [24, 29], "w_t": [2, 24], "alpha_tx": 24, "t5": [20, 24], "xxl": [20, 24], "\ud560\ub54c": 24, "\ub54c\ub85c\ub294": 24, "\ud3ec\ud568": 24, "\uace0\uc815\uc2dc\ud0a8\ub2e4\uace0": 24, "\uc55e\uc368": [24, 27, 28], "\uc124\uba85\ub4dc\ub838\ub358": 24, "\ub0b4\uc6a9\ub4e4\uc744": 24, "blob": 24, "main": 24, "text_encoder_cl": 24, "import_model_class_from_model_name_or_path": 24, "arg": [20, 24, 29], "noise_schedul": 24, "ddpmschedul": 24, "from_pretrain": 24, "subfold": 24, "text_encod": 24, "autoencoderkl": 24, "unet2dconditionmodel": 24, "epoch": [24, 25, 28], "first_epoch": 24, "num_train_epoch": 24, "train_dataload": 24, "skip": [24, 26], "until": 24, "reach": 24, "resum": 24, "resume_from_checkpoint": 24, "resume_step": 24, "progress_bar": [24, 28], "continu": [2, 24], "accumul": 24, "pixel_valu": 24, "weight_dtyp": 24, "latent_dist": 24, "config": 24, "scaling_factor": 24, "offset_nois": 24, "randn": [20, 24], "bsz": 24, "randint": 24, "num_train_timestep": 24, "accord": 24, "magnitud": 24, "noisy_lat": 24, "add_nois": 24, "get": 24, "encoder_hidden_st": [20, 24, 28], "input_id": 24, "model_pr": 24, "prediction_typ": 24, "v_predict": 24, "get_veloc": 24, "part": 24, "model_pred_prior": 24, "target_prior": 24, "mse_loss": [20, 24], "float": 24, "prior_loss": 24, "sync_gradi": 24, "params_to_clip": 24, "itertool": 24, "clip_grad_norm_": 24, "max_grad_norm": 24, "zero_grad": [24, 25], "set_to_non": 24, "set_grads_to_non": 24, "noun": 24, "\uc720\uc9c0\ud558\uace0\uc790": 24, "\ub300\uc0c1\uc5d0": 24, "\ub2f4\ub294": 24, "rare": [24, 27], "3\uac1c": 24, "unicod": 24, "charact": 24, "\ub79c\ub364\ud558\uac8c": 24, "\uc0d8\ud50c\ub9c1\ud574\uc11c": 24, "\uc815\uc758\ud569\ub2c8\ub2e4": [24, 27, 28, 29], "drift": 24, "\uc785\ub825\ud558\uc5ec": 24, "\uacc4\uc0b0\ud569\ub2c8\ub2e4": 24, "\uacfc\uc815\uc73c\ub85c": [2, 20, 24], "\ud559\uc2b5\ud558\uace0\uc790": 24, "\uc2dc\ud0a8": 24, "sigma_t": 24, "alpha_": [9, 24], "\ucd94\uac00\ud568\uc73c\ub85c\uc368": 24, "\uc720\uc9c0\ud558\uac8c": 24, "\uc774\ub85c\uc368": [24, 29], "encourag": 24, "\uac00\uc9c0\uc758": 24, "\uccab\ubc88\uc9f8\ub85c\ub294": 24, "dino": 24, "\uc0dd\uc131\ub418\uae30": 24, "\uc120\ud638\ub41c\ub2e4\uace0": 24, "\uc790\uc138\ud558\uac8c\ub294": [24, 28, 29], "\uacc4\uc0b0\ub429\ub2c8\ub2e4": 24, "pairwis": 24, "\ube44\uad50\ud588\uc744\ub54c": 24, "\uacb0\uacfc\ub3c4": [24, 28], "\uc801\uc6a9\ub428\uc73c\ub85c\uc368": 24, "\uc18c\uac1c\ub4dc\ub838\ub358": 24, "div": 24, "\ud574\uacb0\ub418\ub294": 24, "\uc785\ub825\ud588\uc744\ub54c\uac00": 24, "\uc124\uba85\ud569\ub2c8\ub2e4": 24, "randomli": 24, "can": 24, "backpack": 24, "recontextu": 24, "articul": 24, "art": 24, "famou": 24, "painter": 24, "statu": 24, "sculptor": 24, "\ucc44": 24, "\ud615\ud0dc\ub3c4": 24, "novel": 24, "\uac01\ub3c4\uc5d0\uc11c": 24, "\ubcf4\ub294": 24, "\uc0dd\uc131\ub3c4": [24, 26], "properti": 24, "modif": 24, "dog": 24, "speci": 24, "\uace0\uc720": 24, "\ub4e4\uc774": 24, "\uc5d0\uc11c\ub3c4": 24, "\ubc18\uc601\uc774": 24, "\ud55c\uacc4\uc810\ub3c4": 24, "\uc790\uc8fc": 24, "\ub098\ud0c0\ub098\uc9c0": 24, "appear": 24, "\ubcf4\uc778\ub2e4\uace0": [9, 24], "\ubcf8\ubb38\uc5d0": 24, "\uc18c\uac1c\ub418\uace0": 24, "\uc788\uc9c0\ub294": 24, "\uc54a\uc9c0\ub9cc": 24, "\ubd80\ubb38\uc5d0\uc11c\ub3c4": 24, "\ud559\uc2b5\uacb0\uacfc\ub97c": 24, "\ubcf4\uc5ec\uc8fc\ub294\ub370": 24, "\uc7a5\ub9cc\uc73c\ub85c\ub3c4": 24, "\ub9cc\ud654": 24, "\uc0ac\ub840\ub4e4\uc744": 24, "nip": 25, "2014": [25, 29], "1406": 25, "2661": 25, "eriklindernoren": 25, "smart": [25, 29], "lab": [25, 28, 29, 30], "kaist": [25, 29], "\ub525\ub7ec\ub2dd": [25, 29], "chp": 25, "ian": 25, "goodfellow": 25, "2014\ub144\uc5d0": 25, "\ubc1c\ud45c\ud55c": 25, "\uc18c\uac1c\ub418\uae30": 25, "\uc804\uae4c\uc9c0": 25, "\ub144": 25, "\uc0dd\uc131\ubd84\uc57c\uc5d0\uc11c": 25, "\ub300\ud45c\uc801\uc778": 25, "\uc790\ub9ac\uc7a1\uc558\uc5c8\uc2b5\ub2c8\ub2e4": 25, "margin": [25, 29], "\uad6c\ud558\uac8c": 25, "taxonomi": 25, "\uc7a0\uc7ac\ubcc0\uc218": [25, 29], "\uadf8\ub85c\ubd80\ud130": 25, "\uad6c\ubd84\ud558\ub294": 25, "\uad6c\uc131\uc774": 25, "\ub9d0\ud574\uc11c": 25, "\ub4e4\uc5b4\uc624\uba74": 25, "\uac00\uc9dc\ub85c": 25, "\ucd9c\ub825\ud558\ub294": 25, "binari": 25, "\uc9c4\ud589\ud569\ub2c8\ub2e4": 25, "\ucf54\ub4dc\ub3c4": 25, "in_feat": 25, "out_feat": 25, "batchnorm1d": 25, "leakyrelu": 25, "inplac": 25, "opt": 25, "latent_dim": 25, "np": 25, "prod": 25, "img_shap": 25, "tanh": 25, "sigmoid": [25, 29], "img_flat": 25, "d\ub97c": 25, "g\ub97c": 25, "\uc190\uc2e4\ud568\uc218": [25, 29], "min_g": 25, "max_d": 25, "p_z": 25, "\uc54c\uace0\ub9ac\uc998\uacfc": 25, "\ube44\uad50\ud574\ubcf4\uaca0\uc2b5\ub2c8\ub2e4": 25, "n_epoch": 25, "variabl": [2, 25, 29], "fill_": 25, "fake": 25, "real_img": 25, "optimizer_g": 25, "gen_img": 25, "measur": 25, "abil": [20, 25, 28], "fool": 25, "g_loss": 25, "adversarial_loss": 25, "optimizer_d": 25, "real_loss": 25, "fake_loss": 25, "d_loss": 25, "print": 25, "item": 25, "batches_don": 25, "sample_interv": 25, "save_imag": 25, "png": 25, "nrow": 25, "\ucd5c\ub300\ud654\ud558\uace0": 25, "descent": 25, "\uc9c4\ud589\ud558\uac8c": 25, "\ud559\uc2b5\ud558\uc9c0": 25, "\uc0c1\ud669\uc774": 25, "\ubc1c\uc0dd\ud569\ub2c8\ub2e4": [25, 27], "\ucd5c\uc18c\ud654\ud558\uc9c0": 25, "\ucd5c\ub300\ud654\ud558\ub294": 25, "\uae30\ubc95\ub3c4": 25, "\ucd5c\uc801\ud654\ub41c": 25, "solut": 25, "\uc644\ubcbd\ud788": 25, "\ubcf5\uc6d0\ud558\uace0": 25, "\uc5b8\uc81c\ub098": 25, "\ub0b4\ubc49\uac8c": 25, "proposit": 25, "p_g": 25, "\uc99d\uba85\ud558\uc790\uba74": 25, "\uc190\uc2e4\ud568\uc218\ub97c": [25, 28], "int_x": 25, "int_z": 25, "dz": [25, 29], "\uc77c\ub54c": 25, "\uc131\ub9bd\ud558\uace0": 25, "\uc190\uc2e4\ud568\uc218\ub294": [25, 28], "\uac19\uace0": 25, "ast": 25, "jsd": 25, "\ucd5c\uc19f\uac12\uc740": 25, "\uc131\ub9bd\ud569\ub2c8\ub2e4": 25, "mnist": 25, "toronto": 25, "databas": 25, "tfd": 25, "\ud3c9\uac00\ud588\uc2b5\ub2c8\ub2e4": 25, "\ud3c9\uac00\uc2dc\uc5d0\ub294": 25, "parzen": 25, "densiti": 25, "\uac70\uccd0": 25, "vae\ub294": 25, "\ud750\ub9bf\ud558\ub2e4\ub294": 25, "unstabl": 25, "converg": [25, 26], "\ucc28\uc6d0\ucd95\uc18c\ub85c": 25, "\ud65c\uc6a9\ub418\uace0": 25, "\uc0dd\uc131\ud558\ub294\ub370\ub294": [9, 25], "\ud65c\uc6a9\ub418\uc5c8\ub2e4\uace0": 25, "2205": [2, 26], "11487": 26, "learning\uc774": 26, "\ub354\ubd88\uc5b4": 26, "\ub4e4\uc744": [20, 26, 29], "\ub3c5\ucc3d\uc801\uc778": 26, "\ub9d0\ubb49\uce58": 26, "corpu": 26, "llm\ub4e4\uc758": 26, "embedding\ub4e4\uc740": 26, "\ud6a8\uacfc\uc801\uc774\ub77c\uace0": 26, "\ucda9\uc2e4\ub3c4": 26, "\uc0ac\uc774\uc988\ub97c": 26, "\uc911\uc694\ud558\ub2e4\ub294": 26, "\uc99d\uba85\ud568": 26, "\uc81c\uc2dc\ud558\uc5ec": 26, "weight\uc744": 26, "leverag": 26, "\ub9cc\ub4e4\uc5b4": 26, "\ud604\uc2e4\uc801\uc778": 26, "palett": [26, 27], "\uad6c\uc870\ubcf4\ub2e4": 26, "\uc81c\uc2dc\ud568": 26, "27": 26, "\ub2ec\uc131\ud568": 26, "evaluation\uc6a9": 26, "benchmark": [26, 27], "encoder\uc744": 26, "\ud574\ub193\uc74c": 26, "improv": [9, 26], "sr": 26, "\uc774\ub780": 26, "\ud6a8\uacfc\ub97c": [26, 27], "guidance\uac00": [9, 26], "generation\uc774": 26, "\uc77c\uc815\ud558\uc9c0": 26, "\ubabb\ubc1b\uc544\uc11c": 26, "class\ub098": 26, "object\uc774": 26, "\uc77c\uc815\ud558\uace0": 26, "\ubb34\uc5c7\uc744": 26, "\uc0dd\uc131\ud558\ub294\uac83\uc778\uc9c0": 26, "\uc790\uc138\ud558\uac8c": 26, "guide\uc758": 26, "\ub192\uc774\uba74": 26, "\ubd88\uc77c\uce58\uac00": 26, "\uac00\uc911\uce58\uc758": 26, "\ubc94\uc704": [26, 27], "\uc774\ub3d9\uc2dc\ucf1c": 26, "\uc544\uc608": 26, "\ube57\ub098\uac00": 26, "\uc774\uc0c1\ud55c": 26, "satur": 26, "\ub35c\ud55c": 26, "\ub40c": 26, "\ud574\uacb0\ud558\uace0\uc790": 26, "\ubc31\ubd84\uc704\uc218": 26, "\uc808\ub300": 26, "\uc9c0\uc815\ud558\uace0": 26, "s\ub85c": 26, "\ub098\ub208\ub2e4": 26, "90": [20, 26], "\uc9c0\uc810\uc758": 26, "among": 26, "net\uc774\ub77c\ub294": 26, "net\uc5d0\uc11c": 26, "\uc5ec\ub7ec\uac00\uc9c0": 26, "modification\uc744": 26, "\ud558\uc600\ub2e4\uace0": 26, "effu": 26, "net\uc740": 26, "\uc758\ub8cc\ucabd\uc73c\ub85c": 26, "\uc788\ub294\uac78\ub85c": 26, "\uc544\ub294\ub370": 26, "remov": 26, "keep": 26, "connect": 26, "scaling\uc744": 26, "\ud558\uc5ec": 26, "block\uc5d0\uc11c": 26, "blocks\ub97c": 26, "\ucd94\uac00\ud568": 26, "\ubca4\uce58\ub9c8\ud06c": 26, "\ub370\uc774\ud130\uc14b\uc740": [9, 20, 26], "categori": 26, "\uc774\ub8e8\uc5b4\uc84c\ub2e4": 26, "\uae43\ud5c8\ube0c\uc5d0\uc11c": 26, "\ub2e4\uc6b4": 26, "\ubc1b\uc744": 26, "\uac17\ub2e4": 26, "25\uba85\uc758": 26, "\ud3c9\uac00\uc790": 26, "a\uc5d0\uc11c": 26, "\ud3c9\uac00\uc790\ub294": 26, "\uc9c8\ubb38\uc744": 26, "\uae30\uc900\uc810\uc73c\ub85c": 26, "q1": 26, "higher": 26, "q2": 26, "repres": 26, "\uae30\uc900\uc810": 26, "\ub2f5\ubcc0": 26, "\uc120\ud0dd\ud574\uc57c\ud568": 26, "am": 26, "indiffer": 26, "screenshot": 26, "drawbench\uc5d0\uc11c": 26, "\uccb4\ub9ac\ud53c\ud0b9": 26, "\uc5c6\uc774\ub3c4": 26, "\uce74\ud14c\uace0\ub9ac\uc5d0\uc11c\ub3c4": 26, "\uc8fc\uc7a5\uc778": 26, "\ubaa8\ub378\ub4e4": 26, "peopl": 26, "\uc62c\ub77c\uac10": 26, "people\uc744": 26, "\uc0dd\uc131\ud558\uae30\uc5d0": 26, "rater": 26, "xxl\ub85c": 26, "\uc120\ud638\ud568": 26, "\ubc1b\uc74c": 26, "evaul": 26, "\uc911\uc694\ud568": 26, "\uc0ac\uc774\uc988\uc758": 26, "\ub07c\uce68": 26, "boost\uc5d0": 26, "thresholding\uc744": 26, "\ub04c\uc5b4": 26, "\uc62c\ub9b4": 26, "allow": 26, "usag": 26, "much": 26, "editbench": 27, "advanc": 27, "inpaint": [27, 28], "06909": 27, "06": 27, "\uc2dc\uac04\uc5d0\ub294": [27, 28], "googl": 27, "\uc18c\uac1c\ud558\ub294": [27, 28, 29], "impaint": [9, 27], "\ud3c9\uac00\uae30\ubc95": 27, "\uc608\uc815\uc785\ub2c8\ub2e4": [27, 28], "\uae30\uc874\uc5d0\ub294": 27, "\uc601\uc5ed\uc744": 27, "\uc9c0\uc815\ud558\uc5ec": 27, "\ucc38\uc870\ud558\uc9c0": 27, "\uc624\ub85c\uc9c0": 27, "\ub9cc\uc73c\ub85c": 27, "\ucc38\uc870\ud560": [9, 27], "\uc720\ub3c4\ud558\ub294": 27, "ssd": 27, "mobilenet": 27, "v2": 27, "detector": 27, "\uac1c\uc120\ub418\ub294": 27, "\ud2b9\uc9d5\uc740": 27, "cascad": 27, "\uc810\uc785\ub2c8\ub2e4": 27, "sr3": 27, "\ud558\uba74\uc11c": 27, "\uac00\uc9c4\ub2e4\uace0": 27, "\uc791\uc5c5": 27, "\uc785\ub825\ud569\ub2c8\ub2e4": [27, 28], "\ub0b4\uae30": 27, "\ucd94\uac00\ub418\ub294": 27, "\ucd08\uae30\ud654\ud574\uc11c": 27, "\uc18c\uac1c\ub418\uc5c8\ub358": 27, "1\ubd80\ud130": 27, "\ubcc0\ud654\uc2dc\ud0a4\ub294": 27, "oscil": 27, "\uc801\uc6a9\ud568\uc73c\ub85c\uc368": 27, "\ud004\ub9ac\ud2f0": 27, "\uc0c1\uc2b9\ub418\ub294": 27, "240\uac1c\uc758": 27, "\uc30d\uc73c\ub85c": [9, 27], "\uad6c\ucd95\ub418\uc5b4\uc788\uace0": 27, "\uc30d\ub9c8\ub2e4": 27, "3\uac00\uc9c0\uc758": 27, "\uce21\uc815\ud558\uac8c": 27, "\uc73c\ub85c\ub294": [27, 28], "clipscor": 27, "prec": 27, "\uc808\ubc18\uc740": 27, "open": 27, "\ub370\uc774\ud130\uc14b\uc73c\ub85c\ubd80\ud130": 27, "\uc218\uc9d1\ub418\uc5c8\uace0": 27, "\uc0dd\uc131\ud574\uc11c": 27, "\uad6c\ucd95\ud588\uc2b5\ub2c8\ub2e4": 27, "\uc694\uc18c\ub4e4\uc744": 27, "\uac16\ucd94\ub3c4\ub85d": 27, "\uc0dd\uc131\ud588\uc2b5\ub2c8\ub2e4": 27, "materi": 27, "common": 27, "render": 27, "indoor": 27, "outdoor": 27, "\ub4e4\uc5b4\uc11c": [20, 27], "metal": 27, "\ubb38\uad6c\ub97c": 27, "stand": 27, "farm": 27, "\ud574\ub2f9\uc0ac\uc9c4\ucc98\ub7fc": 27, "rich": 27, "\uad6c\ucd95\uc2dc": 27, "\ud06c\uae30\ub3c4": 27, "\ub2e4\uc591\ud558\uac8c": 27, "\uc124\uc815\ud558\uc5ec": [9, 27], "\ud06c\uae30\uc5d0": [9, 27], "\uce21\uc815\ud574\ubcf8": 27, "medium": 27, "\uc131\ub2a5\uc801\uc73c\ub85c": 27, "\uc800\ud558\ub418\ub294": [27, 28], "\uc18d\uc131\ubcf4\ub2e4": 27, "\uc18d\uc131\uc5d0": 27, "\ucde8\uc57d\ud55c": 27, "failur": 27, "\uc0ac\uc9c4\uc785\ub2c8\ub2e4": [27, 28], "maskrich": 27, "dig": 28, "more": 28, "08453": 28, "tencent": 28, "arc": 28, "\ube44\ub86f\ud55c": 28, "\ub09c\ud574\ud55c": 28, "car": 28, "fly": 28, "wing": 28, "iron": 28, "man": 28, "bunni": 28, "ear": 28, "\uc785\ub825\ubc1b\uc744": 28, "textur": 28, "\ud45c\ud604\ud558\uae30": 28, "\ub9cc\uc73c\ub85c\ub294": 28, "\ud544\uc694\ud558\ub2e4\uace0": 28, "\uc11c\uc220\ud569\ub2c8\ub2e4": 28, "intern": 28, "knowledg": [20, 28], "extern": 28, "\uc18c\uac1c\ud558\uace0": 28, "5\uac00\uc9c0": 28, "plug": 28, "plai": 28, "77m": 28, "300m": 28, "\uc5f0\uc0b0\uc791\uc5c5\uc774": 28, "\uc2e4\ud589\ub429\ub2c8\ub2e4": 28, "\uac00\uc838\uc624\uae30": 28, "\uc6a9\ub7c9\uc774": 28, "\ud06c\uace0": 28, "flexibl": 28, "compos": 28, "generaliz": 28, "\uae30\ubc18\uc774": 28, "autoencod": [28, 29], "\ubc14\uafb8\uace0": 28, "\ubcf5\uc6d0\ud558\ub294": 28, "_2": 28, "bar": 28, "_t": 28, "z_0": 28, "\uc785\ub825\ud568\uc73c\ub85c\uc368": 28, "matric": 28, "\uac00\uc9c0\uba70": [9, 28], "unshuffl": 28, "\ubcc0\ud658\uc774": 28, "1\uac1c\uc758": 28, "4\ubc88": 28, "\ud1b5\uacfc\ud558\uac8c": 28, "\uac70\uce58\uace0": 28, "f_c": 28, "\uc0dd\uc131\ub418\uace0": 28, "\uc5d0\uc11c\uc758": [2, 20, 28], "intermedi": [20, 28], "f_": 28, "enc": 28, "\ub354\ud574\uc9c0\uac8c": 28, "\ub3d9\uc77c\ud558\ub3c4\ub85d": 28, "\uc124\uc815\ud588\uae30": 28, "\ub367\uc148": 28, "\uc5f0\uc0b0\ud558\ub294\ub370": 28, "fulladapt": 28, "in_channel": 28, "320": 28, "640": 28, "1280": 28, "num_res_block": 28, "downscale_factor": 28, "pixelunshuffl": 28, "conv_in": 28, "kernel_s": 28, "bodi": 28, "adapterblock": 28, "total_downscale_factor": 28, "out_channel": 28, "downsample2d": 28, "in_conv": 28, "adapterresnetblock": 28, "act": 28, "relu": [28, 29], "adapter_st": 28, "adapter_input": 28, "adapter_conditioning_scal": 28, "num_images_per_prompt": 28, "repeat": 28, "do_classifier_free_guid": 28, "num_warmup_step": 28, "order": 28, "total": 28, "latent_model_input": 28, "scale_model_input": 28, "noise_pr": [20, 28], "prompt_emb": 28, "cross_attention_kwarg": 28, "down_block_additional_residu": 28, "state": 28, "noise_pred_uncond": 28, "noise_pred_text": 28, "previou": 28, "extra_step_kwarg": 28, "prev_sampl": 28, "\uc885\ub958\ub85c\ub294": 28, "\ubd84\ub958\ud560": 28, "sketch": 28, "segment": 28, "keypos": 28, "bicub": 28, "\uc81c\uc678\uc2dc\ud0a4\uace0": 28, "nearest": 28, "\ud06c\uae30\ub85c": 28, "\ubd80\ubd84\ucc98\ub7fc": 28, "\uc815\uc758\ud558\uac8c": [28, 29], "\uace0\uc815\uc2dc\ud0a8": [9, 28], "\ud30c\ub77c\ubbf8\ud130\ub9cc": 28, "t2": 28, "\uc2dc\uc640": 28, "dure": 28, "\ub123\uc73c\uba74\uc11c": 28, "\ub9c8\ub2e4": [2, 20, 28], "expens": 28, "late": 28, "\uc2e4\ud5d8\ud574\ubcf8": 28, "\ud06c\ub2e4\uace0": 28, "earli": 28, "\ud3ec\ud568\ub418\ub3c4\ub85d": 28, "\uc218\uc2dd\ucc98\ub7fc": 28, "uniformli": 28, "\uc9c4\ud589\ud588\uace0": 28, "cubic": 28, "\uc0c1\uc138\uc0ac\ud56d\uc740": 28, "4x": 28, "tesla": 28, "32g": 28, "v100": 28, "dai": 28, "\uc2e4\ud5d8\ubcc4": 28, "coco17": 28, "164k": 28, "pidinet": 28, "stuff": 28, "keypoint": 28, "aesthet": [20, 28], "\ub370\uc774\ud130\uc14b\ub85c\ubd80\ud130": 28, "600k": 28, "\ucd94\ucd9c": 28, "mm": 28, "mida": 28, "\ubaa8\ub378\ub4e4\uacfc": 28, "\uc815\ub7c9\uc801\uc778": 28, "\uc218\uce58\ub85c": 28, "\ube44\uad50\ud558\ub294\ub370": 28, "\uc0ac\uc6a9\ud558\uc600\uace0": [9, 20, 28], "\ud558\ub2e8": 28, "\uc0ac\uc9c4\ucc98\ub7fc": 28, "\uc88b\uc2b5\ub2c8\ub2e4": 28, "quantit": [9, 28], "comparisoin": 28, "\uc608\uc2dc\ub4e4\uc740": 28, "\uc815\ud655\ud558\uc9c0": 28, "\uc9c0\uc5ed\uc744": 28, "\ubabb\ud558\ub2e4\uace0": 28, "\uac83\ub85c": 28, "\uc704\uc5d0\uc11c\ubd80\ud130": 28, "\uc7a5\uc810\ub4e4": 28, "\uba85\uc2dc\ub418\uc5c8\ub358": 28, "\uc0ac\ub840\uc785\ub2c8\ub2e4": 28, "\uc644\ub8cc\ud55c": 28, "\uc801\uc6a9\ud558\uba74\uc11c": 28, "4\ubcf4\ub2e4": 28, "\uc791\uc744": 28, "\uc801\uc6a9\ud588\uc2b5\ub2c8\ub2e4": 28, "\uacbd\ub7c9\ud654\ub41c": 28, "\uc608\uc2dc\ucc98\ub7fc": 28, "\uc22b\uc790\ub97c": 28, "\ubc14\uafd4\uac00\uba70": 28, "tini": 28, "x4": 28, "x8": 28, "compress": 28, "auto": 29, "bay": 29, "1312": 29, "6114": 29, "gunhochoi": 29, "fastcampu": 29, "ch": 29, "\ubb38\uad6c\uac00": 29, "\uc801\ud600\uc788\ub294\ub370\uc694": 29, "bayesian": 29, "vb": 29, "approach": [29, 30], "involv": 29, "\uc81c\uc2dc\ud558\ub294": 29, "aevb": 29, "\uc54c\uace0\ub9ac\uc998": 29, "\ub274\ub7f4": 29, "\ub124\ud2b8\uc6cc\ud06c\ub85c": 29, "\uadfc\uc0ac\ud568\uc73c\ub85c\uc368": 29, "\uc774\uac00": 29, "\ubc14\uac00": 29, "\ubd80\ubd84\uc73c\ub85c": 29, "\ub9cc\ub4e4\uc5b4\ub0b4\uace0": 29, "\ubcf5\uc6d0\ud558\uac8c": 29, "assumpt": 29, "\ub0b4\ub9bd\ub2c8\ub2e4": 29, "\uccab\ubc88\uc9f8\ub85c": 29, "parametr": 29, "\ud558\ub2e4\ub294": 29, "\ub530\ub974\uace0": 29, "\uc131\uc9c8\uc5d0": 29, "bernoulli": 29, "\ub530\ub974\ub3c4\ub85d": 29, "\uacc4\uc0b0\uc774": 29, "\ucd5c\ub300\ud654\uc2dc\ud0a4\ub294": 29, "\uad6c\ud558\ub294": [9, 20, 29], "\uacc4\uc0b0\ud558\uae30": 29, "\ub4f1\uc7a5\ud558\uac8c": 29, "\uadfc\uc0ac\ud654\ud558\ub294": 29, "\ub124\ud2b8\uc6cc\ud06c": 29, "\ub3c4\uc2dd\ud654\ud55c": 29, "\uc815\ub9ac\ud558\uc790\uba74": 29, "\uacc4\uc0b0\ub41c": 29, "\ud655\uc778\ud574\ubcf4\uaca0\uc2b5\ub2c8\ub2e4": 29, "fc1_1": 29, "784": 29, "hidden_s": 29, "fc1_2": 29, "log_var": 29, "reparametr": 29, "std": 29, "mul": 29, "exp_": 29, "ep": 29, "floattensor": 29, "cuda": 29, "add_": 29, "reparam": 29, "fc1": 29, "\ucc3e\uc73c\uba74": 29, "\ubd84\ud560\ud560": 29, "min_": 29, "g_": 29, "\uc720\uc0ac\ud558\ub3c4\ub85d": 29, "\uc7a0\uc7ac\ubcc0\uc218\uc758": 29, "\uc800\ud76c\uac00": 29, "\ubd80\uc5ec\ud55c": 29, "\uac00\uae5d\ub3c4\ub85d": 29, "\uc124\uc815\ud558\ub294": 29, "mont": 29, "carlo": 29, "\uadfc\uc0ac\uac12\uc744": 29, "\uad6c\ud560": [9, 29], "\uc5f0\uc0b0\ub7c9\uc774": 29, "\ub9ce\uc73c\ubbc0\ub85c": 29, "\uc124\uc815\ud569\ub2c8\ub2e4": 29, "\uae30\ubc95\uc740": 29, "\uc0d8\ud50c\ub9c1\ud558\uc9c0": 29, "backpropag": 29, "\ub354\ud558\uace0": 29, "\uacf1\ud558\uac8c": 29, "\ub530\ub978\ub2e4\uace0": 29, "\uc124\uc815\ud588\uc744": 29, "\ub54c\uc774\uace0": 29, "\uac00\uc815\ud560": 29, "\uc2dc\ub3c4\ud560": 29, "\uba85\uc2dc\ub418\uc5b4": 29, "\uc9c0\uc815\ud574\uc92c\ub2e4\uba74": 29, "\ud30c\ub77c\ubbf8\ud130\ub4e4\uacfc": 29, "\uc7a0\uc7ac\ubcc0\uc218\ub97c": 29, "\uc0ac\uc6a9\ud574\ubcf4\uba74": 29, "repositori": 30, "pseudo": 30, "team": 30, "bulb": 30, "aim": 30, "them": 30, "theoret": 30, "conduct": 30, "experi": 30, "\ucc38\uc5ec": 30, "\ub9e4\uc8fc": 30, "\uc218\uc694\uc77c": 30, "\uc624\ud6c4": 30, "9\uc2dc": 30, "\uac00\uc9dc\uc5f0\uad6c\uc18c": 30, "discord": 30, "room": 30, "dh": 30, "\uc785\uc7a5": 30, "brownian": 2, "bridg": 2, "07680": 2, "xuekt98": 2, "linkedin": [], "seonhoonkim": [], "nov": [2, 20], "\uadf9\ubcf5\ud568": 2, "\uc2dc\uac04\uc758": 2, "\ud750\ub984\uc5d0": 2, "\ubd88\ud655\uc2e4\uc131\uc744": 2, "\ubcc0\ud558\ub294": 2, "\ubcc0\uc218\ub4e4\uc758": 2, "\uc9d1\ud569": 2, "\ubcc0\uc218\ub97c": 2, "\ubcc0\uc218\uac00": 2, "\uad00\ucc30\ub41c": 2, "\uad6c\ubd84\ud560": 2, "motion": 2, "wiener": 2, "\uc720\uccb4\uc758": 2, "\ubbf8\uc18c\uc785\uc790\uac00": 2, "\ubd88\uaddc\uce59\ud558\uac8c": 2, "\uc6b4\ub3d9\ud558\ub294": 2, "\uad74\ub69d\uc5d0\uc11c": 2, "\ud37c\uc838\ub098\uac04": 2, "\uc5f0\uae30": 2, "\uc624\ub978\ucabd\uc73c\ub85c": 2, "90\ub3c4": 2, "\ud68c\uc804\uc2dc\ud0a8": 2, "\uc5f0\uc18d": 2, "\uc774\ud574\ud574\ubcf4\uc790": 2, "\uac00\uc815\ud574\ubcf4\uc790": 2, "\ud558\uc790": 2, "\uc774\ud574\ud558\uae30": 2, "\ud558\ub2e4\uace0": 2, "\ubd80\uc5ec\ub418\uc5b4\uc57c": 2, "\uac04\uaca9\uacfc": 2, "\ube44\ub840\ud574\uc57c": 2, "notat": 2, "ld0rxwajpkm": 2, "finrgb": 2, "\uac04\uaca9": 2, "\uc0b4\ud3b4\ubcf4\uace0\uc790": 2, "\uac04\uaca9\uc758": 2, "epsilon_t": [2, 9], "\uc2dc\uc810\uc5d0\uc11c": 2, "\uac04\uaca9\uae4c\uc9c0": 2, "\uc815\uc758\ud574": 2, "\uadfc\uac70\ub97c": 2, "\ucc3e\uc544\ubcf4\uba74": 2, "\ubcc0\uc218": 2, "\ub3c4\uc785\ud568\uc73c\ub85c\uc368": 2, "\ubd80\uc5ec": 2, "\uac04\uaca9\ub3c4": 2, "\ud558\ud544": 2, "\uacf1\ud588\uc744\uae4c": 2, "\uac00\uae4c\uc6cc\uc9c8": 2, "\ucc9c\ucc9c\ud788": 2, "\uc218\ub834": 2, "\ud558\ub2e4\uba74": 2, "\ub77c\uba74": 2, "\uc791\uc544\uc9d0": 2, "\ucee4\uc9c8": 2, "\ucee4\uc9d0": 2, "\uc8fc\uc758\ud560": 2, "\uc0ac\ud56d": 2, "w_1": 2, "\ub3c5\ub9bd": 2, "\ub9de\uc9c0\ub9cc": 2, "\ub3c5\ub9bd\uc774\ub77c\ub294": 2, "\uc544\ub2d8": 2, "epsilon_0": 2, "var": 2, "\uacf5\ubd84\uc0b0\uc740": 2, "\uc810\ub4e4\uc740": 2, "\ubcf4\ub77c\uc0c9": 2, "\uc810\ucc98\ub7fc": 2, "\ud655\ub960\uc5d0": 2, "\uc874\uc7ac\ud560": 2, "\uc218\ud589\ud558\uba74": 2, "\ubcc0\ud55c\ub2e4": 2, "t_2": 2, "t_1": 2, "10\ubd84\uc73c\ub85c": 2, "\uc9c4\ud589\ud558\uba74": 2, "w_5": 2, "\uc544\ub2d0": 2, "\uc788\uc73c\ub098": 2, "\ubcc0\ud654\ub7c9": 2, "t_5": 2, "\ub530\ub978\ub2e4": 2, "\uc2dc\uc810\uacfc": 2, "\uc54c\uace0": 2, "\ubb34\uc5c7\uc77c\uae4c": 2, "sine": 2, "qua": 2, "158": 2, "\uc120\ud615\uc73c\ub85c": 2, "\uc5f0\uacb0\ub41c": 2, "\uc2dc\uc810": 2, "\uac12\uc778": 2, "\ud45c\ud604\ud574\ubcf4\uc790": 2, "\uc77c\uae4c": 2, "\uadf8\ub7ec\uae30": 2, "\uc774\uc5b4\uc57c": 2, "\ud3b8\ucc28\uc758": 2, "\uc81c\uacf1\uc758": 2, "\ud3c9\uade0\uc758": 2, "\uc81c\uacf1": 2, "\uc5f0\uacb0\ud55c": 2, "\ub9cc\ub4e4\uae30": 2, "\uc6b0\ubcc0\uc5d0": 2, "\ub354\ud574\ubcf4\uc790": 2, "\ub3c5\ub9bd\uc778": 2, "\uc2dd\uc5d0\ub294": 2, "\ub300\uc785\ud574\ub3c4": 2, "\ub098\uc624\uace0": 2, "\uc5f0\uacb0\ud558\ub294": 2, "\ub2e4\ub9ac\uac00": 2, "\ub85c\uc11c\uc758": 2, "\uc131\uc9c8": 2, "\uc99d\uba85\ud558\uae30": 2, "\ud45c\uc900\uc815\uaddc\ubd84\ud3ec\ub97c": 2, "\uc815\uaddc\ubd84\ud3ec": 2, "\ud3c9\uade0\uc740": 2, "\ub3c5\ub9bd\uc774\ubbc0\ub85c": 2, "t_0": 2, "\uc810": [2, 20], "abstrcat": [], "\ubcc0\ud658\uc744": [], "\ub2e4\ub8f8": [], "\uc0c1\uc774\ud55c": [], "\ubaa8\ub378\ub9c1\ud558\ubbc0\ub85c": [], "bidirect": [], "\uc784": 20, "\ubcc0\ud658\uc5d0": [], "\uc811\ubaa9\ud55c": [], "\ub17c\ubb38\uc784": [], "introduct": [], "i2i": [], "\ubcc0\ud658\uc5d0\uc11c": [], "fideltii": [], "\ub192\uc558\uc73c\ub098": [], "\ub5a8\uc5b4\uc9c4\ub2e4": [], "\uc548\ub098\uc624\uace0": [], "applic": [], "\ud1b5\ud569": [], "\uc2dc\ud0b4\uc73c\ub85c\uc368": [], "desir": [], "\ucd94\ub860\ud574\ub0b8\ub2e4\ub294": [], "\uba85\ub8cc\ud55c": [], "\uc774\ub860\uc801": [], "\uadfc\uac70\uac00": [], "\uc548\ub418\ubbc0\ub85c": [], "\uc5d0\uc11c\ub9cc": [], "\uc218\ud589\ud568\uc73c\ub85c\uc368": [], "\ud558\uae34": [], "\ud588\uc73c\ub098": [], "\uc8fc\uc5b4\uc9c0\ubbc0\ub85c": [], "\uc81c\uc2dc\ud558\uae30\uac00": [], "\ud798\ub4e6": [], "\uac00\uc18d\uc744": [], "\uc218\ud589\ud568": [], "duffus": [], "simplifi": [], "\ub4dc\ub7ec\ub098": [], "\uc54a\uc73c\ubbc0\ub85c": [], "\ub3c4\ub2ec\ud560": [], "\ubcf4\uc7a5\uc774": [], "\ub3d9\uc548\uc758": [], "start": [], "\uc774\uc5c8\ub2e4": [], "\ubc14\uafd4\ubcf4\uc790": [], "\ud5a5\ud574": [], "vqgan": [], "\uc601\uc0c1\uc758": [], "\u03b4_t": [], "\ub098\ud0c0\ub09c": [], "\uc0ac\uc6a9\ud558\uac8c": [], "\ub418\uba74": [], "\ubd84\uc0b0\uac12": [], "\ubd84\uc0b0\uac12\uc778": [], "\u03b4_": [], "\ucee4\uc9c0\uba74": [], "\ubd84\uc0b0\uac12\ub3c4": [], "\ucee4\uc9c0\ub294\ub370": [], "\ub2e4\ub8e8\uae30\uc5d0": [], "\uc774\uba74\uc11c": [], "\ub3c5\ub9bd\uc77c": [], "\uc815\uc218\uc758": [], "\ucd5c\ub313\uac12\uc778": [], "delta_t": [], "\uc2dc\uc791\ud558\ub294": [], "m_0": [], "\ubd84\uc0b0\uc740": [], "\ub05d\ub098\ub294": [], "m_t": [], "\uc9c0\uc810\uae4c\uc9c0\ub294": [], "\ud558\ub2e4\uac00": [], "\uc9c0\uc810\ubd80\ud130": [], "\uac10\uc18c": [], "\ubd84\uc0b0\uac12\uc5d0": [], "\uacb0\uc815": [], "\uc2a4\ucf00\uc77c\ub9c1\ud558\ub294": [], "\uc870\uc808": [], "\ub514\ud3f4\ud2b8": [], "\uc11c\ub294": [], "transit": [], "bb": [], "\uc54c\uc544\uc57c\ud568": [], "m_ty": [], "m_": [], "\uc4f0\ub294": [], "\uc633\uc74c": [], "\uc720\ub3c4\ub428": [], "\uc99d\uba85": [], "\ub300\uc785": [], "\uad6c\ud558\uba74": [], "\uc544": [], "\ud655\uc2e4\ud788": [], "\ub3c4\uba54\uc778\uc73c\ub85c\ubd80\ud130": [], "\ub3c4\uba54\uc778\uc73c\ub85c\uc758": [], "\uc815\uc758\ud558\ub294\uad6c\ub098": [], "\uc81c\uac70\ud574\ub098\uac10": [], "\ub460\uc73c\ub85c\uc368": [], "\uc790\uccb4\uc5d0\uc11c": [], "\ud3c9\uade0\uac12": [], "\ub178\uc774\uc988\uc758": [], "\ubb34\uc2dc\ud560": [], "\ubca0\uc774\uc988": [], "\uc774\ub860\uacfc": [], "\ub3c4\ucd9c": [], "\uc131\ub9bd\ub428\uc744": [], "\uc815\ub9ac\ub428": [], "\ud1b5\ud569\ud558\uace0reparameter": [], "mu_t": [], "\ubcc0\ud615\ud560": [], "\ud559\uc2b5\ub428": [], "\uc2dd\uc5d0": [], "\uba85\uc2dc\ud558\uae30": [], "\uba85\uc2dc\ub41c": [], "\ubcc0\ud615": [], "\ud574": [], "14": 20, "\uadfc\uc0ac\ud558\ub3c4\ub85d": [], "\ud559\uc2b5\ub418\uc5b4\uc57c\uaca0\ub124": [], "\ub2e8\uc21c\ud654\ub420": [], "\uac00\uc18d\uc2dc\ud0ac": [], "\uae38\uc774\ub97c": [], "\ub450\uc5c8\uc744": [], "varibal": [], "subset": [], "\uc815\uc758\ub428": [], "atent": [], "\ub450\uc5c8\uc74c": [], "\ud558\uc774\ud37c\ub9c8\ub77c\ubbf8\ud130": [], "\ud504\ub808\uc784\uc6cc\ud06c\ub294": [], "\uc774\ub8e8\uc5b4\uc9d0": [], "lpip": [], "\uc0dd\uc131\ubb3c\uc758": [], "\ub9c8\ub2e4\uc758": [], "\ud45c\uc900\ud3b8\ucc28\uc758": [], "\uad6c\ud568": [], "\uc2e4\ud5d8\ud568": [], "celebamask": [], "\uc8fc\uace0": 9, "edges2sho": [], "edges2handbag": [], "faces2com": [], "\ud3c9\uac00\ud588\ub2e4\uba74": [], "\uc2e4\ud5d8\uc5d0\uc11c\ub294": [], "\ud559\uc2b5\ud558\ubbc0\ub85c": [], "cyclegan": [], "\uc2a4\ucf00\uc77c\uc758": [], "\ub5a8\uc5b4\uc9d0": [], "drit": [], "\uc911\uc5d0\uc11c\ub294": [], "\ub0c8\uc73c\ub098": [], "\ubcc0\ud658\ub41c": [], "oversmooth": [], "\uacfc\ub294": [], "\uba40\uc5c8\uc74c": [], "cde": [], "\ubaa8\ub378\ub4e4\ubcf4\ub2e4\ub294": [], "rregular": [], "occlus": [], "\ub098\ud0c0\ub098\ub294\ub370": [], "\uc9c1\uc811\uc801\uc778": [], "\ubb38\uc81c\ub85c\ubd80\ud130": [], "\uc790\uc720\ub85c\uc6c0": [], "\ud2b9\uc131\uc73c\ub85c": [], "\uc0dd\uc131\ud574\ub0c4": [], "\uae30\ub85d\ud588\uc73c\uba70": [], "\uac80\uc99d": [], "\uc2e4\ud5d8\ud588\uc74c": [], "campar": [], "\uae30": [], "\ub85d\ud568": [], "factor": [], "\uc870\uae08\ub9cc": [], "\ub298\ub824\ub3c4": [], "conclus": [], "futur": [], "\uc5d0\ub3c4": [], "\uc801\uc6a9\ud574\ubcfc": [], "\uc608\uc815": [], "toward": 9, "10741": 9, "sehwan": 9, "e\ubcf4\ub2e4": 9, "\ud3c9\uac00\uac00": 9, "\uc6b0\uc218\ud558\ub2e4\uace0": 9, "powerful\ud55c": 9, "editing\uc774": 9, "natur": 9, "language\ub85c": 9, "realistic\ud55c": 9, "\uc0dd\uaca8\ub098\uace0": 9, "prompts\uc5d0": 9, "\uc815\ud655\ud788": 9, "\ub300\uc751\ud558\ub294": 9, "photorealistic\ud55c": 9, "\uc0dd\uc131\ud558\uae30\uc5d0\ub294": 9, "\uacaa\uace0": 9, "\uc911\uc2ec\uc73c\ub85c": 9, "\ub5a0\uc624\ub974\uba70": 9, "unconditional\ud55c": 9, "\ucc0d\uc5c8\ub2e4\uace0": 9, "conditional\ud55c": 9, "\uc774\ub8e8\uc5b4\uc84c\ub294\ub370": 9, "beat": 9, "synthesis\ub77c\ub294": 9, "noise\ud55c": 9, "class\ub97c": 9, "\ucd94\uac00\ud558\uc5ec": 9, "sampling\uacfc\uc815\uc5d0\uc11c": 9, "label\uc5d0": 9, "control\uc2dc\ud0a4\ub294": 9, "classifier\uc5c6\uc774": 9, "\uc18c\uac1c\ub418\uc5c8\ub2e4": 9, "synthesis\ub97c": 9, "guidance\ub77c\ub294": 9, "\uc81c\uc2dc\ud558\uba70": 9, "guidance\uc640": 9, "\ube44\uad50\ub97c": 9, "\uacb0\uacfc\uc801\uc73c\ub85c\ub294": 9, "shot\uc73c\ub85c": 9, "\uc0dd\uc131\ud558\ub294\ub370\uc5d0": 9, "\ubcf4\uc600\uc73c\ub098": 9, "photorealistc\ud55c": 9, "\uacaa\uc744": 9, "generation\ubfd0\ub9cc": 9, "\ud3b8\uc9d1\ud560": 9, "impainting\uae30\ub2a5\ub3c4": 9, "\uaef4\uc788\ub294": 9, "\uc5bc\ub9cc\ud07c\uc778\uc9c0": 9, "constant\ud55c": 9, "\uace0\uc815\uc2dc\ud0a8\ub2e4": 9, "process\uc640": 9, "alpha_t": 9, "epsilon\uc744": 9, "\ubc29\ud5a5\uc131\uc744": 9, "\ub764\ub2e4\ub77c\uace0": 9, "\uc8fc\uc7a5\ud55c\ub2e4": 9, "proof": 9, "proport": 9, "relationship": 9, "find": 9, "constant\uac12\uc73c\ub85c": 9, "step\ub9cc\uc73c\ub85c": 9, "\uc81c\uc2dc\ud55c\ub2e4": 9, "dharwial": 9, "image\uc0dd\uc131\uc744": 9, "\ub17c\ubb38\uc5d0\uc11c\uc758": 9, "guidance\uc774\ub2e4": 9, "image\ub85c\ubd80\ud130": 9, "\uc720\uc9c0\ud558\ub418": 9, "\uc18d\ud558\ub294\uc9c0": 9, "\uc124\uc815\ud55c\ub2e4": 9, "\uacfc\uc815\uc758": 9, "score\uc5d0\uac8c": 9, "\uc18c\uac1c\ub418\uc5c8\ub294\ub370": 9, "classifiy\ub97c": 9, "\ud574\uc57c\ud558\ubbc0\ub85c": 9, "\uc5c6\uace0": 9, "heavy\ud574\uc9c0\ub294": 9, "\ubc29\ubc95\uc5d0": 9, "\uac1c\uc120\uc810\uc744": 9, "\uc2dd\uc5d0\uc11c": 9, "model\ub9cc\uc73c\ub85c": 9, "clip\uc740": 9, "representation\uc744": 9, "\uc774\ub8e8\uc5b4\uc9c4": 9, "\uc9c4\ud589\uc2dc\ud0a8": 9, "pair\uc5d0": 9, "\uc720\uc0ac\ub3c4": 9, "\ucee4\uc9c0\ub3c4\ub85d": 9, "\uc791\uc544\uc9c0\ub3c4\ub85d": 9, "guidance\uc5d0\uc11c\ub294": 9, "guidance\uc5d0\uc11c": 9, "classifier\ub300\uc2e0\uc5d0": 9, "clip\ubaa8\ub378\uc744": 9, "\ubc29\uc2dd\ub3c4": 9, "classifier\ub300\uc2e0": 9, "\uad6c\ud55c": 9, "text\uac04\uc758": 9, "\uc720\uc0ac\ub3c4\ub97c": 9, "billion": 9, "\uc99d\uac00\uc2dc\ud0a4\ub294\ub370": 9, "base\ub85c": 9, "\uc218\ud589\ud574\uc57c\ud55c\ub2e4": 9, "k\uac1c\uc758": 9, "encoding\ud55c": 9, "input\uac12\uc73c\ub85c": 9, "\ub123\uc5b4\uc900\ub2e4": 9, "output\uc758": 9, "encoding\uc744": 9, "\uc5f0\uc0b0\ud558\uace0\uc790": 9, "projection\ud558\uc5ec": 9, "\ub354\ud55c": 9, "adain\uae30\ubc95\uc744": 9, "block\uc758": 9, "\ub3c4\ucd9c\ud55c\ub2e4": 9, "layer\ub294": 9, "block\ub4a4\uc5d0": 9, "\ubd99\ub294": 9, "e\uc640": 9, "architecture\ub85c\ub294": 9, "up\ub41c": 9, "2b": 9, "paremeters\ub97c": 9, "transformer\ub97c": 9, "upsampling\ud558\ub294": 9, "model\ub3c4": 9, "\ud559\uc2b5\uc2dc\ucf30\ub2e4\uace0": 9, "ddpm\uc5d0\uc11c\uc758": 9, "upsampler\uc640": 9, "\ube44\uc2b7\ud558\ub2e4\uace0": 9, "\uc9c4\ud589\ud588\uc744\ub54c\ub294": 9, "generation\uc5d0": 9, "generation\uc758": 9, "condition\uc5d0": 9, "sequence\ub97c": 9, "impainting\uc744": 9, "\uac70\uce58\uc9c0": 9, "\uc54a\uc558\ub2e4": [9, 20], "\uc54c\ub824\uc9c4": 9, "\uc601\uc5ed\uc5d0": 9, "\ub300\uccb4\ud558\ub294": 9, "\uc0ac\uc6a9\ud588\uae30\uc5d0": 9, "\uc5c6\ub2e4\ub294": 9, "tuning\uacfc\uc815\uc5d0\uc11c": 9, "example\uc758": 9, "\uc9c0\uc6b4\ub2e4\uc74c": 9, "\ub0a8\uc740": [9, 20], "\uc870\uac74": 9, "\uc815\ubcf4\ub85c\uc11c": 9, "\ucc44\ub110\uacfc": 9, "\uc785\ub825\ub418\ub3c4\ub85d": 9, "\uc124\uacc4\ud558\uc600\ub2e4": 9, "guidance\uc5d0": 9, "\uc801\ud569\ud558\uac8c": 9, "\ube44\uad50\ud588\uc74c\uc744": 9, "\uc5b8\uae09\ud588\ub2e4": 9, "models\ub97c": 9, "\uc0ac\uc6a9\ud588\uc74c\uc744": 9, "\ubc1d\ud78c\ub2e4": 9, "\uc5b8\uae09\ud588\ub4ef\uc774": 9, "\uc88b\uc558\ub2e4\uace0": 9, "precision\uacfc": 9, "\uba85\ud655\ud55c": 9, "\uad00\ucc30\ud558\uace0": 9, "\uc5b8\uae09\ud55c\ub2e4": 9, "\ucd5c\uc801\uc73c\ub85c": 9, "\uc218\ud589\ub418\uc5c8\uc73c\uba70": 9, "\ubc29\ubc95\uc784\uc744": 9, "\ud5a5\uc0c1\uc2dc\ud0ac": 9, "\ud3c9\uac00\uc5d0": 9, "caption\uacfc": 9, "\uc77c\uce58\uc2dc\ud0a4\ub294": 9, "\ub6f0\uc5b4\ub098\uc9c0": 9, "\uac00\uc124\uc744": 9, "\uc778\uac04": 9, "\ud3c9\uac00\uc790\ub97c": 9, "\uc9c4\ud589\ud558\uc600\uace0": 9, "\uc778\uac04\ub4e4\uc774": 9, "\uc758\uacac\uc744": 9, "guida": 9, "nce\uac00": 9, "\uc77c\uce58\ud558\ub294": 9, "\uc0dd\uc131\ud55c\ub2e4\uace0": 9, "\ud310\ub2e8\ud588\ub2e4": 9, "table1\uc740": 9, "unguid": 9, "evaluation\uc744": 9, "\uacb0\uacfc\uc774\ub2e4": 9, "\ud56d\ubaa9\uc5d0": 9, "\ubcf4\uc784\uc744": 9, "table2\ub294": 9, "glide\uc640": 9, "model\ub4e4\uc744": 9, "\ud45c\uc774\ub2e4": 9, "\uad6c\ud558\uc600\ub2e4": 9, "coco\uc5d0": 9, "\uacbd\ud5d8\uc774": 9, "result\ub97c": 9, "100\ubc88": 2, "md": [], "pic": [], "img_04": [], "alt": [], "bg": [], "primari": [], "mb": [], "350px": [], "w_1000": 2, "123": 2, "100\uac1c\uc758": 2, "\uc0d8\ud50c\ub9c1\ud55c": 2, "\ud3c9\uade0\uac12\uc774\uba70": [], "secretli": 20, "16203": 20, "\ud68d\ub4dd": 20, "\uac70\ub300": 20, "\ubaa8\ub378\ub85c\ubd80\ud130": 20, "\ub098\uc058\uc9c0": 20, "composit": 20, "reason": 20, "\ud6cc\ub96d": 20, "\uc0b4\ud3b4\ubcf4\uae30": 20, "\ub3d9\ubb3c": 20, "\uc2f6\ub2e4\uba74": 20, "\uc77c\ub2e8": 20, "\ub3d9\ubb3c\uc758": 20, "\ud074\ub798\uc2a4\ub97c": 20, "37\uac1c\uc758": 20, "\ud074\ub798\uc2a4\uac00": 20, "pet": 20, "\uce58\uc790": 20, "\ud638\ub791\uc774": 20, "\uadf8\ub7fc": 20, "\ud68d\ub4dd\ud560": 20, "\uc218\ud589\ud574\uc11c": 20, "\ud310\ubcc4\ud55c\ub2e4": 20, "\ud074\ub798\uc2a4\uc774\ub2e4": 20, "n_sampl": 20, "\uc9c0\uc815\ub41c": 20, "\uc0d8\ud50c\ub9c1\ud574": 20, "\ubca1\ud130\ub97c": 20, "\ud310\ubcc4\uc774": 20, "\ucd9c\ub825\ud55c\ub2e4": 20, "n_trial": 20, "\uc2dc\ub3c4\ud574\uc11c": 20, "\ud3c9\uade0\ub0bc": 20, "\ucd94\ub860\ud55c\ub2e4": 20, "\ud310\uc815\ud55c\ub2e4": 20, "\ucd94\ub860\ud560": 20, "\ud544\uc694\ud558\ub2e4": 20, "\uc218\ud589\ud558\uae30": 20, "\ud559\uc2b5\ud558\uc9c0\ub294": 20, "\uc54a\uc558\uc9c0\ub9cc": 20, "\uc815\uc758\ub418\uc5b4": 20, "\ub370\uc774\ud130\uc14b\uc73c\ub85c": 20, "\uad6c\ud558\uace0": 20, "\ucd94\ub860\ud558\uace0": 20, "\uc18c\ubaa8\ub428": 20, "\ub2e4\uc74c\uc758": 20, "\uc904\uc778\ub2e4": 20, "\uac78\ub7ec\ub0b8\ub2e4": 20, "\uc18c\uc218\uc758": 20, "\ub0a8\uc558\ub2e4\uba74": 20, "\uc774\uc81c\ub294": 20, "\ucd94\ub860\uc744": 20, "oxford": 20, "iiit": 20, "bash": 20, "python": 20, "eval_prob_adapt": 20, "split": 20, "to_keep": 20, "prompt_path": 20, "pets_prompt": 20, "csv": 20, "\uc774\ub807\uac8c\uae4c\uc9c0": 20, "\uc904\uc774\ub824\uace0": 20, "\uc2a4\ud06c\ub9bd\ud2b8": 20, "rtx": 20, "3090": 20, "\ub3cc\ub9ac\uba74": 20, "1\uc7a5": 20, "\ud558\ub294\ub370": 20, "18\ucd08": 20, "\ud558\ub824\uba74": 20, "\ucd08": 20, "all_nois": 20, "max_n_sampl": 20, "eval_error": 20, "ts": 20, "noise_idx": 20, "text_emb": 20, "text_embed_idx": 20, "float32": 20, "l2": 20, "pred_error": 20, "cpu": 20, "idx": 20, "inference_mod": 20, "tqdm": 20, "trang": 20, "batch_t": 20, "noised_lat": 20, "alphas_cumprod": 20, "t_input": 20, "float16": 20, "text_input": 20, "l1_loss": 20, "huber": 20, "huber_loss": 20, "notimplementederror": 20, "\ud074\ub798\uc2a4\uc5d0": 20, "\ucd94\ub860\ud558\uac8c": 20, "\ub420\ud150\ub370": 20, "\uc0ac\uc6a9\ud574\uc57c": 20, "\ubcc0\uc218\uc5d0": 20, "\ub2ec\ub77c\uc9c0\uae30": 20, "\ub2ec\ub77c\uc84c\ub2e4": 20, "\uc62c\ub77c\uac00\ub294\uc9c0": 20, "\uc2e4\ud5d8\ud574\ubcf4\uc558\ub2e4": 20, "\uc88b\uc558\ub2e4": 20, "\ucd94\ucd9c\ud574\ub0b4\ub294": 20, "\ubc29\ubc95\ub4e4\ubcf4\ub2e4": 20, "\ub6f0\uc5b4\ub0ac\ub2e4": 20, "\uc0dd\uc131\ud574": 20, "\uad6c\ucd95\ud558\uace0": 20, "\ud559\uc2b5\uc2dc\ucf1c\uc11c": 20, "\uc218\ud589\ud55c": 20, "\ucd94\ucd9c\ud574": 20, "\uc804\ub2ec\ud574\uc11c": 20, "\ubaa8\ub378\ubcf4\ub2e4\ub3c4": 20, "\ube44\ubcbc\ubcfc": 20, "\ub192\uc740\uc9c0": 20, "\ud55c\uc9c0": 20, "safe": 20, "\ud55c\uc9c0\uc5d0": 20, "filter": 20, "\uc774\uc640": 20, "cifar10": 20, "flower": 20, "stl10": 20, "\uc774\ub4e4": 20, "\uc644\uc804\ud55c": 20, "\ud544\ud130\ub9c1\uc774": 20, "\uc548\ub41c": 20, "\uc62c\ub77c\uac08": 20, "winoground": 20, "visio": 20, "linguist": 20, "\ub9e4\uce58\uc2dc\ud0a4\ub294": 20, "\uba85\uc0ac\uc808\ub07c\ub9ac": 20, "\ub4a4\ubc14\ub010": 20, "\ub3d9\uc0ac\ub07c\ub9ac": 20, "\ud615\uc6a9\uc0ac\ub07c\ub9ac": 20, "\ubd80\uc0ac\ub07c\ub9ac": 20, "\ud488\uc0ac\ub07c\ub9ac": 20, "\uc5ec\ub290": 20, "\ub9cc\uc744": 20, "\ud559\uc2b5\ud588\uc74c\uc5d0\ub3c4": 20, "\uc774\uc790": 20, "\ubcc0\ubaa8": 20, "dit": 20, "101": 20, "79": 20, "\uae30\ub85d\ud558\uba70": 20, "\ub2a5\uac00": 20, "\uc54a\uc558\uc74c\uc5d0\ub3c4": 20, "\ub2a5\uac00\ud588\ub2e4": 20, "\uacb9\uce58\ub294": 20, "\uc2e0\ub8b0\uad6c\uac04": 20, "\ucc0d\ud600": 20, "\ubcc4": 20, "\ubaa8\uc591\uc758": 20, "\ud68d\ub4dd\ud55c": 20, "\uae30\ub300\ub418\ub294": 20, "ood": 20, "\ud558\ub2e4": 20, "\ucd94\ucd9c\ud558\ub294": 20, "\uc6b0\uc218\ud568\uc744": 20, "\ub370\uc774\ud130\ub3c4": 20, "\ud559\uc2b5\uc2dc\ud0ac": 20, "\uac1c\uc120\ub420": 20, "\ud65c\uc6a9\ud588\uc74c": 20, "\ub6f0\uc5b4\ub0a0": 20, "\uc608\uc0c1": 20}, "objects": {}, "objtypes": {}, "objnames": {}, "titleterms": {"inform": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29], "synthet": [0, 18], "data": [0, 3, 8, 18], "stabl": [0, 28], "diffus": [0, 5, 8, 9, 11, 12, 14, 16, 18, 19, 23, 24, 26, 28, 30], "foliar": 0, "diseas": 0, "classif": [0, 18], "1": [0, 3, 5, 7, 8, 9, 11, 13, 14, 16, 18, 22, 23, 28], "\uac1c\uc694": 0, "2": [0, 3, 5, 7, 8, 9, 11, 13, 14, 15, 16, 18, 22, 23, 28], "baselin": [0, 21], "\uad6c\ucd95": 0, "3": [0, 3, 5, 7, 8, 9, 11, 13, 14, 15, 16, 18, 22, 23, 28], "fine": [0, 3, 5, 9, 18, 24], "tune": [0, 3, 5, 9, 18, 24], "4": [0, 3, 5, 7, 8, 9, 11, 13, 14, 15, 16, 18, 23, 28], "\uc131\ub2a5": 0, "\ube44\uad50": [0, 19], "5": [0, 5, 7, 11, 13, 15, 16, 18, 23], "discuss": [0, 5], "6": [0, 7, 11, 18, 23], "appendix": [0, 1, 24], "train": [1, 3, 4, 5, 8, 9, 15, 18, 21, 22, 25], "dreambooth": [1, 10, 24], "naver": 1, "webtoon": 1, "face": [1, 16], "dataset": 1, "introduct": [1, 3, 5, 7, 8, 9, 10, 11, 13, 14, 15, 16, 18, 19, 21, 22, 23, 24, 25, 26, 28, 29], "ablat": [1, 24, 26, 28], "studi": [1, 21, 24, 26, 28], "prior": [1, 22], "preserv": 1, "loss": [1, 8, 21], "neg": 1, "prompt": 1, "instanc": 1, "guidanc": [1, 3, 9, 23, 26], "scale": [1, 8, 11, 18], "cm3leon": 3, "abstract": [3, 5, 7, 9, 11, 13, 14, 15, 16, 19, 21, 23], "pretrain": [3, 26], "imag": [3, 4, 5, 9, 21, 23, 24, 30], "token": 3, "retriev": 3, "augment": 3, "object": [3, 8, 21], "function": [3, 8, 21], "model": [3, 5, 8, 9, 11, 12, 14, 15, 16, 18, 19, 23, 24, 26, 28], "text": [3, 5, 9, 19, 24, 30], "To": 3, "result": [3, 4, 9, 18, 21, 22, 23, 25, 26], "import": 3, "decod": [3, 8], "strategi": 3, "temperatur": 3, "sampl": [3, 7, 8, 11, 18], "topp": 3, "classifi": [3, 9, 23, 26], "free": [3, 9, 26], "cfg": 3, "contrast": 3, "topk": 3, "cd": 3, "k": 3, "quantit": 3, "evalu": [3, 26], "supervis": 3, "instruct": 3, "gener": [3, 5, 7, 14, 18, 21, 30], "guid": [3, 9, 19], "edit": [3, 5], "ground": 3, "spatial": 3, "caption": 3, "visual": [3, 22], "question": 3, "answer": 3, "task": 3, "controlnet": 4, "addit": [4, 13], "control": 4, "base": [4, 14], "condit": [4, 9, 15], "block": 4, "zero": 4, "convolut": 4, "implement": [4, 21, 28], "custom": 5, "relat": [5, 10, 14, 16, 18, 19, 21], "work": [5, 10, 14, 16, 18, 19, 21, 22, 23], "deep": 5, "transfer": [5, 21], "learn": [5, 22], "adapt": [5, 23, 28], "method": [5, 10, 13, 14, 16, 19, 28], "singl": 5, "concept": 5, "multipl": 5, "composit": 5, "detail": [5, 21, 22, 28], "experi": [5, 7, 8, 10, 12, 13, 14, 16, 24, 25, 28], "limit": [5, 19, 21, 22, 23, 24], "dalle2": 6, "ddim": [7, 23], "background": [7, 8, 9, 18, 22, 23], "ddpm": [7, 8, 11, 23], "variat": [7, 17], "infer": [7, 13], "For": 7, "non": 7, "markovian": 7, "forward": [7, 8], "process": [7, 8], "from": [7, 18, 21, 26], "code": 7, "q": 8, "mathbf": 8, "x": 8, "_t": 8, "_": 8, "t": [8, 13], "revers": 8, "p": 8, "l": 8, "denois": [8, 11], "encod": 8, "l_t": 8, "l_": 8, "l_0": 8, "simplifi": 8, "qualiti": [8, 18, 21], "hyperdreambooth": 10, "contribut": [10, 26], "prelimiari": 10, "lightweight": 10, "lidb": 10, "hypernetwork": 10, "rank": [10, 13], "relax": 10, "fast": 10, "finetun": 10, "comparison": [10, 11, 21, 28], "follow": 10, "up": 10, "conclus": [10, 16, 18, 22, 26], "i": 11, "probabilist": 11, "improv": [11, 15, 18, 23], "log": 11, "likelihood": 11, "improc": 11, "speed": 11, "gan": [11, 19, 23, 25], "size": 11, "latent": [12, 19], "lora": 13, "0": 13, "terminolog": 13, "convent": 13, "problem": 13, "statement": 13, "aren": 13, "exist": 13, "solut": 13, "good": 13, "enough": 13, "our": 13, "low": 13, "parameter": 13, "updat": 13, "matric": 13, "No": 13, "latenc": 13, "appli": 13, "transform": [13, 22], "empir": 13, "ia3": 13, "aa": 13, "\uc0ac\uc6a9\ubc95": 13, "refer": 13, "sdedit": 14, "score": [14, 18], "sde": 14, "smld": 14, "sdxl": 15, "micro": 15, "crop": 15, "paramet": [15, 18, 23], "multi": 15, "aspect": 15, "autoencod": 15, "put": 15, "everyth": 15, "togeth": 15, "refin": 15, "stage": [15, 22], "styo": 16, "styliz": 16, "framework": 16, "stylegan": 17, "map": 17, "network": 17, "style": [17, 21], "adain": 17, "stochast": 17, "mix": 17, "regular": 17, "\uc2e4\ud5d8": 17, "\uacb0\uacfc": [17, 19, 21], "imagenet": 18, "imagen": [18, 26, 27], "protocol": 18, "fid": 18, "IS": 18, "accuraci": 18, "differ": 18, "merg": 18, "real": 18, "textual": 19, "invers": 19, "cf": 19, "\uc774\ud574": 19, "\ubabb\ud568": 19, "ldm": 19, "embed": 19, "\uc131\ub2a5\ud3c9\uac00": 19, "dall": [19, 22], "e": [19, 22], "2\uc640": 19, "synthesi": [19, 23], "pseudo": 19, "word": 19, "\ub450": 19, "\uac1c": 19, "\uc0ac\uc6a9": 19, "bia": 19, "reduct": 19, "\uc815\ub7c9\ud3c9\uac00": 19, "\ud3c9\uac00": 19, "setup": 19, "\uc8fc\ubaa9\ud560": 19, "\uc810": 19, "\uc0ac\uc6a9\uc790\ud3c9\uac00": 19, "\ub9c8\ubb34\ub9ac": 19, "cyclegan": 21, "\ucc38\uace0": 21, "translation\uc774\ub780": 21, "mode": 21, "collapse\ub780": 21, "\uad00\ub828": 21, "\uc5f0\uad6c": 21, "formul": 21, "adversari": 21, "cycl": 21, "consist": 21, "full": 21, "\uc804\uccb4": 21, "\ubaa9\uc801\uc2dd": 21, "least": 21, "squar": 21, "\ucd94\uac00": 21, "\uc124\uba85": 21, "\uae30\ud0c0": 21, "against": 21, "human": [21, 26], "fcn": 21, "\ub4f1": 21, "analysi": 21, "reconstruct": 21, "pair": 21, "dataset\uc5d0": 21, "\ub300\ud55c": 21, "applic": [21, 24, 28], "collect": 21, "transfigur": 21, "season": 21, "photo": 21, "paint": 21, "enhanc": 21, "gati": 21, "discusss": 21, "gpt": 22, "vq": 22, "vae": [22, 29], "methodolog": [22, 26], "previou": 22, "overview": [22, 28], "an": 22, "autoregress": 22, "pipelin": 22, "\uc608\uc2dc": 22, "equat": 22, "\ud559\uc2b5\uacfc\uc815": 22, "codebook": 22, "beat": 23, "architectur": 23, "group": 23, "normal": 23, "algorithm": 23, "7": 23, "impact": 23, "s": 23, "8": 23, "9": 23, "futur": 23, "procedur": 25, "theoret": 25, "summari": [25, 29], "t5": 26, "xxl": 26, "cascad": 26, "larg": 26, "weight": 26, "sampler": 26, "static": 26, "threshold": 26, "dynam": 26, "super": 26, "resolut": 26, "drawbench": 26, "qualit": 26, "tabl": 26, "editor": 27, "t2i": 28, "preliminari": 28, "design": 28, "optim": 28, "intract": 29, "reparameter": 29, "trick": 29, "pseudolab": 30, "feat": 30, "bbdm": 2, "glide": 9, "clip": 9, "inpaint": 9, "nois": 9, "ydmszc": 20, "\ubc1c\ud45c": 20, "\uc790\ub8cc": 20}, "envversion": {"sphinx.domains.c": 2, "sphinx.domains.changeset": 1, "sphinx.domains.citation": 1, "sphinx.domains.cpp": 6, "sphinx.domains.index": 1, "sphinx.domains.javascript": 2, "sphinx.domains.math": 2, "sphinx.domains.python": 3, "sphinx.domains.rst": 2, "sphinx.domains.std": 2, "sphinx.ext.intersphinx": 1, "sphinx": 56}})
\ No newline at end of file