CV_tests | Notion

Preview

1e-4, GD (1 data per batch), Color/Depth Estimate, Full Image for Input (복원 task)
U-Net
VAE
ViT (결과 망해서 삭제)

RIR 정보의 효과적인 Injection Method Study는 모델 비교 다음 단계이다. (Now: Non-regardless)

Summary

U-Net → based model로 사용
다만 둘 다 가끔 전혀 다른 사진을 출력할 때가 있음

exp1 | U-Net: Excellent

loss: 69 → 39

	Small DT	Train DT = Train	Val DT = Val	lr = 1e-4

	Audio	Image Model	Image	Others
	2D, STFT	U-Net	Full Image	Batch Size = 1

exp2 | VAE: Good

vae.py

loss: 162 → 63