분류 전체보기
- Qualifications
  - Information-Device-Operatio..
- Copilot
- Blue-Whale
- Everyday-occurrence
- IT-Information
  - chatGPT
  - Blog-operation
  - Blockchain
  - Electricity
  - Issues
  - Terms
  - NFT
  - Tech
  - Automation
  - Information-protection
- Investment
- CES
- SMR
- Movie
- Cryptocurrency
- Exhibition
- KIA
- Korean-history-proficiency-..
- AFK
- Japan
- Entertainments
- T1-skin
- Qoo10
- Economy
- RealEstate
- Marble
- College-entrance-exam
- Apartment-sales
- Breakdance
- Malaria
- Tesla
- Passport
- S-1
- Finance
- Coding
  - vercel
  - Rust
- CSS
- Life
  - Astronomy
  - Game
  - Art
  - Save-Recipe
  - Car-self-maintenance
  - Volkswagen-golf
  - Creation
  - Law
  - Design
  - Story
  - Daily
  - Daily-Health
  - Dental-health
- Advertising-phone-number
- Post
- Life-Insight
- Book-review
- Tax
- Human-History
- Product-User-Manual
- Product-review
- AppsScript
- WordPress
- Company-Life-Document-Form
- Trading
- Music
- YouTube
- Project
  - AI-Music-Lab
- Adsense
- Bitcoin
- Geopolitics
- Google-Policy-Update
- Fed-Interest-Rate
- Travel
- AppleWatch
- Motivation
- Emotion
- Mental-Health
- Relationships
- IT
  - Python
  - HTML
  - Dividend-Stock
  - Postman
  - Excel
  - GoogleSheet
  - Facebook
  - Php
  - Window
  - GoogleDocs
  - PyPy
  - Database
  - AWS
  - Artificial-Intelligence
  - Open-Source
  - Chrome
  - Cloud
  - Domain
  - Express
  - SSL
  - MSYS2
  - LLM
- Digital-Tools
- Coffee
- NJZ
- Earth-Science
- Musicals
- Sport
  - AG
  - Olympic
  - Premier12
  - NBA
- Mental-Care
- Photo
- Retirement&Finance
- Botany
- International-Issues
- Food
- Artistic-Inspiration
- Fit
- Hacking

ABOUT ME

-

페이스북
트위터
인스타그램

Today: -

Yesterday: -

Total: -

GoldenKey GoldenKey

컨텐츠 검색 블로그 내 검색

VideoLDM - Latent Diffusion Model

IT-Information/Issues 2023. 4. 23. 13:43

VideoLDM - Latent Diffusion Model을 이용한 고해상도 Text-to-Video

LDM은 압축된 저차원 Latent 공간에서 Diffusion Model을 학습해서 많은 컴퓨팅 리소스 없이도 고해상 이미지 합성이 가능하다. 이 LDM을 고해상 비디오에 적용한 NVidia의 논문을 기반으로 하고 있다.

LDM을 이미지 전용으로 사전학습하고, Temporal Dimension을 도입해 인코딩된 이미지 시퀀스를 미제 조정해서 이미지 생성기를 비디오 생성기로 전환한다.

확산모델 업샘플러를 얼라인하여 temporally consistent한 초고해상 비디오 모델로 바꿔준다.

https://research.nvidia.com/labs/toronto-ai/VideoLDM/

https://research.nvidia.com/labs/toronto-ai/VideoLDM/

Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models

Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models

research.nvidia.com

저작자표시
관련글 관련글 더보기

Popular

Copyright 2025

티스토리툴바