Generative Adversarial Networks

생성적 적대 신경망(生成的敵對神經網, 영어: generative adversarial network; GAN)은 비지도 학습에 사용되는 인공지능 알고리즘으로, 제로섬 게임 틀 안에서 서로 경쟁하는 두 개의 신경 네트워크 시스템에 의해 구현된다. 이 개념은 2014년에 이안 굿펠로우(Ian. j. Goodfellow)에 의해 발표되었다.

Conditional GAN(Conditional Generative Adversarial Net, CGAN), Cycle GAN 등 여러 종류가 존재한다.

Abstractor

We propose a new framework for estimating generative models via an adversarial process, in which we simultaneously train two models: a generative model G that captures the data distribution, and a discriminative model D that estimates the probability that a sample came from the training data rather than G. The training procedure for G is to maximize the probability of D making a mistake. This framework corresponds to a minimax two-player game. In the space of arbitrary functions G and D, a unique solution exists, with G recovering the training data distribution and D equal to 1/2 everywhere. In the case where G and D are defined by multilayer perceptrons, the entire system can be trained with backpropagation. There is no need for any Markov chains or unrolled approximate inference networks during either training or generation of samples. Experiments demonstrate the potential of the framework through qualitative and quantitative evaluation of the generated samples.

Documentation

Generative Adversarial Networks: https://arxiv.org/abs/1406.2661

Progressive Growing of GANs for Improved Quality, Stability, and Variation: https://arxiv.org/pdf/1710.10196.pdf; 1710.10196.pdf; https://research.nvidia.com/publication/2017-10_Progressive-Growing-of; https://github.com/tkarras/progressive_growing_of_gans; https://www.reddit.com/r/MachineLearning/comments/9ybstu/r_tdls_progressive_growing_of_gans/

파생 기술

Audio & Music

GANSynth - Making music with GANs

GANSynth is an algorithm for synthesizing audio with generative adversarial networks. The details can be found in the ICLR 2019 Paper. It achieves better audio quality than a standard WaveNet baselines on the NSynth Dataset, and synthesizes audio thousands of times faster.

WaveGAN - Adversarial Audio Synthesis - https://arxiv.org/abs/1802.04208
MusicGAN - https://github.com/HStuart18/tfworldhackathon
Voicebox - Meta, 음성용 Generative AI 모델 Voicebox 공개
AudioCraft - Meta, 오디오를 위한 생성형 AI 오픈소스로 공개

Image

pix2pix - Image-to-Image Translation with Conditional Adversarial Networks
CycleGAN - Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks
BicycleGAN - Toward Multimodal Image-to-Image Translation
Dark Model Adaptation
StyleGAN - A Style-Based Generator Architecture for Generative Adversarial Networks
StyleGAN2
StackGAN - Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks
3D ShapeNet - 이라고 부르는 데이터셋을 사용해 신경망을 훈련해 진짜 같은 형상을 생성하게 해 본다.
DCGAN - 애니메이션 캐릭터를 생성하게 해 본다.
SRGAN - 신경망을 구현해 고해상도 이미지를 생성하게 해 본다.
Age-cGAN - 사람이 나이를 먹어도 신경망이 그 사람의 얼굴을 잘 알아볼 수 있게 한다.
A Neural Algorithm of Artistic Style
Noise2Noise - Learning Image Restoration without Clean Data
SinGAN - Learning a Generative Model from a Single Natural Image
WGAN
NVIDIA Canvas - AI로 사실적인 그림 그리기
DragGAN - Implementation of DragGAN: Interactive Point-based Manipulation on the Generative Image Manifold

Video

Gen-2 - Runway, 비디오 생성형 AI "Gen-2" 공개
Goku - ByteDance의 Flow 기반 비디오 생성 모델

Colorization

Github - aleju/colorizer

This project uses GANs (generative adversarial networks) to add color to black and white images.

Apartment Building Design

ArchiGAN - a Generative Stack for Apartment Building Design: https://devblogs.nvidia.com/archigan-generative-stack-apartment-building-design/

Simulation

Genesis - A Generative and Universal Physics Engine for Robotics and Beyond

UI/GUI

v0 - Vercel이 만든 생성형 UI 시스템

검색 엔진 (Search Engine)

phind - 개발자를 위한 Generative AI 검색 엔진

Game

스몰빌 (Smallville) - AI 에이전트가 살아가는 가상 마을

Projects

라이브러리 목록:

TensorFlow-GAN (TF-GAN)
Torch-GAN (torchgan)
Keras-GAN
Mimicry
IBM-GAN toolkit
PaddleGAN
GAN Lab
pygan
VeGANs
StudioGAN
StyleSwin - [CVPR 2022] StyleSwin: Transformer-based GAN for High-resolution Image Generation
Karlo - 카카오브레인, 이미지 생성모델

당신이 정말로 알아야 할 6가지 GAN 아키텍처 (2022년 3월 21일)

6 GAN Architectures You Really Should Know - neptune.ai -

가장 인기 있는 GAN 공식은 다음과 같습니다.

한 도메인에서 다른 도메인으로 이미지 변환(CycleGAN)
텍스트 설명에서 이미지 생성 (text-to-image)
고해상도 이미지(ProgressiveGAN) 등을 생성합니다.

이 기사에서는 가장 인기 있는 GAN 아키텍처, 특히 GAN (Generative Adversarial Networks)에 대한 다양한 범위를 다루기 위해 알아야 할 6가지 아키텍처에 대해 설명합니다.

CycleGAN
StyleGAN
pixelRNN
text-2-image
DiscoGAN
lsGAN

Favorite site

논문반/논문세미나 - NIPS2016 Tutorials - Genertive Adeversarial Networks
Advanced GANs ¹
Generating custom photo-realistic faces using AI (AI를 사용하여 사실적인 사실적인 얼굴 생성; 새로운 TL-GAN 모델을 사용하여 제어 된 이미지 합성 및 편집)
SAMSUNG SDS - 새로운 인공지능 기술 GAN ① - 스스로 학습하는 인공지능 (기초 이론)
SAMSUNG SDS - 새로운 인공지능 기술 GAN ② - GAN의 개념과 이해 (기초 이론)
[추천] GAN — Some cool applications of GANs. ² (GAN으로 할 수 있는 어플리케이션 목록들 소개)
Github - The GAN Zoo - A list of all named GANs!
Github - PyTorch-GAN - PyTorch implementations of Generative Adversarial Networks
정확한 AI 모델을 만드는 ‘합성 데이터’ 해부하기 | NVIDIA Blog
List of synthetic data startups and companies — 2021 (Synthetic data)
맥킨지(McKinsey), 생성형 AI의 경제적 잠재력에 대한 보고서 발간 (66p/PDF | GeekNews
- [원문] Economic potential of generative AI | McKinsey
- PDF Download Link

Article

생성형 AI의 불편한 골짜기에 대한 고찰 | GeekNews
- [원문] Reckoning with generative AI’s uncanny valley | Thoughtworks
- 기존의 생성형 AI에 대한 멘탈 모델과 개념은 무시할 수 있는 부차적인 문제가 아니라 근본적인 디자인 문제임
- 생성형 AI의 불편한 골짜기는 고쳐야 할 문제가 아니라 우리가 이 기술에 대해 정말로 원하고 기대하는 바를 재평가할 기회임

References

Advanced_GANs_-_ratsgos_blog.pdf ↩
GAN_-Some_cool_applications_of_GANs-Jonathan_Hui-_Medium.pdf ↩