Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Argmax Flows and Multinomial Diffusion: Learning Categorical Distributions
36
Zitationen
5
Autoren
2021
Jahr
Abstract
Generative flows and diffusion models have been predominantly trained on ordinal data, for example natural images. This paper introduces two extensions of flows and diffusion for categorical data such as language or image segmentation: Argmax Flows and Multinomial Diffusion. Argmax Flows are defined by a composition of a continuous distribution (such as a normalizing flow), and an argmax function. To optimize this model, we learn a probabilistic inverse for the argmax that lifts the categorical data to a continuous space. Multinomial Diffusion gradually adds categorical noise in a diffusion process, for which the generative denoising process is learned. We demonstrate that our method outperforms existing dequantization approaches on text modelling and modelling on image segmentation maps in log-likelihood.
Ähnliche Arbeiten
Deep learning
2015 · 80.583 Zit.
Learning Multiple Layers of Features from Tiny Images
2024 · 25.472 Zit.
GAN(Generative Adversarial Nets)
2017 · 21.794 Zit.
Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks
2017 · 21.750 Zit.
SSD: Single Shot MultiBox Detector
2016 · 20.702 Zit.