Generative Model
We want to sample from , but how? We usually have four ways:
| Paradigm | Idea |
| Directly model | Explicit density: Or tractable forms: |
| Latent Variable | Marginalization: Variational bound: |
| Implicit Generation | Pushforward measure: No explicit density |
| Score-Based | Score matching: Learn Reverse diffusion process |
1. Continuous Generative Models
-
Discrete Generative Models
-
Pros:
- Efficient Inference: They can be very fast, often requiring only one pass of the transformer to generate a sequence.
-
Cons:
- Quality Issues: Discrete tokens suffer from a "quality issue" due to high data compression, which results in the loss of fine details. Reconstructed images can look significantly different up close from the original.
- Fundamental Compression Flaw: To be manageable, a sequence of discrete tokens must compress information far more than a continuous representation, which is a fundamental limitation.
-
-
Continuous Generative Models
-
Pros:
- High-Quality Samples: They generally offer much better reconstruction than discrete models.
-
Cons:
- Speed Issues: Continuous models, particularly diffusion, have a "speed issue" because they require many iterative steps to generate a sample. This multi-pass process makes inference slow and computationally demanding.
-
Do inference-time scaling benefit generative pre-training algorithms? Maybe.