Generative AI
Exploring techniques in generative artificial intelligence
Generative AI can be applied to various tasks, including (most common):
| Task | Examples |
|---|---|
| Text to Image | Models: DALL-E, Stable Diffusion, Midjourney; API Providers: OpenAI, Anthropic, Stability AI |
| Speech to Text (S2T) | - |
| Text to Speech (T2S) | - |
| Video Generation | - |
Text to Image
Image generation, a key area in generative AI, often uses diffusion models. These models employ supervised learning to progressively reduce noise in images.
Diffusion Model Process
| Stage | Input | Output |
|---|---|---|
| Training | Noisy image (A) | Less noisy image (B) |
| Generation Step | Input | Output |
|---|---|---|
| 1 | Pure noise | Less noisy image |
| 2 | Less noisy image | Even less noisy image |
| 3 | Minimally noisy image | Clean picture |
This iterative process gradually transforms random noise into a coherent image based on the text prompt.
