I'm more interested in how this "cross attention" part works. Being able to comb...

samsartor · on Aug 21, 2023

Cross attention is not really a way to "combine multiple AI models" but there are many ways to do that, and actually diffusion models are really good at being combined with stuff. Especially thanks to tricks like score distillation (see dreamfusion3d.github.io). But it isn't anything like AGI because the AI is not inventing the combinations itself, and even if you could, there is no clear way to make it self-directed. These are still processes that require lots of programmers being very clever.

Edit: typo