GPT-4o + Vidu

GPT-4o for keyframes, Vidu for inbetweens.

A simple, elegant caption looks good between video rows, after each row, or doesn't have to be there at all.