Generative AI (genAI) for visual content is evolving quickly from content productivity to experience delivery. Architects must design stacks that balance low-latency interaction with cost-efficient batch scaling while embedding provenance and governance throughout the pipeline. This report provides a reference architecture and solution patterns to help architects navigate the trade-offs among latency, cost, and brand safety.