Return to Article Details Dynamic Resource Optimization for Generative AI Workloads: A Simulation-Driven Approach to Mitigating Cold-Start Latency and Cost Inefficiency in Cloud Environments Download Download PDF