Daffodil International University

Faculty of Science and Information Technology => Game Design => MCT => Game Engine. => Topic started by: S. M. Monowar Kayser on April 15, 2026, 12:57:26 AM

Title: Can Generative Models Become Game Engines?
Post by: S. M. Monowar Kayser on April 15, 2026, 12:57:26 AM: If the first wave of AI in game engines concerns accelerating subsystems, the second wave concerns a far more radical possibility: replacing substantial portions of the engine itself with learned generative simulation. Valevski et al. (2024) put this idea into concrete form by using a diffusion model to simulate interactive DOOM rollouts in real time, effectively treating the game engine as a learned next-state generator conditioned on actions and prior frames. This differs sharply from traditional engines, whose value lies in explicit state, deterministic update rules, and debuggable modules for collision, animation, and logic. The attraction of neural engines is obvious: they promise rapid content prototyping, compact world simulation, and potentially new forms of stylized or data-driven interaction. However, the conceptual cost is high because classic engines provide inspectable state transitions, whereas generative rollouts can obscure errors behind visually convincing outputs. The result is a new form of brittleness in which gameplay may appear coherent while underlying causal structure drifts, making networking, replay determinism, AI training, and formal testing especially difficult. Newbury et al. (2024) note more broadly that differentiable and learned simulators must negotiate trade-offs among speed, accuracy, and gradient usefulness, and those trade-offs become even more acute in games where player experience depends on stable systemic semantics rather than merely approximate physical realism. The research gap, then, is not whether neural engines can render plausible interaction, but whether they can support the engineering invariants that commercial engines require, including synchronization, authority, tooling, and state introspection. Future research should therefore treat neural engines as candidates for selective replacement, preserving symbolic authority over gameplay-critical systems while using learned modules for perceptual compression, animation, or approximate simulation where bounded error is acceptable (Valevski et al., 2024; Newbury et al., 2024; Matthey et al., 2024).

References
1. Valevski, D., Leviathan, Y., Arar, M., & Fruchter, S. (2024). Diffusion models are real-time game engines. arXiv preprint arXiv:2408.14837.
2. Newbury, R., Collins, J., He, K., Pan, J., Posner, I., Howard, D., & Cosgun, A. (2024). A review of differentiable simulators. IEEE Access, 12, 97581-97604.
3. Matthey, L., et al. (2024). Scalable instructable multiworld agent (SIMA): A generalist AI agent for 3D virtual environments. Google DeepMind technical report.

S. M. Monowar Kayser
Lecturer, Department of Multimedia & Creative Technology (MCT)
Faculty of Science & Information Technology
Daffodil International University (DIU)
Daffodil Smart City, Savar, Dhaka, Bangladesh
Visit: https://monowarkayser.com/ (https://monowarkayser.com/)