Softonic
AI

This is Genie 2, the new model from Google DeepMind capable of generating interactive 3D worlds

AI is capable of generating interactive scenes in real time from a single image or text description

This is Genie 2, the new model from Google DeepMind capable of generating interactive 3D worlds
Pedro Domínguez

Pedro Domínguez

  • December 5, 2024
  • Updated: December 6, 2024 at 4:36 PM

DeepMind, the artificial intelligence research division of Google, has unveiled Genie 2, an innovative model capable of creating an apparently infinite variety of playable three-dimensional worlds. This model, which follows Genie, launched earlier this year, stands out for generating interactive scenes in real-time from a single image or text description, such as “a humanoid robot in Ancient Egypt.” Although it is reminiscent of developments by companies like World Labs and Decart, Genie 2 has features that make it unique.

The DeepMind proposal promises an immense diversity of 3D worlds rich in details, where users can perform actions like jumping or swimming with keyboard and mouse. Thanks to its training with videos, Genie 2 can simulate object interactions, animations, lighting, physics, and even the behavior of non-playable characters (NPCs). Many of these worlds resemble triple-A video games, which raises serious questions about whether its training included sessions from popular titles. For now, DeepMind has avoided revealing details about how it collected the data.

The model has also reignited the debate over intellectual property. As a subsidiary of Google, DeepMind can access YouTube videos, and the company itself has stated that its terms of service allow the use of these materials to train AI models.

Despite its limitations, such as simulations lasting between 10 and 60 seconds, Genie 2 is more consistent than other similar models. For example, it avoids common issues of visual artifacts and scene forgetting, something that affects competitors like Oasis, from Decart. Additionally, it can remember and render objects that had gone out of the field of view.

DeepMind does not see this model as a tool for traditional games, but as a creative and research resource. According to the company, “Genie 2 turns conceptual art into complete interactive environments” and facilitates the evaluation of AI agents in completely new tasks, opening new possibilities for prototypes and experimentation.

Gemini DOWNLOAD

Latest Articles

Loading next article