This is a working prototype that runs two models in sequence. A short written prompt becomes a single concept image of a house on its land, and that image is then lifted into a concept-stage 3D mesh you can orbit and zoom. The point is the whole path, words to image to form, rather than any one step on its own.
Two models do the work, each kept to one job. The first is Google's Gemini image model, which turns the prompt into a clean, centred concept render on a plain background. The second is TRELLIS, an image-to-3D model, which lifts that render into a mesh loaded straight into the viewer. The prompt carries the intent, the image model carries the look, and the 3D model carries the geometry.
This is concept-stage only. The result is a visual mesh, not measured geometry: no real walls, rooms, or dimensions, and the ground under it is generated rather than a real site. It is useful for feeling a house on its land and turning it over, not for anything that needs to be accurate. Treat it as a starting sketch a person still has to take forward.
A quick way to see a form, not a measured model. The further you push it toward documentation, the more a professional has to own.
Generation runs through the site, calling fal.ai for both the image and the 3D step, so it needs a key configured in the server environment. To keep the demo usable for everyone, each visitor can run it a limited number of times per day. With no key set, the tool still loads, and the generate steps return a clear message instead of a result.