My name is Deheng Zhang, I am currently Researcher atINSAIT supervised by Prof. Luc Van Gool and Dr. Danda Paudel. Previously, I finished my MSc at ETH Zürich where I worked on 3D Vision/Graphics research projects in Disney Research (Studio) Zürich overseen by Prof. Dr. Markus Gross and VLG overseen by Prof. Dr. Siyu Tang. Before that, I finished Bachelor's degree at CityU of Hong Kong.
My research lies at the intersection of vision-language modeling, spatial AI, and controllable visual representations. I am particularly interested in developing models that can jointly reason about language and 3D environments, while enabling fine-grained, controllable generation and editing of both 2D and 3D scene representations.




