Q-NeRF: Neural Radiance Fields on a Simulated Gate-based Quantum Computer


¹University of Udine  ²University of Naples Federico II  ³MPI for Informatics


arXiv 2026



Novel-view synthesis results on the Blender dataset. Q-NeRF (ours) produces high-quality renderings with accurate material representation and fine details, while using significantly fewer parameters than classical NeRF models. Full and Dual-Branch Q-NeRF represent a trade-off between quantum encoding complexity and performance.

Abstract

Recently, Quantum Visual Fields (QVFs) have shown promising improvements in model compactness and convergence speed for learning 2D images. Meanwhile, novel-view synthesis has seen major advances with Neural Radiance Fields (NeRFs), where models learn a compact representation from 2D images to render 3D scenes, albeit at the cost of large models and intensive training. In this work, we extend the approach of QVFs by introducing Q-NeRF, the first hybrid quantum-classical model designed for novel-view synthesis from 2D images. Q-NeRF leverages parameterized quantum circuits to encode spatial and view-dependent information via quantum superposition and entanglement, resulting in more compact models. We present two architectural variants. Full Q-NeRF maximally exploits all quantum amplitudes to enhance representational capabilities. In contrast, Dual-Branch Q-NeRF introduces a task-informed inductive bias by branching spatial and view-dependent quantum state preparations, drastically reducing the complexity of this operation and ensuring scalability and potential hardware compatibility. Our experiments demonstrate that—when trained on images of reduced resolution—Q-NeRF matches or outperforms classical NeRF baselines while using less than half the number of parameters. These results suggest that Quantum Machine Learning can serve as a competitive alternative for continuous signal representation in high-level tasks in Computer Vision, such as 3D representation learning.


Quantum Neural Radiance Fields (Q-NeRF)


In this work, we propose Q-NeRF, a novel hybrid quantum-classical architecture for neural radiance fields. We design two variants of Q-NeRF: Full Q-NeRF (left) and Dual-Branch Q-NeRF (right). Both architectures leverage parameterized quantum circuits to encode 3D spatial coordinates and 2D view directions into quantum states, which are then processed to predict color and density values for novel-view synthesis. Full Q-NeRF utilizes a single amplitude embedding step to encode both spatial and view-dependent information, maximizing the use of quantum amplitudes for enhanced representational capacity. In contrast, Dual-Branch Q-NeRF employs separate quantum encodings for spatial coordinates and view directions, introducing an inductive bias that simplifies state preparation and improves scalability. This design choice allows Dual-Branch Q-NeRF to maintain competitive performance while exponentially reducing quantum resource requirements.


Q-NeRF requires significantly fewer parameters than classical NeRF models while achieving comparable or superior performance in novel-view synthesis tasks. The trade-off between Full and Dual-Branch Q-NeRF architectures allows for flexibility in balancing quantum resource demands and model accuracy, making Q-NeRF a promising approach for efficient 3D scene representation and rendering. The reduced complexity of quantum encoding also makes Dual-Branch Q-NeRF better suited for deployment on near-term quantum hardware, thanks to its smaller circuit depth and higher noise tolerance on real devices.

MLP-Based Quantum Embedding

In the Full embedding, the features obtained after positional encoding are mapped by an MLP to a real-valued vector \(\mathcal{M}(x) \in \mathbb{R}^{2^n}\), which is normalized and directly amplitude-encoded into an \(n\)-qubit quantum state:

\[ \lvert \phi(x) \rangle = \sum_{i=0}^{2^n-1} \alpha_i(x)\,\lvert i \rangle, \qquad \sum_i \lvert \alpha_i(x) \rvert^2 = 1. \]
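The normalization step above can be sketched in a few lines of NumPy. This is a minimal illustration, not the paper's implementation: `amplitude_encode` and the random stand-in for the MLP output \(\mathcal{M}(x)\) are our own names, and a real pipeline would pass the normalized vector to a state-preparation routine of a quantum simulator.

```python
import numpy as np

def amplitude_encode(features: np.ndarray) -> np.ndarray:
    """L2-normalize a length-2^n feature vector so it forms a valid
    amplitude vector for an n-qubit state (sum of |alpha_i|^2 = 1)."""
    norm = np.linalg.norm(features)
    if norm == 0:
        raise ValueError("cannot amplitude-encode the zero vector")
    return features / norm

# toy example: n = 3 qubits -> 2^3 = 8 amplitudes
m_x = np.random.default_rng(0).normal(size=8)  # stand-in for the MLP output M(x)
state = amplitude_encode(m_x)
print(np.isclose(np.sum(state**2), 1.0))  # normalization constraint holds -> True
```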

In the Dual-Branch embedding, positional and view-dependent features are treated separately, mirroring the separation used in classical NeRFs. Two MLPs produce amplitude vectors of sizes \(2^{n_p}\) and \(2^{n_v}\), which are independently encoded and combined into a tensor-product state:

\[ \lvert \phi(x) \rangle = \lvert \phi_p(x_p) \rangle \otimes \lvert \phi_v(x_v) \rangle . \]

This construction uses \(n_p+n_v=n\) qubits while requiring only \(2^{n_p}+2^{n_v}\) amplitudes instead of \(2^n\), resulting in an exponential reduction of the state preparation complexity.
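The amplitude count of this tensor-product construction can be checked numerically: the Kronecker product of the two branch states yields the full \(2^{n_p+n_v}\)-dimensional state, even though only \(2^{n_p}+2^{n_v}\) amplitudes were ever prepared. A hedged NumPy sketch (branch sizes chosen arbitrarily for illustration):

```python
import numpy as np

def encode(v: np.ndarray) -> np.ndarray:
    """L2-normalize a feature vector into valid quantum amplitudes."""
    return v / np.linalg.norm(v)

rng = np.random.default_rng(1)
n_p, n_v = 4, 2                              # qubits for the position / view branches
phi_p = encode(rng.normal(size=2**n_p))      # 16 prepared amplitudes
phi_v = encode(rng.normal(size=2**n_v))      # 4 prepared amplitudes

# tensor-product state on n_p + n_v qubits: 64 amplitudes from only 16 + 4
phi = np.kron(phi_p, phi_v)
print(phi.shape)                             # (64,)
print(np.isclose(np.linalg.norm(phi), 1.0))  # product of unit vectors stays normalized
```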


Output Scaling


Visualisation of the effect of output scaling for a Full Q-NeRF on the drums scene.

Variational quantum circuits exhibit exponential concentration, where output distributions collapse around the mean as the number of qubits grows. To mitigate this, each channel output \(\mathcal{O}_c(\mathbf{x})\) is multiplied by a learnable scaling factor \(\alpha_c\) and clipped to \([0,1]\):

\[ \text{output}_c = \text{clip}\big(\alpha_c \mathcal{O}_c(\mathbf{x}), 0, 1\big) \]

This scaling widens the range of outputs, counteracting the collapse due to exponential concentration and improving trainability, particularly for high-variance quantities such as predicted densities.
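The operation itself is a one-liner; the sketch below illustrates it with NumPy on toy values. In the actual model \(\alpha_c\) is a learnable parameter updated by gradient descent, not a fixed constant as here.

```python
import numpy as np

def scaled_output(raw: np.ndarray, alpha: float) -> np.ndarray:
    """Rescale a channel's circuit output by a (learnable) factor alpha,
    then clip to the valid [0, 1] range."""
    return np.clip(alpha * raw, 0.0, 1.0)

# circuit expectations concentrate near the mean as the qubit count grows;
# these toy outputs mimic that tightly concentrated distribution
raw = np.array([-0.05, 0.02, 0.08])
print(scaled_output(raw, alpha=10.0))  # -> [0.  0.2 0.8], a much wider usable range
```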

Experimental Results

Final PSNR (dB) of the proposed models compared to the classical baseline. Each value is averaged over 5 seeds and presented with the standard deviation.
Blender

| Model           | Materials    | Ficus        | Lego         | Drums        | Average      |
|-----------------|--------------|--------------|--------------|--------------|--------------|
| Full Q-NeRF     | 33.88 ± 0.16 | 30.26 ± 0.21 | 34.47 ± 0.04 | 28.07 ± 0.05 | 31.67 ± 0.11 |
| DB Q-NeRF       | 29.94 ± 0.31 | 28.59 ± 0.27 | 31.32 ± 0.25 | 25.63 ± 0.19 | 28.87 ± 0.26 |
| Class. Baseline | 29.90 ± 0.19 | 29.74 ± 0.17 | 31.79 ± 0.16 | 26.70 ± 0.13 | 29.53 ± 0.16 |

LLFF

| Model           | Trex         | Room         | Horns        | Fern         | Average      |
|-----------------|--------------|--------------|--------------|--------------|--------------|
| Full Q-NeRF     | 22.87 ± 1.04 | 27.94 ± 0.45 | 23.45 ± 0.71 | 23.21 ± 0.41 | 24.37 ± 0.65 |
| DB Q-NeRF       | 22.03 ± 0.42 | 26.12 ± 0.51 | 22.02 ± 0.48 | 22.22 ± 0.42 | 23.10 ± 0.46 |
| Class. Baseline | 22.11 ± 0.40 | 26.68 ± 0.50 | 21.02 ± 0.58 | 21.96 ± 0.37 | 22.94 ± 0.46 |

Mesh Extraction and 3D Model Visualization

A visualisation of the 3D meshes extracted from the learnt radiance fields.

Our model is compatible with any technique applicable to classical radiance fields. Here, we demonstrate the extraction of 3D meshes from the learnt radiance fields using the Marching Cubes algorithm.
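The first step of such an extraction is sampling the learnt density field on a regular 3D grid; Marching Cubes then turns that grid into a triangle mesh. A minimal sketch of the sampling step, with a hypothetical spherical density standing in for the trained model (the `density_grid` helper and the resolution/bound values are our own illustrative choices; `skimage.measure.marching_cubes` is one standard implementation that accepts such a grid):

```python
import numpy as np

def density_grid(density_fn, resolution=64, bound=1.0):
    """Sample a density field sigma(x, y, z) on a regular cubic grid.
    The resulting volume can be passed to a Marching Cubes implementation
    (e.g. skimage.measure.marching_cubes) to extract a triangle mesh."""
    axis = np.linspace(-bound, bound, resolution)
    xs, ys, zs = np.meshgrid(axis, axis, axis, indexing="ij")
    return density_fn(xs, ys, zs)

# stand-in for a trained density head: a solid sphere of radius 0.5
sphere = lambda x, y, z: (x**2 + y**2 + z**2 < 0.25).astype(np.float32)
grid = density_grid(sphere, resolution=32)
print(grid.shape)  # (32, 32, 32)
```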


BibTex

            @article{LizzioBosco2026QNeRF, 
                title={QNeRF: Neural Radiance Fields on a Simulated Gate-based Quantum Computer}, 
                author={{Lizzio Bosco}, Daniele and Wang, Shuteng and Serra, Giuseppe and Golyanik, Vladislav}, 
                journal={arXiv preprint arXiv:2601.05250}, 
                year={2026} 
            }
            

Contact information

Daniele Lizzio Bosco
lizziobosco.daniele@spes.uniud.it

Vladislav Golyanik
golyanik@mpi-inf.mpg.de