A single neural network “weights” represent a 3D model by getting input of camera pose parameters and outputting RGB and sigma (same notion as alpha channel in images).
When calculating the loss, we inject strong inductive bias by incorporating ray distribution.
NeRF also uses coarse-to-fine approach in order to better represent important portions of an object while staying efficient in terms of computation.