The short answer
Yes, it is reasonable to talk about internal structures that behave a bit like modules. But current evidence points less to clean software-like packages and more to semi-local, reusable latent structures: sparse features, circuits, attention heads, MLP subspaces, task vectors, and activation-space directions.
Pretraining grows a large ecology of latent features and circuits. Post-training usually reshapes access, routing, readout, and behavioral gating rather than installing clean new organs named “math”, “coding”, or “ethics”.