In https://github.com/Jaykef/ai-algorithms/blob/main/hybrid_normalization.ipynb there are **no** positional embedding whatsoever, which are needed for causal attention like this to function at all