What is normalization in the context of the softmax function?