Why does the softmax function use the exponential function? — LLM Research | Unlo