Introduction Transformer Model from Math Perspective
Transformer Last year when OpenAI published ChatGPT, the large language model (LLM) making waves in AI community. why chatgpt works so well? can we understand the internal of GPT? Deep learning neural network like a black box, we can only know the result from it, but can’t understand the data process in it’s internal. for […]