Explain to me how the attention mechanism or transformers work in LLMs.

Video information