Why decoder-only? LLM架构的演化之路