📚 KiWiki
Total: 1 paper. Last updated: 2026-04-07

cs.CL
Attention Is All You Need
Ashish Vaswani, Noam Shazeer, et al. (2017)
The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an attention mechanism. We propose a new simple network architecture, the…