Lesson 3 of 3 · Transformer Architecture

Advanced — deep dive ⏱ 27 min

Attention Is All You Need | Paper Explained

Watch this lesson to understand Attention Is All You Need | Paper Explained — takes about 27 minutes.

What You Will Learn

Transformerアーキテクチャの基本的な理解
マルチヘッドアテンションのしくみ
Transformerのトレーニングプロセスの概要

Key Concepts

Transformerアーキテクチャは、エンコーダーとデコーダーから構成されます。エンコーダーは入力テキストを処理し、デコーダーは出力テキストを生成します。マルチヘッドアテンションは、エンコーダーとデコーダーで使用される重要なコンポーネントで、入力テキストの異なる部分之间の関係を学習します。トレーニングプロセスでは、Tokenizerを使用してテキストをトークン化し、エンコーダーとデコーダーを経由して出力テキストを生成します。

Code Examples

# トークン化されたテキストをエンコーダーに渡す
encoder_input = tokenizer.encode("How are you today?")

エンコーダーへの入力例。

# マルチヘッドアテンションの出力
attention_output = multi_head_attention(query, key, value)

マルチヘッドアテンションの出力例。

# デコーダーの出力
decoder_output = decoder(attention_output)

デコーダーの出力例。

Lesson Summary

このレッスンでは、Transformerアーキテクチャの基本的な構成とマルチヘッドアテンションのしくみを解説しました。Transformerは、エンコーダーとデコーダーから構成され、エンコーダーは入力テキストを処理し、デコーダーは出力テキストを生成します。マルチヘッドアテンションは、入力テキストの異なる部分之间の関係を学習する重要なコンポーネントです。トレーニングプロセスでは、Tokenizerを使用してテキストをトークン化し、エンコーダーとデコーダーを経由して出力テキストを生成します。マルチヘッドアテンションの出力は、デコーダーに渡され、最終的に出力テキストが生成されます。

Practice Exercise

次のテキストをトークン化し、エンコーダーへの入力として使用してください。“Hello, how are you?”

What Is Next

次のレッスンでは、Transformerのトレーニングプロセスとハイパーパラメータのチューニングについて詳しく解説します。

Go deeper on NLPと大規模言語モデル

Hand-picked books and tools our learners actually use. Affiliate-supported — same price for you, supports free lessons.

📚 Books, films & collectibles about Attention Is All You Need | Paper Explained

Amazon New editions, films & cookbooks

Programming books Browse Amazon → Python for beginners Browse Amazon → Web development Browse Amazon → AI / Machine learning Browse Amazon →

eBay Vintage posters, rare books & memorabilia

Used programming books Browse eBay → Video course bundles Browse eBay →

As an Amazon Associate and eBay Partner we earn from qualifying purchases.

Attention Is All You Need | Paper Explained

What You Will Learn

Key Concepts

Code Examples

Lesson Summary

Practice Exercise

What Is Next

Free for readers

Get the Best of the Week — One Email, Every Sunday

Go deeper on NLPと大規模言語モデル

Choose Language

Attention Is All You Need | Paper Explained

What You Will Learn

Key Concepts

Code Examples

Lesson Summary

Practice Exercise

What Is Next

Free for readers

Get the Best of the Week — One Email, Every Sunday

Go deeper on NLPと大規模言語モデル

🍪 We Value Your Privacy

Cookie Preferences