Understanding Text Generation Parameters in Transformers

This post is divided into seven parts; they are: - Core Text Generation Parameters - Experimenting with Temperature - Top-K and Top-P Sampling - Controlling Repetition - Greedy Decoding and Sampling - Parameters for Specific Applications - Beam Search and Multiple Sequences Generation Let's pick the GPT-2 model as an example.

Apr 23, 2025 - 16:23
 0
Understanding Text Generation Parameters in Transformers
This post is divided into seven parts; they are: - Core Text Generation Parameters - Experimenting with Temperature - Top-K and Top-P Sampling - Controlling Repetition - Greedy Decoding and Sampling - Parameters for Specific Applications - Beam Search and Multiple Sequences Generation Let's pick the GPT-2 model as an example.