Govur University Logo
--> --> --> -->
...

What is the role of the [UNK] token in vocabulary management?



The [UNK] token, short for 'unknown' token, serves as a placeholder for words that are not present in the model's vocabulary. Vocabulary management is a crucial part of preparing text for a language model like ChatGPT. The vocabulary contains all the words the model recognizes. When the model encounters a word it hasn't seen during training, it ....

Log in to view the answer



Redundant Elements