4 references to CodeGenAddedTokens
Microsoft.ML.Tokenizers (4)
Model\CodeGenTokenizer.cs (2)
1896 new RegexPreTokenizer(TiktokenTokenizer.P50kBaseRegex(), CodeGenTokenizer.CodeGenAddedTokens),
1898 CodeGenTokenizer.CodeGenAddedTokens,
Model\Phi2Tokenizer.cs (2)
116 vocabStream, mergesStream, new RegexPreTokenizer(TiktokenTokenizer.P50kBaseRegex(), CodeGenTokenizer.CodeGenAddedTokens), normalizer: null,
117 CodeGenTokenizer.CodeGenAddedTokens, addPrefixSpace: addPrefixSpace, addBeginningOfSentence: addBeginOfSentence, addEndOfSentence: addEndOfSentence);