2 writes to _cache
Microsoft.ML.GenAI.LLaMA (2)
Module\LlamaModel.cs (2)
41
this.
_cache
= new DynamicKVCache();
58
this.
_cache
= input.OverrideCache;
2 references to _cache
Microsoft.ML.GenAI.LLaMA (2)
Module\LlamaModel.cs (2)
131
pastKeyValue: this.
_cache
,
148
return new CausalLMModelOutput(lastHiddenState: hiddenStates, allHiddenStates: allHiddenStates.ToArray(), attentions: allAttentions.ToArray(), cache: this.
_cache
);