Posts tagged deep-learning
1 post
Understanding LLM Inference Internals
Tracing the path from prompt to token — tokenization, embedding, attention, KV caching, and sampling explained.
1 post
Tracing the path from prompt to token — tokenization, embedding, attention, KV caching, and sampling explained.