Posts tagged deep-learning

1 post

Understanding LLM Inference Internals

Tracing the path from prompt to token — tokenization, embedding, attention, KV caching, and sampling explained.