```bash
# Clone the repo locally for offline study
git clone https://github.com/amitshekhariitbhu/ai-engineering-interview-questions.git
cd ai-engineering-interview-questions

# Browse the main README
cat README.md

# Or open in your editor
code README.md
```

Key Concepts:
- Transformer architecture: encoder-only, decoder-only, encoder-decoder
- Self-attention: Q (Query), K (Key), V (Value) matrices
- Multi-head attention vs Grouped-Query Attention (GQA)
- Tokenization: BPE, WordPiece, SentencePiece
- Positional encoding (absolute, learned, RoPE)
- KV Cache: speeds up autoregressive inference by caching pas…
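
To make the self-attention and KV cache items above more concrete, here is a minimal pure-Python sketch of single-head scaled dot-product attention with a causal mask. It is an illustration only, not code from the repository: the function names (`softmax`, `attention`, `causal_attention_full`, `causal_attention_cached`) are hypothetical, and the Q, K, V vectors are passed in directly instead of being produced by learned projection matrices.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(q, keys, values):
    """Scaled dot-product attention for a single query vector.

    Computes softmax(q . k_i / sqrt(d_k)) over all keys and returns
    the corresponding weighted sum of the value vectors.
    """
    d_k = len(q)
    scores = [
        sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
        for k in keys
    ]
    weights = softmax(scores)
    dim = len(values[0])
    return [sum(w * v[j] for w, v in zip(weights, values)) for j in range(dim)]


def causal_attention_full(qs, ks, vs):
    """Handle all positions at once; position t attends to 0..t only."""
    return [attention(qs[t], ks[: t + 1], vs[: t + 1]) for t in range(len(qs))]


def causal_attention_cached(qs, ks, vs):
    """Decode one position at a time, as in autoregressive generation.

    Each step appends only the newest key and value to a cache and
    reuses the entries stored by earlier steps.
    """
    k_cache, v_cache, outputs = [], [], []
    for q, k, v in zip(qs, ks, vs):
        k_cache.append(k)
        v_cache.append(v)
        outputs.append(attention(q, k_cache, v_cache))
    return outputs


if __name__ == "__main__":
    # Toy inputs: three positions, two dimensions (assumed values).
    qs = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    ks = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
    vs = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
    print(causal_attention_full(qs, ks, vs))
    print(causal_attention_cached(qs, ks, vs))
```

Both functions return the same outputs. In a real model the keys and values come from projecting each token's hidden state, so the saving from a cache is that those projections are computed once per token and reused at every later decoding step.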