foundation-models | Sadjad Alikhani

May 06, 2026	Contrastive Self-Supervised Learning: CLIP, SimCLR, and DINO
May 05, 2026	The Transformer Architecture: A First-Principles Deep Dive
May 01, 2026	Multimodal Foundation Models: Teaching AI to See and Read Together
Apr 30, 2026	Neural Scaling Laws: The Power Laws Governing Every LLM
Apr 29, 2026	Chain-of-Thought: Why Thinking Out Loud Makes AI Smarter
Apr 28, 2026	Retrieval-Augmented Generation: Grounding LLMs in Facts
Apr 26, 2026	RoPE and ALiBi: Giving Transformers Unlimited Memory
Apr 25, 2026	Vision Transformers: How Attention Conquered Computer Vision
Apr 22, 2026	Mamba and State Space Models: The Sequence Modelling Revolution
Apr 21, 2026	Mixture of Experts: Scaling AI Without Breaking the Bank
Apr 20, 2026	Flash Attention: Making Transformers Faster Than Ever
Apr 19, 2026	In-Context Learning: How LLMs Learn Without Gradient Updates
Apr 16, 2026	Contrastive Self-Supervised Learning: CLIP, SimCLR, and DINO
Apr 15, 2026	The Transformer Architecture: A First-Principles Deep Dive
Apr 11, 2026	Multimodal Foundation Models: Teaching AI to See and Read Together
Apr 10, 2026	Neural Scaling Laws: The Power Laws Governing Every LLM
Apr 09, 2026	Chain-of-Thought: Why Thinking Out Loud Makes AI Smarter
Apr 08, 2026	Retrieval-Augmented Generation: Grounding LLMs in Facts
Apr 07, 2026	RoPE and ALiBi: Giving Transformers Unlimited Memory
Apr 06, 2026	Vision Transformers: How Attention Conquered Computer Vision
Apr 01, 2026	Mixture of Experts: Scaling AI Without Breaking the Bank
Apr 01, 2026	Mamba and State Space Models: The Sequence Modelling Revolution
Apr 01, 2026	Flash Attention: Making Transformers Faster Than Ever
Nov 28, 2025	Sparse Spatio-Temporal Attention (SSTA)
Nov 27, 2025	Large Wireless Model (LWM)