Blog
Machine Learning and Recommender Systems
-
DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference
How DeepSeek uses idle decode-side NICs to double KV-Cache loading throughput in prefill-decode disaggregated serving.
-
Reinforcement Learning for LLMs
An intuition-first guide to the RL concepts behind RLHF, PPO, and GRPO — the background you need before diving into alignment algorithms.
-
PPO & GRPO for LLM Alignment
A first-principles guide to PPO and GRPO for LLM alignment, for ML engineers with minimal RL background.
-
Hashing for large scale similarity
Machine Learning
-
Implementing Matrix Factorisation using Tensorflow
My quora response
-
How exactly is machine learning used in recommendation engines?
My quora response