Darpan’s Blog

Nov 10, 2025
Beyond the Numbers
Rethinking success in a world optimized for metrics instead of meaning
May 18, 2025
Policy Optimization Algorithms for Alignment
Steering language models toward safer outputs…
May 12, 2025
Gaming the System: Understanding Reward Hacking in Language‑Model Training
How clever shortcuts can derail our smartest algorithms, and what is being done about it
Mar 23, 2025
Multi-Armed Bandit Algorithms for Recommendation Systems
Meet the efficient mathematical gamblers working behind the scenes that predict your desires before they form. Multi-armed Bandits are why you probably can’t stop scrolling.
Mar 9, 2025
Resilient Verses: Exploring the Enduring Power of "Invictus"
There’s something rather spectacular about this poem that manages to remain relevant over a century later.