Footnotes of a Curious Mind 📝
About

Posts

  • May 19, 2025

    Policy Optimization Algorithms for Alignment

  • May 13, 2025

    Gaming the System: Understanding Reward Hacking in Language‑Model Training

  • Mar 24, 2025

    Multi-Armed Bandit Algorithms for Recommendation Systems

  • Mar 10, 2025

    Resilient Verses: Exploring the Enduring Power of "Invictus"

subscribe via RSS

Footnotes of a Curious Mind 📝

  • Footnotes of a Curious Mind 📝
  • darpannjainn@gmail.com
  • darpan-jain
  • darpann-jain
  • FunnyFriedDerp

A collection of my work in AI, deep dives, and intellectual footnotes—in a quiet corner of the internet 🙂