Writing

Research Notes

Coming soon. The posts below illustrate planned writing topics.

AI Safety | 8 min | 2025-04-15 | Placeholder

The Alignment Problem in Agentic AI: Beyond Reward Hacking

As AI systems become increasingly autonomous, the alignment problem takes on new dimensions. This post explores failure modes unique to agentic architectures and the governance mechanisms needed to address them.

AI Safety, Agentic AI, Alignment

Read

Research Insights | 6 min | 2025-03-28 | Placeholder

Blockchain as a Trust Anchor for Autonomous AI Systems

Can blockchain's immutability and decentralized verification solve the auditability crisis in autonomous AI? Reflections from our IEEE ICCA 2025 work on blockchain-monitored agentic pipelines.

Blockchain, Trust, Architecture

Read

Technical | 10 min | 2025-02-10 | Placeholder

Prompt Injection and the Fragility of LLM-Based Agents

Prompt injection is one of the most pervasive and underappreciated threats to LLM-based agentic systems. A deep dive into attack vectors, mitigation strategies, and open research questions.

LLMs, Security, Adversarial AI

Read

Stay Updated

Research articles on agentic AI safety are in progress. Follow on Google Scholar for publication updates.

Follow on Google Scholar