Soft prompts as a window into introspection in large language models