Links
This article explores how large language models (LLMs) adopt the "Assistant" persona during interactions. It introduces the "Assistant Axis," a neural framework that describes how models behave, and examines how steering techniques can either stabilize or destabilize their responses. The research highlights how difficult it is to keep the Assistant's character consistent, and the risks of persona drift.