1 year, 11 months ago

ChatGPT creators try to use artificial intelligence to explain itself – and come across major problems

Sign up to our free weekly IndyTech newsletter delivered straight to your inbox Sign up to our free IndyTech newsletter Sign up to our free IndyTech newsletter SIGN UP I would like to be emailed about offers, events and updates from The Independent. They did so by attempting to create an automated process that would allow the system to provide natural language explanations of the neuron’s behaviour – and apply that to another, earlier language model. Part of the problem may be that explaining how the system is working in normal language is impossible – because the system may be using individual concepts that humans cannot name. “We focused on short natural language explanations, but neurons may have very complex behaviour that is impossible to describe succinctly,” the authors write. “For example, neurons could be highly polysemantic or could represent single concepts that humans don’t understand or have words for.” It also runs into problems because it is focused on specifically what each neuron does individually, and not how that might affect things later on in the text.

Discover Related