OpenAI creators say this is the secret behind the success of ChatGPT: ‘Humans feel…’
Hindustan Times
The OpenAI team said that when the company introduced ChatGPT in 2022, they never thought the AI tool would become so popular. The secret behind its success, they said, is a technique called reinforcement learning from human feedback. The developers trained the model to generate responses that human users preferred, which helped refine the technology. Jan Leike, the leader of OpenAI's alignment team, explained, “One of the lines that emerged in this training was 'As a language model trained by OpenAI.' It wasn't explicitly put in there, but it's one of the things the human raters ranked highly.” Sandhini Agarwal, an AI policy researcher at OpenAI, said human raters ranked the ChatGPT model based on various criteria, adding, “But they also began preferring things that they considered good practice, like not pretending to be something that you're not.”