Tag: CriticGPT

  • CriticGPT

    CriticGPT

    OpenAI has developed CriticGPT, a model trained to identify bugs in GPT-4’s code. They are beginning to integrate such models into the RLHF alignment pipeline to assist humans in supervising AI on complex tasks. CriticGPT, based on GPT-4, writes critiques of ChatGPT responses to help human trainers detect mistakes during RLHF. What is CriticGPT? CriticGPT,…