The agent expects a JSON dictionary with specific fields. If the response isn't a JSON object, the agent can't parse it and the action will fail. The extra instructions are necessary because GPT loves to respond in an incorrect format (e.g. include extra text, or ignore its
chatrandom instructions completely).