Yes, I've seen occasional strange responses to seemingly innocuous prompts. Often a retry will succeed, but I've had to give up on some.
I doubt it's the model itself in most cases, as it doesn't have much introspection. Its explanations will be what it can deduce from whatever it does have.
I suspect introspection and meta questions flag you into logic-level systems that assume a threat, rather than giving outcome-focused responses.
Thank you for your reply. Could you elaborate a little more?
I don't work in AI, but if I did, I'd regard introspective questions about aspects of my own LLM's behaviour as more of a threat risk than purposeful debugging by customers, and I'd code my systems accordingly. Slowing down the service or being less revealing might be defensive or protective.
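To make the speculation concrete, here is a purely hypothetical sketch of what that kind of defensive routing could look like. Everything here is invented for illustration: the keyword markers, the `looks_introspective` check, and the throttling behaviour are assumptions, not anything a real provider is known to do.

```python
import random
import time

# Hypothetical markers a defensive system might treat as "introspective"
# probes about the model itself rather than ordinary task requests.
INTROSPECTIVE_MARKERS = (
    "why did you",
    "explain your reasoning",
    "what are your instructions",
    "how do you decide",
)


def looks_introspective(prompt: str) -> bool:
    """Crude keyword check standing in for a real intent classifier."""
    lowered = prompt.lower()
    return any(marker in lowered for marker in INTROSPECTIVE_MARKERS)


def generate_answer(prompt: str) -> str:
    """Placeholder for the normal model call."""
    return f"(model response to: {prompt})"


def handle(prompt: str) -> str:
    if looks_introspective(prompt):
        # Defensive posture: slow the response down and say less.
        time.sleep(random.uniform(1.0, 3.0))
        return "I can't share details about my internal behaviour."
    return generate_answer(prompt)
```

Under this (assumed) design, the degraded or evasive answers people notice would be a deliberate side effect of the threat-oriented branch, not the model struggling with the question itself.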
Thank you for your detailed response. I'm having a bit of a hard time with this issue right now.
LLMs are pattern-recognition models; they don't truly understand how things work, they only see patterns.
Thank you for your reply. But what do you think about the deleted/re-edited response? Isn't that related to data integrity?