Show HN: A new benchmark for testing LLMs for deterministic outputs

(interfaze.ai)

60 points | by khurdula 7 days ago

30 comments