PinnedPublished inTDS ArchiveHow to Validate OpenAI GPT Model Performance with Text SummarizationPart 1 of a study on generative AI usage and testingApr 4, 20235Apr 4, 20235
Published inTDS ArchiveHow to Perform Hallucination Detection for LLMsHallucination metrics for open-domain and closed-domain question answeringJan 22, 20243Jan 22, 20243
Published inTDS ArchiveThe 5 Pillars of Trustworthy LLM TestingPart 4 of a study on generative AI usage and testingNov 20, 2023Nov 20, 2023
Published inTDS ArchiveQuantifying GPT-4’s Hidden Regressions Over TimePart 3 of a study on generative AI usage and testingSep 22, 20231Sep 22, 20231
Published inTDS ArchiveHow Well Do GPT Models Follow Prompts?Part 2 of a study on generative AI usage and testingMay 24, 2023May 24, 2023