PinnedPublished inTDS ArchiveHow to Validate OpenAI GPT Model Performance with Text SummarizationPart 1 of a study on generative AI usage and testingApr 4, 2023A response icon5Apr 4, 2023A response icon5
Published inTDS ArchiveHow to Perform Hallucination Detection for LLMsHallucination metrics for open-domain and closed-domain question answeringJan 22, 2024A response icon3Jan 22, 2024A response icon3
Published inTDS ArchiveThe 5 Pillars of Trustworthy LLM TestingPart 4 of a study on generative AI usage and testingNov 20, 2023Nov 20, 2023
Published inTDS ArchiveQuantifying GPT-4’s Hidden Regressions Over TimePart 3 of a study on generative AI usage and testingSep 22, 2023A response icon1Sep 22, 2023A response icon1
Published inTDS ArchiveHow Well Do GPT Models Follow Prompts?Part 2 of a study on generative AI usage and testingMay 24, 2023May 24, 2023