PinnedMark CheninTowards Data ScienceHow to Validate OpenAI GPT Model Performance with Text SummarizationPart 1 of a study on generative AI usage and testingApr 4, 20235Apr 4, 20235
Mark CheninTowards Data ScienceHow to Perform Hallucination Detection for LLMsHallucination metrics for open-domain and closed-domain question answeringJan 222Jan 222
Mark CheninTowards Data ScienceThe 5 Pillars of Trustworthy LLM TestingPart 4 of a study on generative AI usage and testingNov 20, 2023Nov 20, 2023
Mark CheninTowards Data ScienceQuantifying GPT-4’s Hidden Regressions Over TimePart 3 of a study on generative AI usage and testingSep 22, 20231Sep 22, 20231
Mark CheninTowards Data ScienceHow Well Do GPT Models Follow Prompts?Part 2 of a study on generative AI usage and testingMay 24, 2023May 24, 2023