Mark Chen – Medium

Mark Chen

Pinned

Published in
TDS Archive

How to Validate OpenAI GPT Model Performance with Text Summarization

Part 1 of a study on generative AI usage and testing

Apr 4, 2023

How to Validate OpenAI GPT Model Performance with Text Summarization

Apr 4, 2023

Published in
TDS Archive

How to Perform Hallucination Detection for LLMs

Hallucination metrics for open-domain and closed-domain question answering

Jan 22, 2024

How to Perform Hallucination Detection for LLMs

Jan 22, 2024

Published in
TDS Archive

The 5 Pillars of Trustworthy LLM Testing

Part 4 of a study on generative AI usage and testing

Nov 20, 2023

The 5 Pillars of Trustworthy LLM Testing

Nov 20, 2023

Published in
TDS Archive

Quantifying GPT-4’s Hidden Regressions Over Time

Part 3 of a study on generative AI usage and testing

Sep 22, 2023

Quantifying GPT-4’s Hidden Regressions Over Time

Sep 22, 2023

Published in
TDS Archive

How Well Do GPT Models Follow Prompts?

Part 2 of a study on generative AI usage and testing

May 24, 2023

How Well Do GPT Models Follow Prompts?

May 24, 2023

Mark Chen

Mark Chen

MLE R&D @ Kolena

Following

Help
Status
About
Careers
Press
Blog
Privacy
Rules
Terms
Text to speech