Mark Chen – Medium

Pinned

Mark Chen in Towards Data Science

How to Validate OpenAI GPT Model Performance with Text Summarization

Part 1 of a study on generative AI usage and testing

Apr 4, 2023

Mark Chen in Towards Data Science

How to Perform Hallucination Detection for LLMs

Hallucination metrics for open-domain and closed-domain question answering

Jan 22

Mark Chen in Towards Data Science

The 5 Pillars of Trustworthy LLM Testing

Part 4 of a study on generative AI usage and testing

Nov 20, 2023

Mark Chen in Towards Data Science

Quantifying GPT-4’s Hidden Regressions Over Time

Part 3 of a study on generative AI usage and testing

Sep 22, 2023

Mark Chen in Towards Data Science

How Well Do GPT Models Follow Prompts?

Part 2 of a study on generative AI usage and testing

May 24, 2023

Mark Chen

MLE R&D @ Kolena

Following

Caitlin Kindig
Ludovic Benistant
Towards Data Science
Katherine Prairie
Ben Huberman
