Open in app

Sign in

Write

Sign in

Mark Chen
Mark Chen

103 Followers

Home

About

Pinned
Towards Data Science

Published in

Towards Data Science

How to Validate OpenAI GPT Model Performance with Text Summarization

Part 1 of a study on generative AI usage and testing

Apr 4, 2023
5
How to Validate OpenAI GPT Model Performance with Text Summarization
How to Validate OpenAI GPT Model Performance with Text Summarization
Apr 4, 2023
5
Towards Data Science

Published in

Towards Data Science

How to Perform Hallucination Detection for LLMs

Hallucination metrics for open-domain and closed-domain question answering

Jan 22
2
How to Perform Hallucination Detection for LLMs
How to Perform Hallucination Detection for LLMs
Jan 22
2
Towards Data Science

Published in

Towards Data Science

The 5 Pillars of Trustworthy LLM Testing

Part 4 of a study on generative AI usage and testing

Nov 20, 2023
The 5 Pillars of Trustworthy LLM Testing
The 5 Pillars of Trustworthy LLM Testing
Nov 20, 2023
Towards Data Science

Published in

Towards Data Science

Quantifying GPT-4’s Hidden Regressions Over Time

Part 3 of a study on generative AI usage and testing

Sep 22, 2023
1
Quantifying GPT-4’s Hidden Regressions Over Time
Quantifying GPT-4’s Hidden Regressions Over Time
Sep 22, 2023
1
Towards Data Science

Published in

Towards Data Science

How Well Do GPT Models Follow Prompts?

Part 2 of a study on generative AI usage and testing

May 24, 2023
How Well Do GPT Models Follow Prompts?
How Well Do GPT Models Follow Prompts?
May 24, 2023
Mark Chen

Mark Chen

103 Followers

MLE R&D @ Kolena

Following
  • Caitlin Kindig

    Caitlin Kindig

  • Ludovic Benistant

    Ludovic Benistant

  • Towards Data Science

    Towards Data Science

  • Katherine Prairie

    Katherine Prairie

  • Ben Huberman

    Ben Huberman

See all (9)

Help

Status

About

Careers

Press

Blog

Privacy

Terms

Text to speech

Teams