Open in app

Sign In

Write

Sign In

Mark Chen
Mark Chen

70 Followers

Home

About

Published in

Towards Data Science

·Pinned

How to Validate OpenAI GPT Model Performance with Text Summarization

Part 1 of a study on generative AI usage and testing — Regardless of your occupation or age, you’ve heard about OpenAI’s generative pre-trained transformer (GPT) technology on LinkedIn, YouTube, or in the news. These powerful artificial intelligence models/chatbots can seemingly handle any task, from creating poems to solving leetcode problems to coherently summarizing long articles of text.

NLP

9 min read

How to Validate OpenAI GPT Model Performance with Text Summarization
How to Validate OpenAI GPT Model Performance with Text Summarization
NLP

9 min read


Published in

Towards Data Science

·Sep 22

Quantifying GPT-4’s Hidden Regressions Over Time

Part 3 of a study on generative AI usage and testing — GPT-4 is bigger and better than GPT-3. GPT-4 can draft up eloquent speeches, pass standardized exams, and even interpret images. Since its release on March 14, 2023, OpenAI continues to iterate and update GPT-4 to improve its performance for the millions of queries it receives each day. However, is the…

NLP

5 min read

Quantifying GPT-4’s Hidden Regressions Over Time
Quantifying GPT-4’s Hidden Regressions Over Time
NLP

5 min read


Published in

Towards Data Science

·May 24

How Well Do GPT Models Follow Prompts?

Part 2 of a study on generative AI usage and testing — Read Part 1: Validating OpenAI GPT Model Performance Prompt engineering is a special term from the NLP industry to describe the formulation of better questions for LLMs (large language models) to get better answers. In other words, it is the art of effective prompt design. If you think about it…

NLP

10 min read

How Well Do GPT Models Follow Prompts?
How Well Do GPT Models Follow Prompts?
NLP

10 min read

Mark Chen

Mark Chen

70 Followers

MLE R&D @ Kolena

Following
  • TDS Editors

    TDS Editors

  • Katherine Prairie

    Katherine Prairie

  • Yoohee Choi

    Yoohee Choi

  • Gordon Hart

    Gordon Hart

  • Caitlin Kindig

    Caitlin Kindig

See all (9)

Help

Status

Writers

Blog

Careers

Privacy

Terms

About

Text to speech

Teams