Evaluation Derangement Syndrome (EDS) in the GPU-poor’s GenAI. Part 1: the case for Evaluation-Driven Development

deepsense.ai
