Remove BERT Remove Generative AI Remove Hybrid AI Remove Prompt Engineer
article thumbnail

Evaluation Derangement Syndrome (EDS) in the GPU-poor’s GenAI. Part 1: the case for Evaluation-Driven Development

deepsense.ai

In short, EDS is the problem of the widespread lack of a rational approach to and methodology for the objective, automated and quantitative evaluation of performance in terms of generative model finetuning and prompt engineering for specific downstream GenAI tasks related to practical business applications. Garrido-Merchán E.C.,