Remove Categorization Remove Hybrid AI Remove Natural Language Processing Remove Prompt Engineering
article thumbnail

Evaluation Derangement Syndrome (EDS) in the GPU-poor’s GenAI. Part 1: the case for Evaluation-Driven Development

deepsense.ai

In short, EDS is the problem of the widespread lack of a rational approach to and methodology for the objective, automated and quantitative evaluation of performance in terms of generative model finetuning and prompt engineering for specific downstream GenAI tasks related to practical business applications. There is a ‘but’, however.