Common Flaws in NLP Evaluation Experiments
Ehud Reiter
JANUARY 15, 2024
We discovered early on in the project that none of the papers we considered replicating gave enough information to be replicated, and that only 13% of authors were willing and able to provide the missing information (paper) (blog). Some researchers may decide this is not worth doing.