Evaluating LLMs is a minefield
Roger on 09/10/2023
https://www.cs.princeton.edu/~arvindn/talks/evaluating_llms_minefield/
Category: Notes