Evaluating Generative AI Systems is a Social Science Measurement Challenge

Generative AI models, LLM Model evaluation