D: What prompts do you use to evaluate new LLM capabilities?

Posted by head_robotics@reddit | LocalLLaMA | View on Reddit | 11 comments

What prompts do you use to evaluate new LLM capabilities?

Do you have a series of standard prompts you use to see what an LLM can do?
What things have you found the most useful for overall evaluation?