🤐 Do Chatbot LLMs Talk Too Much? Introducing YapBench
We introduce YapBench, a benchmark that measures how much LLMs over-explain when answering simple questions. Our evaluation of 76 models reveals an order-of-magnitude spread in verbosity, with newer models trending toward longer responses.