MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants

Human-AI interaction is evolving from static text responses to dynamic, interactive applications.

MiniAppBench is the first comprehensive benchmark designed to evaluate principle-driven, interactive application generation. While traditional benchmarks focus on static layouts or algorithmic snippets, MiniAppBench shifts the paradigm toward MiniApps—HTML-based applications that require both visual rendering and complex interaction logic.