
METAL: A Metamorphic Testing Framework for Large Language Model Quality Assessment
This research introduces a metamorphic testing paradigm for assessing the quality of large language models (LLMs), operationalized through the METAL framework. It addresses the shortcomings of traditional LLM quality assurance methods by providing an assessment pipeline that is scalable, annotation-free, and comprehensive.
