Loading paper
CHBench: A Chinese Dataset for Evaluating Health in Large Language Models | Tomesphere