TCEval: Using Thermal Comfort to Assess Cognitive and Perceptual Abilities of AI
Jingming Li

TL;DR
TCEval introduces a novel framework using thermal comfort scenarios to evaluate AI's cognitive abilities, focusing on reasoning, causal understanding, and decision-making in real-world contexts.
Contribution
This work pioneers the use of thermal comfort as a paradigm for assessing AI cognition, providing a new benchmark that emphasizes embodied perception and adaptive decision-making.
Findings
LLMs show limited exact alignment with human thermal comfort feedback.
Directional consistency in LLM responses improves with a 1 PMV tolerance.
LLMs perform near-random in discrete thermal comfort classification.
Abstract
A critical gap exists in LLM task-specific benchmarks. Thermal comfort, a sophisticated interplay of environmental factors and personal perceptions involving sensory integration and adaptive decision-making, serves as an ideal paradigm for evaluating real-world cognitive capabilities of AI systems. To address this, we propose TCEval, the first evaluation framework that assesses three core cognitive capacities of AI, cross-modal reasoning, causal association, and adaptive decision-making, by leveraging thermal comfort scenarios and large language model (LLM) agents. The methodology involves initializing LLM agents with virtual personality attributes, guiding them to generate clothing insulation selections and thermal comfort feedback, and validating outputs against the ASHRAE Global Database and Chinese Thermal Comfort Database. Experiments on four LLMs show that while agent feedback has…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBuilding Energy and Comfort Optimization · Emotion and Mood Recognition · Advanced Sensor and Energy Harvesting Materials
