Recording First-person Experiences to Build a New Type of Foundation Model
Dionis Barcari, David Gamez, Aliya Grig

TL;DR
This paper introduces a novel recording rig that captures comprehensive human sensory and physiological data to train foundation models that more accurately replicate human behavior and internal states, surpassing current personality-based models.
Contribution
It presents a new data collection method and proposes a foundation model trained on this data to improve human behavior simulation in AI systems.
Findings
Prototype recording rig developed and tested.
Preliminary data processing methods demonstrated.
Potential applications in recommendation and personal assistance.
Abstract
Foundation models have had a big impact in recent years and billions of dollars are being invested in them in the current AI boom. The more popular ones, such as Chat-GPT, are trained on large amounts of Internet data. However, it is becoming apparent that this data is likely to be exhausted soon, and technology companies are looking for new sources of data to train the next generation of foundation models. Reinforcement learning, RAG, prompt engineering and cognitive modelling are often used to fine-tune and augment the behaviour of foundation models. These techniques have been used to replicate people, such as Caryn Marjorie. These chatbots are not based on people's actual emotional and physiological responses to their environment, so they are, at best, a surface-level approximation to the characters they are imitating. To address these issues, we have developed a recording rig…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvacuation and Crowd Dynamics · Data Visualization and Analytics
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Byte Pair Encoding · Softmax · Dense Connections · Dropout · Linear Layer · Attention Dropout · Residual Connection · Linear Warmup With Linear Decay · BART
