Logo credits: Milena Rmus

Centaur: a foundation model of human cognition

Establishing a unified theory of cognition has been a major goal of psychology. While there have been previous attempts to instantiate such theories by building computational models, we currently do not have one model that captures the human mind in its entirety. Here we introduce Centaur, a computational model that can predict and simulate human behavior in any experiment expressible in natural language. We derived Centaur by finetuning a state-of-the-art language model on a novel, large-scale data set called Psych-101. Psych-101 reaches an unprecedented scale, covering trial-by-trial data from over 60,000 participants performing over 10,000,000 choices in 160 experiments. Centaur not only captures the behavior of held-out participants better than existing cognitive models, but also generalizes to new cover stories, structural task modifications, and entirely new domains. Furthermore, we find that the model's internal representations become more aligned with human neural activity after finetuning. Taken together, Centaur is the first real candidate for a unified model of human cognition. We anticipate that it will have a disruptive impact on the cognitive sciences, challenging the existing paradigm for developing computational models.

Contact: Marcel Binz

HuggingFace arXiv GitHub
Model Paper Code

I want to use Centaur. How?

  1. Have access to at least one 80 GB GPU (e.g. A100)? You can run the model locally using unsloth.
  2. Have neither GPUs nor money? You can play with a smaller version on Google Colab's free GPUs.
  3. Have money but no GPUs? You can deploy the model via a dedicated Inference Endpoint (more soon).

I want to prompt Centaur. How?

  1. You can find prompt examples here or in the Appendix of our preprint.
  2. We did not employ a particular prompt template – just phrase everything in natural language.
  3. Human choices are encapsulated by "<<" and ">>" tokens.
  4. Most experiments in the training data are framed in terms of button presses. If possible, it is recommended to use that style.

I would like to contribute data. How?

    We recently launched a collaborative effort to create a new large-scale data set of human behavior in a natural language format. Please check out Psych-201 if you are interested in contributing.