Compositional Clustering Demo

This is a demonstration of a reinforcement learning model that generalizes components of task structure compositionally, for the paper Compositional clustering in task structure learning (Franklin & Frank, PLOS Comp Bio, 2018; https://doi.org/10.1371/journal.pcbi.1006116).

When a simple artificial agent, such as a Q-learner, encounters a new task, it must learn the properties of the task from scratch via trial and error. A more efficient approach is to generalize skills and goals gained in a previous task to a new one. Previous human subject research has suggested that people generalize rules, or task-sets, from one context to another (see Collins & Frank, Psych Review, 2013). This behavior is consistent with generalization models that use a non-parametric Bayesian clustering algorithm, which treats contexts as belonging to a "set" of contexts that share the same structure. While this account is useful for explaining behavior, it is computationally limited: in ecological settings, both people and artificial agents are likely to encounter contexts that share only a partial similarity with each other.
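As a rough illustration of the clustering idea above, the sketch below computes a Chinese restaurant process (CRP) prior, a common choice for non-parametric Bayesian clustering. The function name `crp_prior` and the concentration parameter `alpha` are assumptions made for this example; they are not taken from the repository's code.

```python
def crp_prior(cluster_counts, alpha=1.0):
    """Prior over cluster assignments for a new context under a CRP.

    cluster_counts: number of contexts already assigned to each cluster.
    alpha: concentration parameter (illustrative default, an assumption).
    Returns one probability per existing cluster, followed by the
    probability of opening a new cluster.
    """
    total = float(sum(cluster_counts)) + alpha
    # Existing clusters are reused in proportion to their popularity.
    probs = [n / total for n in cluster_counts]
    # A new cluster is opened with probability proportional to alpha.
    probs.append(alpha / total)
    return probs


# Two existing clusters holding 3 and 1 contexts, respectively.
prior = crp_prior([3, 1], alpha=1.0)
print(prior)
```

Popular clusters are more likely to be reused, while `alpha` controls how readily a new context is treated as having novel structure.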

A more useful approach is to learn pieces of task structure and generalize them separately. This is particularly useful for goal-directed behavior, a hallmark of which is the ability to combine experiences to generate a novel course of action in an unfamiliar environment.

What task components are useful to generalize separately? One possible division is between reusable skills and frequently encountered goals. Broadly speaking, skills reflect structure in the outcomes of an agent's actions, whereas goals reflect the desirability of various outcomes. In a reinforcement learning setting, skills might be thought of as generalized options.
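The difference between clustering skills and goals together versus separately can be sketched with simple counts. This is a deliberately simplified illustration, not the model from the paper: it assumes the skill and goal labels of past contexts are known (the actual model has to infer them), and the names `reuse_prob`, `seen`, and `alpha` are hypothetical.

```python
from collections import Counter


def reuse_prob(counts, key, alpha=1.0):
    """CRP-style probability of reusing the existing cluster `key`.

    Returns 0.0 when `key` has never been seen; the remaining mass
    (alpha / total) belongs to opening a brand-new cluster.
    """
    total = float(sum(counts.values())) + alpha
    return counts[key] / total


# Hypothetical history of contexts, each a (skill, goal) pair.
seen = [("A", "g1"), ("A", "g1"), ("B", "g2")]

joint_counts = Counter(seen)                 # one cluster per (skill, goal) pair
skill_counts = Counter(s for s, _ in seen)   # skills clustered on their own
goal_counts = Counter(g for _, g in seen)    # goals clustered on their own

# A new context that recombines a known skill with a known goal.
novel = ("A", "g2")
p_joint = reuse_prob(joint_counts, novel)
p_independent = reuse_prob(skill_counts, "A") * reuse_prob(goal_counts, "g2")
print(p_joint, p_independent)
```

Under joint clustering the recombined pair has no existing cluster to draw on, whereas independent clustering assigns it prior mass because each component has been seen before.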

This repository contains a theoretical demonstration that parallels work in human behavior. Empirical studies showing that humans follow these statistical properties were subsequently published in a companion paper (Franklin & Frank, PLOS Comp Bio, 2020; https://doi.org/10.1371/journal.pcbi.1007720).

Notebooks:

A demonstration of the model's performance can be found in the notebook file Demonstration for paper.ipynb
A demonstration of a meta-agent that uses a reinforcement learning process to arbitrate between Joint and Independent clustering can be found in the notebook file Demonstration - Meta agent.ipynb
An information theoretic analysis detailing under what conditions it is useful to cluster can be found in Information Theoretic Analysis.ipynb
A demonstration of a problem where the explorations compound with each new context can be found in Rooms Problem.ipynb
The code to simulate varying the parameters of the rooms problem can be found in Rooms Growth.ipynb

Installation Instructions

This library runs on Python 2.7 and, unlike most Python code, requires compilation with Cython before use. This requires a C compiler (gcc), for which you can find documentation here.

If you have already installed Python 2.7, pip, and gcc on your system, you can install Cython and the other dependencies (if you don't already have them) with pip:
pip install -r requirements.txt

To compile the Cython code, run:
python setup.py build_ext --inplace

Files:

model.gridworld.py: Defines the task environments
model.agents.py: Defines the reinforcement learning agents. Core functions rely on Cython
model.crp.py: Backend for the normative analysis
model.cython_libary: Core functions optimized for speed with Cython
model.rooms_problem, model.rooms_agents: Special agents/models needed for the rooms simulation

Corrigendum:

This code has been amended to correct an error. Specifically, MetaAgent.select_action() and RLMetaAgent.select_action() now call the function .choose_operating_model(), which was missing in the original version.
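The pattern described in this fix can be sketched as follows. This is a hypothetical stand-in, not the repository's MetaAgent: the class `SketchMetaAgent`, the stub `FixedAgent`, the epsilon-greedy arbitration rule, and the `update` method are all assumptions made for illustration. Only the idea that select_action() calls choose_operating_model() before acting comes from the note above.

```python
import random


class FixedAgent(object):
    """Stub sub-agent that always returns the same action (illustration only)."""

    def __init__(self, action):
        self.action = action

    def select_action(self, state):
        return self.action


class SketchMetaAgent(object):
    """Hypothetical meta-agent arbitrating between two sub-agents."""

    def __init__(self, joint_agent, independent_agent, epsilon=0.1, seed=0):
        self.agents = {"joint": joint_agent, "independent": independent_agent}
        self.values = {"joint": 0.0, "independent": 0.0}  # running reward estimates
        self.current = "joint"
        self.epsilon = epsilon
        self.rng = random.Random(seed)

    def choose_operating_model(self):
        # Assumed rule: epsilon-greedy over the running reward estimates.
        if self.rng.random() < self.epsilon:
            self.current = self.rng.choice(sorted(self.agents))
        else:
            self.current = max(sorted(self.values), key=self.values.get)

    def select_action(self, state):
        # Pick the operating model first, then delegate to that sub-agent.
        self.choose_operating_model()
        return self.agents[self.current].select_action(state)

    def update(self, reward, learning_rate=0.1):
        # Move the value of the model that just acted toward the reward.
        self.values[self.current] += learning_rate * (reward - self.values[self.current])


meta = SketchMetaAgent(FixedAgent("left"), FixedAgent("right"), epsilon=0.0)
meta.values["independent"] = 1.0  # pretend independent clustering has paid off
action = meta.select_action(state=0)
print(meta.current, action)
```

Without the call to choose_operating_model() inside select_action(), a meta-agent of this shape would keep acting with whichever sub-agent it started with, regardless of the learned values.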

