Skip to content
Change the repository type filter

All

    Repositories list

    • Testing ranking algorithms to improve social cohesion
      Python
      33210Updated Mar 26, 2025Mar 26, 2025
    • A benchmark environment for fully cooperative human-AI performance.
      Jupyter Notebook
      205929121Updated Mar 22, 2025Mar 22, 2025
    • tensor-trust

      Public
      A prompt injection game to collect data for robust ML research
      Python
      668324Updated Jan 27, 2025Jan 27, 2025
    • imitation

      Public
      Clean PyTorch implementations of imitation and reward learning algorithms
      Python
      2971.7k7620Updated Jan 7, 2025Jan 7, 2025
    • Prosocial Ranking Challenge Perspective Ranker
      Jupyter Notebook
      0101Updated Nov 26, 2024Nov 26, 2024
    • PRC: Testing ranking algorithms to improve social cohesion
      JavaScript
      3000Updated Sep 21, 2024Sep 21, 2024
    • PRC: Civirank submission
      3000Updated Sep 8, 2024Sep 8, 2024
    • Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"
      Jupyter Notebook
      112610Updated Jun 4, 2024Jun 4, 2024
    • Dataset for the Tensor Trust project
      Jupyter Notebook
      54810Updated Mar 17, 2024Mar 17, 2024
    • reward-function-interpretability

      Public
      Jupyter Notebook
      0140Updated Nov 30, 2023Nov 30, 2023
    • Library to compare and evaluate reward functions
      Python
      86742Updated Oct 23, 2023Oct 23, 2023
    • seals

      Public
      Benchmark environments for reward modelling and imitation learning algorithms.
      Python
      94661Updated Sep 19, 2023Sep 19, 2023
    • Code for "On the Utility of Learning about Humans for Human-AI Coordination"
      Python
      4511000Updated Apr 17, 2023Apr 17, 2023
    • ray

      Public
      A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, …
      Python
      7.2k009Updated Mar 4, 2023Mar 4, 2023
    • eirli

      Public
      An Empirical Investigation of Representation Learning for Imitation (EIRLI), NeurIPS'21
      Python
      43623Updated Mar 4, 2023Mar 4, 2023
    • Checking the divisibility of neural networks, and investigating the nature of the pieces networks can be divided into.
      Python
      2602Updated Feb 11, 2023Feb 11, 2023
    • Web application where humans can play Overcooked with AI agents.
      JavaScript
      286086Updated Dec 6, 2022Dec 6, 2022
    • A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
      Python
      588100Updated Nov 30, 2022Nov 30, 2022
    • A simple webpage that can visualize a sgf string encoded as a url fragment.
      CSS
      0000Updated Sep 29, 2022Sep 29, 2022
    • Python
      1300Updated Aug 11, 2022Aug 11, 2022
    • sacred

      Public
      Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.
      Python
      389103Updated Jul 24, 2022Jul 24, 2022
    • Supporting code for Assistance Games as a Framework paper
      Python
      1300Updated Jul 11, 2022Jul 11, 2022
    • dmc2gym

      Public
      OpenAI Gym wrapper for the DeepMind Control Suite
      Python
      68200Updated Jun 16, 2022Jun 16, 2022
    • Docker files to help reproduce bug described in https://forums.developer.nvidia.com/t/kernel-oops-null-pointer-dereference-when-closing-cuda-application-katago/…
      Dockerfile
      0000Updated May 25, 2022May 25, 2022
    • Preprocessing reward functions to make them more interpretable
      Python
      0400Updated May 11, 2022May 11, 2022
    • Code for the paper "Emergent Complexity via Multi-agent Competition"
      Python
      156400Updated Apr 19, 2022Apr 19, 2022
    • Find best-response to a fixed policy in multi-agent RL
      Python
      4828880Updated Apr 1, 2022Apr 1, 2022
    • PyTorch version of Stable Baselines, improved implementations of reinforcement learning algorithms.
      Python
      2.1k300Updated Nov 6, 2021Nov 6, 2021
    • Script for automatically creating the reconnaissance email.
      HTML
      1500Updated Nov 2, 2021Nov 2, 2021
    • Model summary in PyTorch similar to `model.summary()` in Keras
      Python
      414000Updated Oct 29, 2021Oct 29, 2021