This codebase is built around HuggingFace's (HF) transformers and datasets libraries. It is designed to be modular and loosely coupled, so that it is easy to extend for different kinds of experiments with LLMs.
The three basic components are models, trainers, and evaluators, all of which are meant to be very loosely coupled: for example, there is no back-and-forth communication between them. Some projects may require implementing a new type of model, trainer, or evaluator, each of which can range from a thin wrapper around an external resource (e.g. a HF trainer) to a component written from scratch in PyTorch. Aside from these basic components, logging and config-file management are also supported and should not require any changes for new projects.
This is a work in progress.
We strongly recommend using uv for all things Python. If you do, installing this code is as simple as:
uv sync
There are a few example config files for training and evaluating models in the examples folder. For example, here's how to train a model with an HF trainer using adapters:
python run.py --config examples/examples_train_hf_peft.yaml
Each job creates a directory to store model checkpoints, logs, etc., and is managed by a YAML config file in which all job settings are defined. The main components of every job appear as top-level keys in the config file.
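As an illustration, a job config might look like the following sketch. The key names and values here are hypothetical, not the repo's actual schema; see the files in the examples folder for the real settings:

```yaml
# Hypothetical job config; key names are illustrative only.
output_dir: runs/my_experiment   # where checkpoints and logs are stored
model:
  type: hf_pretrained            # thin wrapper around a Hub model
  name_or_path: gpt2
trainer:
  type: hf                       # which trainer backend to use
  epochs: 3
  learning_rate: 2.0e-5
evaluator:
  type: lm_eval_harness
  tasks: [hellaswag]
```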
All models inherit from HF's PreTrainedModel, so they can be used with any library that supports HF models. Using a pre-trained model from the Hub requires only a very thin wrapper, and custom models can also be created with the same broad support that any HuggingFace model enjoys. For an example of a custom model, see the tuned lens model.
All trainers are meant to take as input a model and some configuration settings for how to train the given model. A trainer can be implemented based on any external library. For example, see LightningTrainer for a trainer based on Lightning.
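As a minimal sketch of the trainer contract described above (the class and field names are hypothetical, not the repo's actual API), a trainer takes a model plus settings and exposes a single entry point; which backend does the actual optimization is an implementation detail:

```python
from dataclasses import dataclass, field
from typing import Any

@dataclass
class TrainerConfig:
    """Hypothetical settings object; real settings come from the YAML job config."""
    epochs: int = 1
    learning_rate: float = 1e-4
    extra: dict[str, Any] = field(default_factory=dict)

class BaseTrainer:
    """Takes a model and its training settings; subclasses delegate to any
    external library (HF Trainer, Lightning, ...) or a hand-written loop."""
    def __init__(self, model: Any, config: TrainerConfig):
        self.model = model
        self.config = config

    def train(self) -> Any:
        raise NotImplementedError

class ToyTrainer(BaseTrainer):
    """Illustrative backend that just counts the epochs it 'ran'."""
    def train(self) -> int:
        ran = 0
        for _ in range(self.config.epochs):
            ran += 1  # a real trainer would run an optimization step here
        return ran
```

Because the trainer only receives the model and settings, and never calls back into an evaluator or config machinery, backends can be swapped without touching the rest of a job.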
Similar to trainers, all evaluators are meant to take in a model and some configuration settings for how to evaluate the given model. These too can be implemented using any external library, e.g. the Language Model Evaluation Harness.
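The evaluator contract can be sketched the same way (again, names here are hypothetical and a real evaluator might wrap e.g. the Language Model Evaluation Harness): an evaluator consumes a model and settings and produces metrics, with no channel back to the trainer:

```python
from typing import Any

class BaseEvaluator:
    """Takes a model and its evaluation settings; returns metric name -> value.
    Hypothetical sketch, not the repo's actual class."""
    def __init__(self, model: Any, settings: dict[str, Any]):
        self.model = model
        self.settings = settings

    def evaluate(self) -> dict[str, float]:
        raise NotImplementedError

class ExactMatchEvaluator(BaseEvaluator):
    """Toy evaluator: treats the model as a callable and scores exact matches
    against (input, expected_output) pairs provided in the settings."""
    def evaluate(self) -> dict[str, float]:
        pairs = self.settings["examples"]
        hits = sum(1 for x, y in pairs if self.model(x) == y)
        return {"exact_match": hits / len(pairs)}
```

For instance, `ExactMatchEvaluator(model=str.upper, settings={"examples": [("a", "A"), ("b", "b")]}).evaluate()` scores one hit out of two examples.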
- For any bugs or feature requests, please open an issue.