Releases: pytorch/torchtitan

v0.2.2

20 Feb 22:47
73a0e69

v0.2.2 Pre-release

Dependency

PyTorch Version: torch-2.12.0.dev20260220+cu126
TorchAO Version: torchao-0.17.0.dev20260220+cu126

What's changed

🚀 Features

🧠 Model

🐛 Bug Fixes

🧪 Experiments / CI / Infra

🔧 Typing / Lint Cleanup

  • [lint] Ignore all existing pyrefly errors (#2240) by @xmfan
  • [Typing] Fix CI Typing Issues (#2245) by @fegin
  • [Typing] Improve ModelProtocol typing (#2246) by @fegin
  • [Typing] Remove deprecated enable_symm_mem_for_group (#2260) by @fegin
  • [Typing] Remove unused pyrefly ignore (#2280) by @fegin
  • [Typing] Fix pyrefly-ignore in train.py (#2282) by @fegin
  • [Typing] Fix pyrefly ignores in checkpoint.py (#2283) by @fegin
  • [Typing] Fix the ignores in activation_checkpoint.py (#2284) by @fegin
  • [Typing] Fix the ignores in tokenizer.py (#2285) by @fegin
  • [Typing] Fix the ignores in validate.py (#2286) by @fegin
  • [Typing] Fix some pyrefly ignores in optimizer.py (#2294) by @fegin
  • [Typing] Improve typing for some distributed modules (#2295) by @fegin
  • [Typing] Fix the pyrefly ignores in llama3 model.py (#2302) by @fegin
  • [Typing] Fix pyrefly ignores in llama4 model.py (#2303) by @fegin
  • [Typing] Fix pyrefly ignores in qwen3 model.py (#2304) by @fegin
  • [Typing] Fix pyrefly ignores in deepseek model.py (#2305) by @fegin

v0.2.1

26 Dec 23:29
81af883

v0.2.1 Pre-release

Dependency

pytorch version: torch-2.11.0.dev20251226+cu126
torchao version: torchao-0.16.0.dev20251226+cu126

What's Changed

Features

  • Use new DeviceMesh unflatten to rewrite parallel_dims by @fegin in #1660
  • Re:Run Torchtitan ROCm workflow on cron schedule & push to Main branch only by @akashveramd in #2018
  • adding variable length attention to llama3 8b by @liangel-02 in #2000
  • [Local Tensor] Replace dry_run.py with fake mode implementation by @fegin in #2057

Model

Bug Fix

  • [FLOPs] Fix attention FLOPs estimate by @shuhuayu in #1923
  • Fix apply_compile called multiple times in PP initialization by @xmfan in #2135
  • Fix qwen3 attention scaling calculation by @wwwjn in #2173

Experiments

Full Changelog: v0.2.0...v0.2.1

v0.2.0

18 Oct 04:26
29624e3

v0.2.0 Pre-release

Dependency

pytorch version: torch-2.10.0.dev20251019+cu126
torchao version: torchao-0.15.0.dev20251015+cu126

Full Changelog: v0.1.0...v0.2.0

v0.1.0

18 Jun 00:39
f7084fc

v0.1.0 Pre-release

This is the first pre-release of torchtitan, following the release practice outlined in #688.

torch version: torch-2.8.0.dev20250617+cu126
torchao version: torchao-0.12.0.dev20250617+cu126