Which config options relate to learning rate warm-up, weight decay, and momentum in a 1-node, n-GPU setup (1 < n < 8)? #584

## ❓ How to do something using VISSL

Describe what you want to do, including:
1. what I am trying to do: I have read the paper [Imagenet-1hour](https://arxiv.org/pdf/1706.02677.pdf). It discusses learning rate warm-up, weight decay, and momentum for distributed training, and I want to apply these on 1 node with multiple GPUs. However, I could not find any documentation for these config options. How should I set them properly?
2. what outputs you are expecting: A config, with an explanation, covering the learning rate warm-up strategy, weight decay, and momentum for a machine with 1 node and n GPUs.
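
To make the question concrete, here is a minimal sketch of the schedule as I understand it from the linked paper: the linear scaling rule (reference learning rate 0.1 per 256 images), a gradual warm-up over the first 5 epochs, step decay by 10x at epochs 30, 60, and 80, momentum 0.9, and weight decay 1e-4. The function names and the `per_gpu_batch` value are my own assumptions for illustration. They are not VISSL config keys, which is exactly what I am asking about.

```python
# Sketch of the learning rate recipe from the linked paper (arXiv:1706.02677).
# All names here are hypothetical helpers, not part of VISSL.

MOMENTUM = 0.9        # SGD momentum used in the paper
WEIGHT_DECAY = 1e-4   # weight decay used in the paper


def scaled_lr(global_batch_size, base_lr=0.1, base_batch_size=256):
    """Linear scaling rule: lr grows proportionally with the global batch size."""
    return base_lr * global_batch_size / base_batch_size


def lr_at_epoch(epoch, global_batch_size, warmup_epochs=5,
                base_lr=0.1, base_batch_size=256,
                milestones=(30, 60, 80), gamma=0.1):
    """Learning rate at a (possibly fractional) epoch.

    Ramps linearly from base_lr to the scaled target during warm-up,
    then multiplies the target by gamma for every milestone already reached.
    """
    target = scaled_lr(global_batch_size, base_lr, base_batch_size)
    if epoch < warmup_epochs:
        return base_lr + (target - base_lr) * epoch / warmup_epochs
    passed = sum(1 for m in milestones if epoch >= m)
    return target * (gamma ** passed)


if __name__ == "__main__":
    num_gpus = 4          # 1 node, n GPUs with 1 < n < 8
    per_gpu_batch = 128   # assumed value for illustration
    global_batch = num_gpus * per_gpu_batch
    for epoch in (0, 2.5, 5, 30, 60, 80):
        print(epoch, lr_at_epoch(epoch, global_batch))
```

With a global batch of 256 or fewer images, the scaled target is no larger than the reference rate of 0.1, so the warm-up phase has little to do. My question is how to express this schedule, plus the momentum and weight decay values, in a VISSL config.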

## ❓ What does an API do and how to use it?
Please link to which API or documentation you're asking about from
https://github.com/facebookresearch/vissl/tree/main/docs

