Skip to content
This repository was archived by the owner on Mar 19, 2024. It is now read-only.

What config relate to learning rate warm up, weight decay, and momentum in 1 node n GPUs (n > 1 && n < 8) config? #584

@tungts1101

Description

@tungts1101

❓ How to do something using VISSL

Describe what you want to do, including:

  1. what I am trying to do: I have read the paper Imagenet-1hour. In there they mentioned the learning rate warm-up, weight decay, and momentum when implementing distributed training in 1 node multi gpus. However, I could not find any documents related to these configs. How could I properly set them?
  2. what outputs you are expecting: A config and an explanation related to learning rate warm-up strategy, weight decay, and momentum in 1 node n gpus machine?

❓ What does an API do and how to use it?

Please link to which API or documentation you're asking about from
https://github.com/facebookresearch/vissl/tree/main/docs

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions