Commit 1addbec

mlunar-metameta-codesync[bot] authored and committed
Add MCCL_SCUBA_LOG_LEVEL cvar for log filtering (#734)
Summary:
Pull Request resolved: #734

Adds a new configuration variable, MCCL_SCUBA_LOG_LEVEL, to control the verbosity of MCCL Scuba logging. This lets operators filter out lower-priority logs and reduce noise in Scuba tables, improving observability by focusing on the most relevant events (critical errors, high-priority events, etc.) while reducing storage costs and query complexity.

Also makes MCCL_SCUBA_ENABLED=true the default so that logs are always available for every job.

Reviewed By: dboyda

Differential Revision: D93675354

fbshipit-source-id: 323d3b1c2e5b50b005447a0e27b2567cc9c68e6f
1 parent 48868e8 commit 1addbec

1 file changed: +15 -0 lines changed

comms/utils/cvars/nccl_cvars.yaml

Lines changed: 15 additions & 0 deletions
@@ -2807,6 +2807,21 @@ cvars:
      events to dedicated MCCL Scuba tables for observability and debugging.
      This is separate from NCCLX logging to minimize risk.
 
+ - name : MCCL_SCUBA_LOG_LEVEL
+   type : enum
+   default : HIGH
+   choices : CRITICAL, HIGH, NORMAL, LOW
+   description : |-
+     Minimum log priority level for MCCL Scuba logging. Only logs at or above
+     this priority level will be forwarded to Scuba; lower-priority logs are
+     dropped. The priority levels from highest to lowest are:
+     CRITICAL - Critical logs only (e.g., errors, failures)
+     HIGH - Critical and high-priority logs (e.g., important events)
+     NORMAL - Critical, high, and normal logs (default operational logs)
+     LOW - All logs including low-priority (verbose/debug)
+     For example, setting MCCL_SCUBA_LOG_LEVEL=HIGH means only HIGH and
+     CRITICAL priority logs will be forwarded to Scuba.
+
  - name : NCCL_GIN_GDAKI_NIC_HANDLER
    type : int
    default : 0
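
The threshold semantics in the MCCL_SCUBA_LOG_LEVEL description can be sketched as below. This is a minimal illustration, not the MCCL implementation: the `ScubaLogLevel` enum, `parse_level`, and `should_forward` are hypothetical names, the numeric ordering (smaller value means higher priority) is an assumption, and raising an error on an unrecognized value is also an assumption. Only the variable name, its four choices, and the HIGH default come from the diff.

```python
import os
from enum import IntEnum


class ScubaLogLevel(IntEnum):
    """Hypothetical ordering: a smaller value means a higher priority."""
    CRITICAL = 0
    HIGH = 1
    NORMAL = 2
    LOW = 3


def parse_level(env=None):
    """Read MCCL_SCUBA_LOG_LEVEL from a mapping; default to HIGH as in the diff."""
    env = os.environ if env is None else env
    raw = env.get("MCCL_SCUBA_LOG_LEVEL", "HIGH").strip().upper()
    try:
        return ScubaLogLevel[raw]
    except KeyError:
        # Assumed behavior: reject values outside the declared choices.
        raise ValueError(f"unknown MCCL_SCUBA_LOG_LEVEL: {raw!r}") from None


def should_forward(log_priority, threshold):
    """Forward a log only if its priority is at or above the threshold."""
    return log_priority <= threshold


if __name__ == "__main__":
    threshold = parse_level({"MCCL_SCUBA_LOG_LEVEL": "HIGH"})
    for level in ScubaLogLevel:
        # Show which priorities pass the HIGH threshold.
        print(level.name, should_forward(level, threshold))
```

With the threshold set to HIGH, the sketch forwards CRITICAL and HIGH logs and drops NORMAL and LOW, matching the example in the description. Setting the threshold to LOW forwards everything.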
