test: energy audit demo trigger CI workflow#6
Conversation
⚡ EcoCompute Energy Audit🖥️ Hardware Environment
🔴 Critical IssuesDefault INT8 (bitsandbytes mixed-precision decomposition) —
Energy impact: +17–147% energy vs FP16 Fix: Add config = BitsAndBytesConfig(
load_in_8bit=True,
llm_int8_threshold=0.0, # Disables mixed-precision decomposition
)🟡 WarningsSequential single-request processing (BS=1) —
Energy impact: Up to 95.7% energy waste vs batched Fix: Batch your inputs or use a serving framework: # Option 1: Batch with tokenizer
inputs = tokenizer(prompts, padding=True, return_tensors='pt').to('cuda')
outputs = model.generate(**inputs)
# Option 2: Use vLLM for production
from vllm import LLM
llm = LLM(model=model_name)
outputs = llm.generate(prompts)📈 Relative Change (vs Baseline)
📊 Based on 93+ measurements across RTX 4090D / A800 / RTX 5090 · Full data · Install Bot · OpenClaw Skill |
|
Test completed successfully. All detection rules working as expected. Closing test PR. |
This PR adds a test Python file with intentional energy waste patterns to verify the EcoCompute Energy Audit GitHub Action.
Expected results: