ZC Technologies Trust Per-node monthly license rev. 01 build 21da0fd6

Cool the DGX Spark without throttling production.

A small daemon that watches GPU temperature and clamps the clock ceiling in 150 MHz steps when the silicon enters the warning band. Stock NVIDIA actuator, hysteresis controller, auto-relax when temperature returns to floor.

Buy license  → $249 / node / month   •   cancel anytime
[01] Measured Δ
−11 °C
[02] Baseline
83 °C @ 94% util
[03] Steady state
72 °C, same util
[04] Install time
< 60 s
[05] Target
GB10 / Blackwell
[06] Workload
Ollama, llama.cpp

Mechanism

[ SENSE ] nvidia-smi temp.gpu every 30s [ DECIDE ] 3-band hysteresis step ±150 MHz [ LOCK ] nvidia-smi -lgc

Hot band ≥ 78 °C → step max clock down. Warm band → hold. Cool band ≤ 72 °C for 3 consecutive samples → step back up. Floor 1800 MHz, ceiling 3000 MHz.

Validation log (spark-23, production GB10)

TimeTempClockUtilAction
07:46:2882 °C2463 MHz94 %STEP_DOWN
07:47:2883 °C2463 MHz94 %STEP_DOWN
07:56:2976 °C1976 MHz95 %HOLD
08:13:4472 °C2093 MHz94 %HOLD (cool streak 1)
08:14:1472 °C2093 MHz94 %HOLD (cool streak 2)

Same Ollama workload, same util band. Temperature fell from 83 °C → 72 °C over ≈ 28 minutes. No thermal-throttle events.

Install

curl -O https://thermal.zctechnologies.org/dl/zc-thermal-control.tar.gz
tar xzf zc-thermal-control.tar.gz
cd zc-thermal-control
sudo bash install.sh

Writes /usr/local/bin/zc-thermal-control, a systemd unit, and a scoped sudoers rule for nvidia-smi -lgc. SHA-256 of the bundle: 781907430427eb6d90877b76999fa194da82434a0c427af82553107f18aee8fd.

Defaults

FlagDefaultMeaning
-hot78 °CStep clock down at or above this
-cool72 °CStep up after 3 consecutive samples at or below this
-floor1800 MHzMinimum max-clock during cooling
-ceil3000 MHzMaximum max-clock when relaxed
-step150 MHzAdjustment per sample
-interval30 sSampling period

What this is not

Not an undervolt tool. Not a fan-curve controller (GB10 does not expose fan PWM). Not a workload scheduler. The single actuator is nvidia-smi --lock-gpu-clocks; the controller chooses when and how much.

License. Per-node monthly. One license entitles one GPU node. Quantity adjustable at checkout up to 64. Issued to ZC Technologies Trust. Source binary is delivered post-purchase; redistribution is prohibited. Uninstall removes all installed components. No telemetry is collected by the binary; the only network call made by the controller is nvidia-smi against the local driver.