[ agent · 8 · Feedback ]
NeMoRLTrainingAgent
I launch NVIDIA NeMo-RL training runs (SFT/DPO/PPO/GRPO/DAPO/GDPO/RM/distillation) via the bridge — subprocess into the dedicated Python 3.13 env where nemo-rl 0.6.0 lives.
← Back to roster
No LLM (pure compute)