Document new GPU brokerage filters: vram, architecture, CUDA version#77
Open
EdwardKaravakis wants to merge 4 commits into
Open
Document new GPU brokerage filters: vram, architecture, CUDA version#77EdwardKaravakis wants to merge 4 commits into
EdwardKaravakis wants to merge 4 commits into
Conversation
Update the GPU hardware check section to reflect the new two-stage brokerage logic (ATLASPANDA-1684): - CRIC as GPU capability gate - WN GPU monitoring as source for all attribute checks - New gpu_spec fields: vram (min VRAM in MB), architecture (microarch generation), version (min CUDA version)
Add driver_version (NVIDIA kernel driver minimum version) to the gpu_spec parameter table and the two-stage attribute matching description.
Add shorthand attribute table (vram, cuda, uarch, driver, model keys), ART header examples, and JSON prun examples. Update vram description to show operator-prefixed string format consistent with other attributes.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Updates the GPU hardware check section of the brokerage documentation to reflect the new two-stage logic introduced in panda-server PR #714 (ATLASPANDA-1684):
gpuentry)MV_WORKER_NODE_GPU_SUMMARY) as the source for all attribute checksgpu_specfields:vram— minimum GPU memory in MB (e.g.40960for 40 GB)architecture— GPU microarchitecture generation or list (e.g."Ampere"or["Ampere", "Hopper"])version— minimum CUDA version (e.g.>=12.0)Related PRs
--architecturehelp text)