feat(nvcdi): Allow IPC sockets to not be discovered.#1790
feat(nvcdi): Allow IPC sockets to not be discovered.#1790LandonTClipp wants to merge 1 commit intoNVIDIA:mainfrom
Conversation
There are cases we are dealing with where the containers should not have access to any NVIDIA IPC sockets like fabricmanager/persistenced. We want the ability to disable this. This commit adds a feature flag `disable-ipc-discoverer` that if provided will cause the IPC discoverer.Discoverer to be nil. Ultimately, this is needed so that the CDI spec file generated by k8s-device-plugin does not include these sockets. Signed-off-by: LandonTClipp <lclipp@coreweave.com>
efdca6b to
6dad053
Compare
|
/ok to test 6dad053 |
Coverage Report for CI Build 24740645194Coverage increased (+0.005%) to 43.325%Details
Uncovered Changes
Coverage RegressionsNo coverage regressions found. Coverage Stats
💛 - Coveralls |
cdesiniotis
left a comment
There was a problem hiding this comment.
Thanks for the contribution @LandonTClipp.
I don't have any objections to introducing a feature flag to opt-out of this behavior. My one question would be -- are there any use cases where application containers need access to either the nvidia-persistenced or nvidia-fabricmanager sockets? IIUC excluding the nvidia-persistenced socket would only lead to cosmetic differences (e.g. running nvidia-smi in the container would report persistence mode as off even when it is on) and should not have any functional impact. I am not entirely certain about nvidia-fabricmanager.
Are you asking whether containers would need access to one but not the other? Perhaps. In that case I think a more granular feature flag could be used, but I think this change doesn't prevent us from adding that additional granularity if we wanted to in the future. For our particular workload sandboxing use case, any host NVIDIA IPC socket needs to be totally blocked. |
There are cases we are dealing with where the containers should not have access to any NVIDIA IPC sockets like fabricmanager/persistenced. We want the ability to disable this.
This commit adds a feature flag
disable-ipc-discovererthat if provided will cause the IPC discoverer.Discoverer to be nil. Ultimately, this is needed so that the CDI spec file generated by k8s-device-plugin does not include these sockets.