To start working with NVIDIA NIM, see Get Started with NVIDIA NIM
To configure your NIM Model client to work through Artifactory, see Set Up NIM Model client
The JFrog Artifactory integration with NVIDIA NIM allows you to cache NVIDIA NIM models in Artifactory via the remote repository.
NVIDIA NIM is a set of easy-to-use microservices for accelerating the deployment of foundation models on any cloud or data center and helps keep your data secure. NIM has production-grade runtimes, including ongoing security updates. Run your business applications with stable APIs backed by enterprise-grade support.
NGC Catalog is the public registry for NVIDIA NIM Models.
To learn more about NVIDIA NIM Models, refer to Deploy Generative AI With NVIDIA NIM.
NVIDIA NIM Repositories
NVIDIA NIM Remote Repositories: Remote repositories in Artifactory act as proxies for repositories on remote servers. Artifactory first checks its local cache for the requested package. If the package is not found in the cache, Artifactory retrieves it from the remote repository via the Internet. Once retrieved, the file is cached locally in Artifactory, making it available for future requests without an internet connection. Importantly, only the requested package is cached, not the entire remote repository.
Artifactory supports proxying remote NVIDIA NIM registries through remote repositories. The Remote Repository in Artifactory is a caching proxy for a registry managed at a remote URL https://api.ngc.nvidia.com/
.
Resources that are requested from a remote repository are cached on demand. You can remove downloaded resources from the remote repository cache. However, you cannot manually push resources to a remote repository.
Main Features of NVIDIA NIM in Artifactory
The following are the main features of NVIDIA NIM in Artifactory:
Single Source of Truth: This integration allows users to work with NVIDIA NIM consumption and Enterprises' best practices for external artifact consumption via a single source of truth in Artifactory.
Consistent and Reliable Access to Remote NVIDIA NIM Models: Remote NVIDIA NIM Repositories in Artifactory proxy external resources from the NGC Catalog and cache downloaded NVIDIA NIM Models. This reduces overall networking and creates fast, consistent, reliable access to NIM Models on these remote resources.
NVIDIA NIM Supported Clients
Docker
Note
Currently, we do not support NGC CLI and NGC SDK.
Limitations of NVIDIA NIM in Artifactory
Currently, JFrog Artifactory does not support the following with NVIDIA NIM packages:
Xray
Curation
Local and Virtual Repositories
Anonymous Access
Note
Supports NIMs with Version 1.3 or later