Drive Your Data Flow Pipeline to Production Using Artifactory Conda Repositories

Conda Repositories in Artifactory

Over the past decade, there has been an immense data explosion. Most companies use analytics data collected over months, weeks, or even days for business decisions and can respond to changes minute by minute. To do this real-time analysis and data mining, data scientists, data analysts, and software developers use Conda, as their open source, cross-platform tool for managing packages.

JFrog has expanded its universal repository management offering to include support for Conda repositories, available from version 6.3, providing Python, R, Ruby, Lua, Scala, Java, JavaScript, C/ C++, FORTRAN developers with all the benefits of using JFrog Artifactory. This includes scaling, security, and visibility for your Conda packages.

Combined with the recent CRAN repository release, JFrog Artifactory support for Conda package repositories gives users full control of their deployment and resolution process of their packages.

Data Flow Pipeline to Production using Artifactory Conda Repository

In this blog post, we will highlight six reasons for using Conda Repositories in Artifactory and how they have been designed to tackle the main DevOps challenges facing data scientists.

6 Reasons to Use Conda Repositories in Artifactory

Reason 1: Secure Your Data Science Artifacts

Get secure and private local Conda repositories with fine-grained access control. Artifactory lets you host your internal sources and binaries using local repositories so you can share your packages inside your organization.

Reason 2: Get Version Management

Keep older versions of your packages uploaded to local repositories.

Reason 3: Proxy External Dependencies in Remote Conda Repositories

Stay independent of network and remote resources by proxying remote Conda resources, such as the public Conda registry, and caching downloaded packages using remote repositories in Artifactory.

This means that you’re not dependent on the network performance between your data center and any external repositories. And your performance won’t be impacted due to outages or changes to the packages in the external repository.

Reason 4: Collaborate and Share Data Models

Access both locally hosted Conda repositories as well as proxied remote Conda repositories from a single URL, using virtual repositories.

Remote Conda Repositories in Artifactory

Reason 5: Resolve Dependencies from an Intuitive UI or Conda Client

Resolve your dependencies directly in the Artifactory UI. When a Conda repository is selected in the Artifacts Tree Browser, click Set Me Up to view the code snippets you can use to publish a Conda package or to configure your Conda client to resolve artifacts using the selected repository.

Reason 6: View Conda Package Metadata

View selected metadata of the Conda package directly from the Artifactory UI. Metadata calculation of the Conda packages is hosted in the Artifactory local repositories.

In the Tree Browser, select your virtual Conda repository and scroll down to find and select the package you want to inspect. The metadata is displayed in the Conda Info tab.

view metadata

Try it yourself and see how easy it is to get started.