Requirements:
Artifactory version 7.77 and above
Hugging Face client version 0.19.0 and above
HF_HUB_ETAG_TIMEOUTparameter enabled
If these requirements are not met, you can resolve Hugging Face datasets by resolving whole repositories via API. For more information, see Resolve Hugging Face Datasets via API.
To resolve Hugging Face datasets using libraries:
Run the following command:
from datasets import load_dataset
dataset = load_dataset("<DATASET_NAME>")Where:
<DATASET_NAME>: The name of the dataset you want to resolve, formatted according to Hugging Face repository naming structureorganization/name
For example:
from datasets import load_dataset
dataset = load_dataset("wikimedia/wikipedia")