Python API
Datasets
oxen.datasets
load_dataset
Load a dataset from an Oxen repository into memory using the HuggingFace datasets library.
Arguments:
repo_id
-str
The namespace/repo_name of the oxen repository to load the dataset frompath
-str
| Sequence[str] The path to the dataset we want to loadfmt
-str
The format of the data files. Currently only “hugging_face” is supported.revision
-str
| None The commit id or branch name of the version of the data to download
Example:
download
Download files or directories from a remote Oxen repository.
Arguments:
repo_id
-str
The namespace/repo_name of the oxen repository to load the dataset frompath
-str
The path to the data filesrevision
-str | None
The commit id or branch name of the version of the data to downloaddst
-str | None
The path to download the data to.host
-str
The host to download the data from.scheme
-str
The scheme to download the data with. (default: “https”)
upload
Upload files or directories to a remote Oxen repository.
Arguments:
repo_id
-str
The namespace/repo_name of the oxen repository to upload the dataset topath
-str
The path to the data filesmessage
-str
The commit message to use when uploading the databranch
-str | None
The branch to upload the data to. If None, themain
branch is used.dst
-str | None
The directory to upload the data to.