Update Datasets
Edit Dataset Files
Local Editing (Git / SDK)
Clone the dataset repository locally via Git or Python SDK, make changes to files, then push the updates to the remote.
Using Git:
# Clone the repository
git lfs install
git clone https://<username>:<access-token>@<platform-host>/<namespace>/<dataset-name>
cd <dataset-name>
# Edit files, then commit and push
git add .
git commit -m "update dataset files"
git push
Using Python SDK:
from pycsghub.repository import Repository
repo = Repository(
repo_id="<namespace>/<dataset-name>",
repo_type="dataset",
endpoint="https://<platform-host>",
token="<access-token>"
)
repo.clone()
# Edit files in the local repository, then push
repo.push()
Web Online Editing
- Navigate to the dataset detail page and switch to the Files tab
- Click the file name you want to edit to open the file viewer
- Click the Edit button to enter online editing mode
- Modify the file content, enter a commit message, and click Commit to save
Tip
Web online editing is suitable for modifying text files such as README.md or configuration files. For large data file updates, use Git or CLI instead.
Repository Settings
Navigate to the dataset detail page and click the Settings tab to modify repository configuration.
Non-modifiable Fields
The following fields cannot be changed after creation:
| Field | Description |
|---|---|
| Dataset Name | The English identifier of the dataset; fixed once created |
| Visibility | Defaults to private; the platform does not currently support self-service visibility changes via the UI |
Modifiable Fields
| Field | Description |
|---|---|
| Nickname | Update the friendly display name for the dataset |
| Description | Update the text description of the dataset |
| Task Tags | Add or modify task type tags (e.g., text-classification, question-answering) |
| Industry Tags | Add or modify industry domain tags (e.g., finance, healthcare, education) |
Delete a Dataset
Warning
Deleting a dataset is irreversible. All data files and version history will be permanently removed and cannot be recovered.
To delete a dataset:
- Navigate to the dataset detail page and click the Settings tab
- Scroll to the Delete Dataset section at the bottom of the page
- Type the full name of the dataset in the confirmation input field
- Click the Delete button to confirm