Last updated:

Model Fine-tuning FAQ

FAQ

The fine-tune button is grayed out with message "Fine-tuning framework does not support this model"

Cause: The current fine-tuning framework (LLaMA-Factory or MS-Swift) does not yet support this model architecture.

Solution: Contact the platform administrator with the model name and relevant details. The administrator will evaluate and add support as soon as possible.

The fine-tune button is grayed out with message "Model metadata not recognized"

Cause: The model files are incomplete, or the model's configuration information (architecture info in config.json) cannot be automatically recognized.

Solution:

Confirm the model repository contains a complete config.json file.
Contact the platform administrator to manually trigger a model metadata scan.

Training loss is not decreasing

Possible Causes:

Learning rate is too high or too low.
Dataset format is incorrect, causing samples to be skipped.
Training data volume is too small.

Solution:

Check that the dataset format matches the selected framework's requirements.
Adjust the learning rate (starting from 1e-4 is recommended).
Ensure sufficient training samples (at least a few hundred is recommended).

Out of memory (OOM) during training

Solution:

Switch to a compute configuration with more VRAM.
Enable quantized fine-tuning in the framework settings (e.g., QLoRA with 4-bit quantization).
Reduce the batch size.
Use gradient accumulation to achieve an equivalent larger batch size.