Model Fine-tuning FAQ
FAQ
The fine-tune button is grayed out with message "Fine-tuning framework does not support this model"
Cause: The current fine-tuning framework (LLaMA-Factory or MS-Swift) does not yet support this model architecture.
Solution: Contact the platform administrator with the model name and relevant details. The administrator will evaluate and add support as soon as possible.
The fine-tune button is grayed out with message "Model metadata not recognized"
Cause: The model files are incomplete, or the model's configuration information (architecture info in config.json) cannot be automatically recognized.
Solution:
- Confirm the model repository contains a complete
config.jsonfile. - Contact the platform administrator to manually trigger a model metadata scan.
Training loss is not decreasing
Possible Causes:
- Learning rate is too high or too low.
- Dataset format is incorrect, causing samples to be skipped.
- Training data volume is too small.
Solution:
- Check that the dataset format matches the selected framework's requirements.
- Adjust the learning rate (starting from
1e-4is recommended). - Ensure sufficient training samples (at least a few hundred is recommended).
Out of memory (OOM) during training
Solution:
- Switch to a compute configuration with more VRAM.
- Enable quantized fine-tuning in the framework settings (e.g., QLoRA with 4-bit quantization).
- Reduce the batch size.
- Use gradient accumulation to achieve an equivalent larger batch size.