< img height="1" width="1" style="display:none;" alt="" src="https://px.ads.linkedin.com/collect/?pid=3131724&fmt=gif" />
Last updated:

    Model Fine-tuning FAQ

    FAQ

    The fine-tune button is grayed out with message "Fine-tuning framework does not support this model"

    Cause: The current fine-tuning framework (LLaMA-Factory or MS-Swift) does not yet support this model architecture.

    Solution: Contact the platform administrator with the model name and relevant details. The administrator will evaluate and add support as soon as possible.


    The fine-tune button is grayed out with message "Model metadata not recognized"

    Cause: The model files are incomplete, or the model's configuration information (architecture info in config.json) cannot be automatically recognized.

    Solution:

    1. Confirm the model repository contains a complete config.json file.
    2. Contact the platform administrator to manually trigger a model metadata scan.

    Training loss is not decreasing

    Possible Causes:

    • Learning rate is too high or too low.
    • Dataset format is incorrect, causing samples to be skipped.
    • Training data volume is too small.

    Solution:

    1. Check that the dataset format matches the selected framework's requirements.
    2. Adjust the learning rate (starting from 1e-4 is recommended).
    3. Ensure sufficient training samples (at least a few hundred is recommended).

    Out of memory (OOM) during training

    Solution:

    1. Switch to a compute configuration with more VRAM.
    2. Enable quantized fine-tuning in the framework settings (e.g., QLoRA with 4-bit quantization).
    3. Reduce the batch size.
    4. Use gradient accumulation to achieve an equivalent larger batch size.