GPT-4o Finetuning Failed

Nas 0 Reputation points
2025-04-22T03:02:49.8666667+00:00

I'm receiving an error after preparing my data files following the guidelines (https://learn.microsoft.com/en-us/azure/ai-services/openai/tutorials/fine-tune?tabs=command-line).

During Preprocessing Files:

status : Training file: Preprocessing Summary: The provided data failed validation due to: contains invalid schema (22). Please visit our docs to learn how to resolve these issues, and try again. Details - Samples of lines per error type: contains invalid schema: Line numbers --> 1, 3, 4, 5, 6, 7, 9, 11, 12, 13, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26

After forcing my files to UTF-8 encoding without a BOM, I now get a different error that says only this:

File preprocessing failed
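Since the second message gives no line numbers, checking the file locally before uploading may help narrow things down. The following is a minimal sketch, not the service's own validator. It assumes the chat-style JSONL layout from the linked tutorial (one JSON object per line, each with a `messages` list of `role`/`content` entries). The names `check_line`, `check_file`, and `ALLOWED_ROLES`, and the exact set of roles, are assumptions made for illustration.

```python
import json
from typing import Dict, Optional

# Assumption: roles accepted in chat-style training data.
ALLOWED_ROLES = {"system", "user", "assistant", "tool", "function"}


def check_line(line: str) -> Optional[str]:
    """Return a description of the first problem in one JSONL line, or None."""
    try:
        record = json.loads(line)
    except json.JSONDecodeError as exc:
        return f"not valid JSON: {exc.msg}"
    if not isinstance(record, dict):
        return "top-level value is not an object"
    messages = record.get("messages")
    if not isinstance(messages, list) or not messages:
        return "missing or empty 'messages' list"
    for i, msg in enumerate(messages):
        if not isinstance(msg, dict):
            return f"messages[{i}] is not an object"
        if msg.get("role") not in ALLOWED_ROLES:
            return f"messages[{i}] has unexpected role {msg.get('role')!r}"
        # Assumption: plain chat data, where every message carries string
        # content. Tool-call messages may legitimately differ.
        if not isinstance(msg.get("content"), str):
            return f"messages[{i}] 'content' is not a string"
    return None


def check_file(path: str) -> Dict[int, str]:
    """Map 1-based line numbers to problems found in a JSONL file."""
    errors = {}
    # "utf-8-sig" silently drops a leading BOM if one is present.
    with open(path, encoding="utf-8-sig") as fh:
        for number, line in enumerate(fh, start=1):
            if not line.strip():
                errors[number] = "blank line"
                continue
            problem = check_line(line)
            if problem:
                errors[number] = problem
    return errors
```

Calling `check_file("training.jsonl")` (a hypothetical file name) returns an empty dict when every line passes these checks. A file that passes here can still be rejected by the service for reasons this sketch does not cover.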

Any input will be appreciated!
Thanks.

Azure Machine Learning
An Azure machine learning service for building and deploying models.

1 answer

  1. Nas 0 Reputation points
    2025-04-24T04:39:29.5833333+00:00

    Hey @Ravada Shivaprasad, thank you for your prompt response. I still see the same error, as can be seen in the images below. I validated the JSON schema in both the training and validation files, and both were fine.

    I noted that my JSON file contains a variety of characters, HTML tags, and Unicode escape sequences (like \ud83d\udc49). Will this affect the training?
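On the escape sequences: in JSON terms, `\ud83d\udc49` is a well-formed surrogate pair that decodes to the single character U+1F449, so on its own it should be valid. A high or low surrogate without its partner is different, because the decoded string cannot be written out as UTF-8. Whether the fine-tuning preprocessor rejects either case is not confirmed here. The sketch below only shows one way to look for unpaired surrogates locally, and the helper names are made up for illustration.

```python
import json


def has_lone_surrogate(text: str) -> bool:
    """True if the decoded string still contains a surrogate code point."""
    # json.loads merges valid \uD83D\uDC49-style pairs into one character,
    # so any code point left in the surrogate range was unpaired.
    return any(0xD800 <= ord(ch) <= 0xDFFF for ch in text)


def iter_strings(value):
    """Yield every string (keys and values) found inside a decoded JSON value."""
    if isinstance(value, str):
        yield value
    elif isinstance(value, dict):
        for key, item in value.items():
            yield key
            yield from iter_strings(item)
    elif isinstance(value, list):
        for item in value:
            yield from iter_strings(item)


def line_has_lone_surrogate(line: str) -> bool:
    """Check one JSONL line for unpaired \\uXXXX surrogate escapes."""
    return any(has_lone_surrogate(s) for s in iter_strings(json.loads(line)))
```

HTML tags and other ordinary characters inside a JSON string are just text as far as the JSON parser is concerned, provided quotes and backslashes are escaped correctly.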

    User's image

    User's image

