AWS SageMaker unable to parse CSV

Question:

I’m trying to run a training job on AWS SageMaker, but it keeps failing with the following error:

ClientError: Unable to parse csv: rows 1-5000, file /opt/ml/input/data/train/KMeans_data.csv

I’ve selected ‘text/csv’ as the content type, and my CSV file contains 5 columns of numerical data with a text header row.

Can anyone point out what could be going wrong here?

Thanks!

Answer:

According to https://docs.aws.amazon.com/sagemaker/latest/dg/cdf-training.html, a CSV file used for training must not have a header row:

Amazon SageMaker requires that a CSV file doesn’t have a header record …

Try removing the header row.
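
If you would rather not edit the file by hand, here is a minimal sketch of stripping the header with Python's standard csv module. It assumes the header is exactly the first row of the file. The function name strip_header and the output file name in the note below are made up for illustration.

```python
import csv


def strip_header(src_path, dst_path):
    """Copy a CSV file to dst_path, dropping its first (header) row."""
    with open(src_path, newline="") as src, open(dst_path, "w", newline="") as dst:
        reader = csv.reader(src)
        writer = csv.writer(dst, lineterminator="\n")
        # Assumption: the header is the first row. next() with a default
        # does nothing harmful if the file is empty.
        next(reader, None)
        # Write every remaining row unchanged.
        writer.writerows(reader)
```

For example, strip_header("KMeans_data.csv", "KMeans_data_noheader.csv") should produce a copy without the header, which you could then upload in place of the original and point the training job at.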
