Hi Hopsworks Community!
I recently encountered an issue while working on a project and was hoping to get some insights from this forum. I am getting a RestAPIError when running both
“X_train, X_test, y_train, y_test = fv.train_test_split(test_size=0.2)”
and
“X_train, y_train, X_test, y_test = fv.get_train_test_split(training_dataset_version=1)”.
I tried the second one after seeing training data versions accumulate in the feature view overview section on the web site.
The error message is as follows:
RestAPIError: Metadata operation error: (url: https://c.app.hopsworks.ai/hopsworks-api/api/project/38123/featurestores/38017/featureview/historical_data_21/version/1/trainingdatasets/version/1/statistics). Server response:
HTTP code: 400, HTTP reason: Bad Request, error code: 270137, error msg: Error saving statistics, user msg: Not a valid JSON
Which leads me to a JSON response that says:
{"type":"restApiJsonResponse","errorCode":200003,"errorMsg":"Authorization header not set."}
This issue is a little weird, as the same code works seamlessly with a similar, albeit smaller, dataset and feature group. Additionally, I noticed that training data versions are accumulating in the feature views overview section on the Hopsworks website with the larger dataset regardless of the error. Makes me wonder if it’s just a limitation of the free API.
For context, I am working from the loan approval training pipeline notebook (with my data) that was covered in the workshop held in Seattle a couple of weeks ago, and I am running Hopsworks 3.0.5 in Python 3.9.0.
Has anyone else faced this issue or have any suggestions on how to resolve it? I would greatly appreciate any help or guidance on this matter.
Thanks in advance!
Trent Leslie