Documentation on Connecting to S3

HI Team,

Can we get some documentation on using the S3 connection with Feature Group using the new library hsfs?

When trying to use the hops library for import_featuregroup_s3 we are ending up with the following error, seems it is library issue with hopsworks 2.1 version

Code:

featurestore.import_featuregroup_s3(“s3_connector”, “fg”, “raw_data”,
description=“House sale transactions in Sacramento”,
featurestore=featurestore.project_featurestore(),
featuregroup_version=1,
data_format=“csv”)

Error:

An error was encountered:
‘descStatsEnabled’
Traceback (most recent call last):
File “/srv/hops/anaconda/envs/theenv/lib/python3.7/site-packages/hops/featurestore.py”, line 2067, in import_featuregroup_s3
core._get_featurestore_metadata(featurestore, update_cache=True),
File “/srv/hops/anaconda/envs/theenv/lib/python3.7/site-packages/hops/featurestore_impl/core.py”, line 115, in _get_featurestore_metadata
metadata_cache = FeaturestoreMetadata(response_object)
File “/srv/hops/anaconda/envs/theenv/lib/python3.7/site-packages/hops/featurestore_impl/dao/common/featurestore_metadata.py”, line 25, in init
online_featurestore_connector = self._parse_featurestore_metadata(metadata_json)
File “/srv/hops/anaconda/envs/theenv/lib/python3.7/site-packages/hops/featurestore_impl/dao/common/featurestore_metadata.py”, line 53, in _parse_featurestore_metadata
fg_obj = Featuregroup(fg)
File “/srv/hops/anaconda/envs/theenv/lib/python3.7/site-packages/hops/featurestore_impl/dao/featuregroups/featuregroup.py”, line 34, in init
constants.REST_CONFIG.JSON_FEATUREGROUP_DESCRIPTIVE_STATISTICS_ENABLED]
KeyError: ‘descStatsEnabled’

Thanks,
Guru.

Hi @Arumugaguru_M,

You can follow examples how to configure S3 connector and ingest data to feature store here hops-examples/S3-Ingest-to-Feature-Store-basics.ipynb at master · logicalclocks/hops-examples · GitHub

Here is documentation about storage connectors Storage Connector - Hopsworks Documentation

Best regards,
Davit

1 Like