I would like to know more about the difference between on-demand and cached feature groups. Please correct me if my understanding is wrong:
On-demand: it builds a query that fetches the data from an external data source such as MySQL.
Cached: the data is stored directly in the feature group.
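To make my understanding concrete, here is a minimal sketch of how I picture the two types. It is not Hopsworks code: the class names `OnDemandFeatureGroup` and `CachedFeatureGroup` and the `customers` table are hypothetical, and `sqlite3` stands in for MySQL only so the example runs on its own.

```python
import sqlite3

# Stand-in for the external MySQL database (assumption: sqlite3 is used
# here only so the sketch is self-contained).
source = sqlite3.connect(":memory:")
source.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, spend REAL)")
source.executemany("INSERT INTO customers VALUES (?, ?)", [(1, 10.0), (2, 20.0)])

QUERY = "SELECT id, spend FROM customers ORDER BY id"


class OnDemandFeatureGroup:
    """Hypothetical model: keeps only the query and runs it on every read."""

    def __init__(self, conn, query):
        self.conn = conn
        self.query = query

    def read(self):
        return self.conn.execute(self.query).fetchall()


class CachedFeatureGroup:
    """Hypothetical model: keeps its own copy of the rows it was given."""

    def __init__(self):
        self.rows = []

    def insert(self, rows):
        self.rows = list(rows)

    def read(self):
        return list(self.rows)


on_demand = OnDemandFeatureGroup(source, QUERY)
cached = CachedFeatureGroup()
cached.insert(on_demand.read())  # copy taken at this point in time

# The external source changes after the cached copy was written.
source.execute("UPDATE customers SET spend = 99.0 WHERE id = 1")

print(on_demand.read())  # [(1, 99.0), (2, 20.0)]
print(cached.read())     # [(1, 10.0), (2, 20.0)]
```

In this picture, the on-demand read reflects the change in the source, while the cached copy stays as it was until something writes to it again.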
If my understanding is correct, I have some questions about them.
If I create a training dataset from an on-demand feature group, will the data be persisted in the training dataset, or will it be pulled from the source every time I read the training dataset?
Also, I cannot find any option on a cached feature group to pull data from a MySQL connection, so I assume an on-demand feature group is the only option for an external data source.
Our actual test case:
We are evaluating the Hopsworks Feature Store as part of our FW, so that the Data Science team can store their derived features in it. In one of the trainings from the Hopsworks team, we heard that even if the data changes in an external source such as MySQL, it is automatically updated in the feature group / training dataset.
But when I tried to test this scenario, I got confused: the on-demand feature group seems to fetch data only when we call it. In that case, how would the data be fetched and stored automatically?
If someone could shed some light on this, it would be very helpful.