MachineLearningNotebooks: Re-registering a dataset does not create new version
When trying to register a dataset with the flag create_new_version=True, I get an error instead of creating a new version.
I get the following error:
File "register.py", line 110, in register_dataset
ds.register(workspace, name=name, create_new_version=True)
File "/data/anaconda/envs/cyril/lib/python3.6/site-packages/azureml/data/_loggerfactory.py", line 106, in wrapper
return func(*args, **kwargs)
File "/data/anaconda/envs/cyril/lib/python3.6/site-packages/azureml/data/abstract_dataset.py", line 313, in register
raise result
Exception: An identical dataset had already been registered, which can be retrieved with `Dataset.get_by_name(workspace, name="cases_train_data.csv", version=1)`.
Here is my call to register the data set:
ds = Dataset.Tabular.from_delimited_files(path)
ds.register(workspace, name=name, create_new_version=True)
SDK Version: azureml-sdk==1.2.0
About this issue
- Original URL
- State: closed
- Created 4 years ago
- Reactions: 1
- Comments: 19 (4 by maintainers)
@jmwoloso, is there a way that I can reproduce your issue? I can’t produce it locally. Thanks!
Version incrementing happens, however, amls seems to simply append the new data to the existing previous versions which seems like a bug.
Nevermind! I am sorry I thought I was registering the same data set under the same name but I was wrong by a single letter. Sorry! This is all resolved! Thanks for all your time!
@stemoor we will review your feedback and get back to you shortly. Thanks.