models: [BUG] Problems installing and running getting started notebooks
Bug description
Dependency conflict on pip install
Steps/Code to reproduce bug
- Create and activate rapids conda environment with
conda create -y -n rapids -c rapidsai -c nvidia -c conda-forge rapids=22.06 python=3.8 cudatoolkit=11.2 - Pip install as instructed in README
pip install merlin-models
Expected behavior
ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
dask-cudf 22.6.1 requires cupy-cuda115, which is not installed.
cudf 22.6.1 requires cupy-cuda115, which is not installed.
cudf-kafka 22.6.1 requires cython, which is not installed.
Running the first cell of the getting started notebook, Tensorflow is then missing
ModuleNotFoundError: No module named 'tensorflow'
I try to pip install this with pip install tensorflow>=2.8, and the this fails with the same error:
(rapids-merlin-models) azureuser@mason-v100-new:~/cloudfiles/code/Users/mason.cusack$ pip install tensorflow>=2.8
ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
cudf 22.6.1 requires cupy-cuda115, which is not installed.
cudf-kafka 22.6.1 requires cython, which is not installed.
cudf 22.6.1 requires protobuf<3.21.0a0,>=3.20.1, but you have protobuf 3.19.6 which is incompatible.
Environment details
- Merlin version: merlin-core==0.10.0, merlin-models==0.10.0
- Platform: Ubuntu 20.04
- Python version: 3.8
- PyTorch version (GPU?):
- Tensorflow version (GPU?): 2.11.0 (installed as in requirements >=2.8)
Additional context
About this issue
- Original URL
- State: closed
- Created a year ago
- Comments: 28 (15 by maintainers)
are you working with Mason? if no, can you please open a new bug ticket? thanks.
@masoncusack sorry for the inconvenience and agree on that
installing from source exposes us to breaking changes, so probably wouldn't be possible in production.. Merlin team is working on publishing pypi versions, hopefully that’d help to the users.Yes, but I don’t want to get in the habit of ignoring warnings. A lot of them I don’t understand
^Not sure whether this kind of thing is important and might break something later on or not.
So making sure we have full context.
I restarted and imports worked fine. Now in the second code cell (feature engineering with NVTabular) I’m getting the error
I’ll look into this further tomorrow. Sorry I am just pasting errors to you but there have been so many I’m not sure what is solvable without the understanding of a maintainer and what isn’t.
Is there a near-term plan to fix the installation process of this and Transformers4Rec, so we can just install a specific version and have all the sub dependencies installed and pinned? I’m concerned that installing from source exposes us to breaking changes, so probably wouldn’t be possible in production.