models: [BUG] Problems installing and running getting started notebooks
Bug description
Dependency conflict on pip install
Steps/Code to reproduce bug
- Create and activate rapids conda environment with
conda create -y -n rapids -c rapidsai -c nvidia -c conda-forge rapids=22.06 python=3.8 cudatoolkit=11.2
- Pip install as instructed in README
pip install merlin-models
Expected behavior
ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
dask-cudf 22.6.1 requires cupy-cuda115, which is not installed.
cudf 22.6.1 requires cupy-cuda115, which is not installed.
cudf-kafka 22.6.1 requires cython, which is not installed.
Running the first cell of the getting started notebook, Tensorflow is then missing
ModuleNotFoundError: No module named 'tensorflow'
I try to pip install this with pip install tensorflow>=2.8
, and the this fails with the same error:
(rapids-merlin-models) azureuser@mason-v100-new:~/cloudfiles/code/Users/mason.cusack$ pip install tensorflow>=2.8
ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
cudf 22.6.1 requires cupy-cuda115, which is not installed.
cudf-kafka 22.6.1 requires cython, which is not installed.
cudf 22.6.1 requires protobuf<3.21.0a0,>=3.20.1, but you have protobuf 3.19.6 which is incompatible.
Environment details
- Merlin version: merlin-core==0.10.0, merlin-models==0.10.0
- Platform: Ubuntu 20.04
- Python version: 3.8
- PyTorch version (GPU?):
- Tensorflow version (GPU?): 2.11.0 (installed as in requirements >=2.8)
Additional context
About this issue
- Original URL
- State: closed
- Created a year ago
- Comments: 28 (15 by maintainers)
are you working with Mason? if no, can you please open a new bug ticket? thanks.
@masoncusack sorry for the inconvenience and agree on that
installing from source exposes us to breaking changes, so probably wouldn't be possible in production.
. Merlin team is working on publishing pypi versions, hopefully that’d help to the users.Yes, but I don’t want to get in the habit of ignoring warnings. A lot of them I don’t understand
^Not sure whether this kind of thing is important and might break something later on or not.
So making sure we have full context.
I restarted and imports worked fine. Now in the second code cell (feature engineering with NVTabular) I’m getting the error
I’ll look into this further tomorrow. Sorry I am just pasting errors to you but there have been so many I’m not sure what is solvable without the understanding of a maintainer and what isn’t.
Is there a near-term plan to fix the installation process of this and Transformers4Rec, so we can just install a specific version and have all the sub dependencies installed and pinned? I’m concerned that installing from source exposes us to breaking changes, so probably wouldn’t be possible in production.