-
Notifications
You must be signed in to change notification settings - Fork 3.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CUDA Container Build Error (Ubuntu 20.04 base image) #5415
Comments
Thanks for using LightGBM, and for the excellent write-up with a reproducible example! The issue you're facing is because of this line: git clone --recursive --branch v3.1.1 --depth 1 https://github.com/microsoft/LightGBM The I recommend removing |
Hi @jameslamb, Thank you for your quick reply! Now I am getting an error that LightGBM cannot be found. I added the stage Build log
Dockerfile
|
Try changing python3 -m pip show lightgbm to pip3 show lightgbm I'm not familiar with the base image you're using, but it seems to me that your code cares about the difference between |
I just use |
I see, ok. Glad that worked for you! Thanks for using LightGBM, and for taking the time to come back and close this with an explanation that other people can find from search engines 😊 |
This issue has been automatically locked since there has not been any recent activity since it was closed. To start a new related discussion, open a new issue at https://github.com/microsoft/LightGBM/issues including a reference to this. |
Description
Building LightGBM and the Python-interface goes smoothly. However, upon test I receive a "ModuleNotFound" error. My goal is to compile LightGBM for CUDA 11.5 for use with TensorFlow 2.9.1-gpu base image. Docker build output is:
Reproducible example
Environment info
GPU: NVIDIA A100
Host OS: CentOS 7
Architecture: x86_64
LightGBM version or commit hash
v3.3.2
Command(s) you used to install LightGBM
Please see the dockerfile. Found install instructions at lightgbm.readthedocs.io. Note: the flag
-DUSE_CUDA_EXP=1
gets the following warning duringdocker build
, so I used-DUSE_CUDA=1
instead.I double checked the Python-Interface instructions on Github. Note that useing
python3 setup.py install --cuda-exp
yields the following error duringdocker build
.Additional Comments
If the line
RUN /bin/bash -c "cd /usr/local/src/lightgbm/LightGBM/python-package && python3 setup.py install --cuda --opencl-include-dir=/usr/local/cuda/include/ --opencl-library=/usr/local/cuda/lib64/libOpenCL.so"
is replaced withRUN pip3 install lightgbm
then the image builds.So somehow following the instructions isn't putting LightGBM into site packages?
The text was updated successfully, but these errors were encountered: