tensorflow: fatal error: cuda/include/cuda.h: No such file or directory

System information

Have I written custom code (as opposed to using a stock example script provided in TensorFlow): Yes, I followed the official documentation for custom operations.
OS Platform and Distribution (e.g., Linux Ubuntu 16.04):

Linux pc 4.4.0-81-generic #104-Ubuntu SMP Wed Jun 14 08:17:06 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux

TensorFlow installed from (source or binary): tried both
TensorFlow version (use command below):

version	status	comment
(‘v1.2.0-rc1-7529-g8a4d849’, ‘1.4.0’)	not working	from pip or from source
(‘v1.2.0-rc2-21-g12f033d’, ‘1.2.0’)	working	from pypip

Python version: 2.7 (irrelevant)
Bazel version (if compiling from source):

Build label: 0.6.1
Build target: bazel-out/local-fastbuild/bin/src/main/java/com/google/devtools/build/lib/bazel/BazelServer_deploy.jar
Build time: Thu Oct 5 21:54:59 2017 (1507240499)
Build timestamp: 1507240499
Build timestamp as int: 1507240499

GCC/Compiler version (if compiling from source): irrelevant tried both
- g++4.8
- g++5.0
CUDA/cuDNN version: irrelevant

nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2016 NVIDIA Corporation
Built on Sun_Sep__4_22:14:01_CDT_2016
Cuda compilation tools, release 8.0, V8.0.44

GPU model and memory: irrelevant
Exact command to reproduce:

cd /tmp
mkdir tf_issue
cd tf_issue
virtualenv test
source test/bin/activate
pip install tensorflow # in some way: either tensorflow-gpu or from wheel package created by bazel
git clone https://github.com/cgtuebingen/tf_custom_op
cd tf_custom_op
cmake .
make

Describe the problem

Compiling custom ops with

#include "tensorflow/core/util/cuda_kernel_helper.h"

fails due to missing files

/code/lib/python2.7/site-packages/tensorflow/include/tensorflow/core/util/cuda_kernel_helper.h:24:31: fatal error: cuda/include/cuda.h: No such file or directory

This file cuda/include/cuda.h does not exists! Neither in the pip package nor in the git repository. Removing #include "tensorflow/core/util/cuda_kernel_helper.h"

gives plenty of other issues

/code/kernels/matrix_add_kernel.cu(13): error: namespace "tensorflow" has no member "CudaLaunchConfig"
/code/kernels/matrix_add_kernel.cu(59): error: namespace "tensorflow" has no member "CudaLaunchConfig"
/code/kernels/matrix_add_kernel.cu(59): error: expected a ";"
/code/kernels/matrix_add_kernel.cu(63): error: identifier "cfg" is undefined
/code/kernels/matrix_add_kernel.cu(88): error: namespace "tensorflow" has no member "CudaLaunchConfig"
/code/kernels/matrix_add_kernel.cu(88): error: expected a ";"
/code/kernels/matrix_add_kernel.cu(92): error: identifier "cfg" is undefined

As I already wrote in a related issue #12860, the commit 2c598e874e6a7b6b3185846ce9bac97a7d5d0169 is causing this issue by changing

-#include "third_party/gpus/cuda/include/cuda.h"
+#include "cuda/include/cuda.h"

Copying the old cuda.h gives

[...]/local/lib/python2.7/site-packages/tensorflow/include/tensorflow/core/platform/default/mutex.h:25:22: fatal error: nsync_cv.h: No such file or directory

which does not exist, too.

This problem is not related to custom code, it is related to ignore/omitting files in commit 2c598e874e6a7b6b3185846ce9bac97a7d5d0169

As mention in #12860, this affects many people. In fact, the entire way of writing customs ops with CUDA seems to be broken. Copying own source-code to the TensorFlow-repo was not necessary until TF1.3. Interestingly, even recent NIPS paper implementations state in their readme, they only support TFv1.2. I don’t think the proposed workaround of downgrading to TFv1.2 should be the way to go.

About this issue

Original URL
State: closed
Created 7 years ago
Reactions: 8
Comments: 32 (19 by maintainers)

Commits related to this issue

fix the error: No OpKernel was registered to support Op 'ExampleOp'. But encounter with another one: cuda/include/cuda.h: No such file or directory. As mentioned in https://github.com/tensorflow/tenso... — committed to FrankWork/tf-tutorials by FrankWork 7 years ago
add more include paths to work in TF1.4 (not working currently due to issue tensorflow/tensorflow#15002 — committed to cgtuebingen/tf_custom_op by PatWie 6 years ago

Most upvoted comments

@gunan I think you find the solution. I have just tried the new command without -D GOOGLE_CUDA=1. I compiled a custom op with tf rc1.5 installed from source and pip. No error occurred and the custom op just passed the test on both cup and gpu. 😆😆😆 Anyone help to try this solution with other versions of tf?

+11

Queequeg92 on Jan 19, 2018

Fixed this by specifying paths using -I while running the nvcc command. Another error popped up which required a cuda_config.h file. Removing -D GOOGLE_CUDA=1 solved this.

mathpopo on Dec 5, 2018

Removing the flag simply removes the entire CUDA part which is not most people want to do.

PatWie on Dec 5, 2018

Looking at the documentation, it looks like we recommend setting -DGOOGLE_CUDA=1 when building the ops. I am surprised we need that. That macro then triggers PLATFORM_GOOGLE which should only turn true internally, and that causes the error we see about cuda_config.h externally, I think it may be something we would like to avoid. @martinwicke any ideas on GOOGLE_CUDA macro?

gunan on Jan 18, 2018

What happens if you add -I/usr or wherever your cuda installation is to your build command?

gunan on Jan 3, 2018

We now build pip packages starting from 1.5.0rc1 with cuda_config.h included.

av8ramit on Jan 17, 2018

FYI everything works now fine using a Makefile or CMake on the master branch at commit https://github.com/tensorflow/tensorflow/commit/4cb0c13c7779da536cac6c682180c5757611b384

Most parts can be hidden from the user. The only missing thing is indeed the cuda_config.h (see Line in CMakeLists.txt) which must be hard-coded until (#12860) is resolved.

Btw, this missing file only contains:

// DO NOT EDIT: automatically generated file
#ifndef CUDA_CUDA_CONFIG_H_
#define CUDA_CUDA_CONFIG_H_

#define TF_CUDA_CAPABILITIES CudaVersion("3.0")

#define TF_CUDA_VERSION "8.0"
#define TF_CUDNN_VERSION "5"

#define TF_CUDA_TOOLKIT_PATH "/usr/local/cuda-8.0"

#endif  // CUDA_CUDA_CONFIG_H_

And neither TF_CUDNN_VERSION nor TF_CUDA_TOOLKIT_PATH is correct, although TensorFlow works correctly and I set the configs correctly during the compilation.

Not sure, if this resolves this issue of missing files in the pip package.

PatWie on Jan 9, 2018

We should import some GPU headers in our custom op test once this gets fixed, but I have no idea how the CUDA headers should be packaged and unfortunately I don’t have time to dig in to it.

@flx42 @gunan know anyone who can work on our header packaging for CUDA custom ops? Not quite sure who broke it, but it would be good to have fixed.

allenlavoie on Dec 1, 2017