keras: Tensorflow backend - bug in model._make_predict_function(...)

There appears to be a bug in the make_predict_function for the tensorflow backend. The following error message appears for me when trying to call model.predict(…)

self._make_predict_function()
  File "/usr/local/lib/python3.4/dist-packages/keras/engine/training.py", line 679, in _make_predict_function
    **self._function_kwargs)
  File "/usr/local/lib/python3.4/dist-packages/keras/backend/tensorflow_backend.py", line 615, in function
    return Function(inputs, outputs, updates=updates)
  File "/usr/local/lib/python3.4/dist-packages/keras/backend/tensorflow_backend.py", line 589, in __init__
    with tf.control_dependencies(self.outputs):
  File "/usr/local/lib/python3.4/dist-packages/tensorflow/python/framework/ops.py", line 3192, in control_dependencies
    return get_default_graph().control_dependencies(control_inputs)
  File "/usr/local/lib/python3.4/dist-packages/tensorflow/python/framework/ops.py", line 2993, in control_dependencies
    c = self.as_graph_element(c)
  File "/usr/local/lib/python3.4/dist-packages/tensorflow/python/framework/ops.py", line 2291, in as_graph_element
    raise ValueError("Tensor %s is not an element of this graph." % obj)
ValueError: Tensor Tensor("Sigmoid_2:0", shape=(?, 17), dtype=float32) is not an element of this graph.

This does not happen when using the theano backend.

Notes: The model is loaded from json, and is defined as follows:

    seq1=Input(dtype='int32',shape=(400,),name='input_text')
    seq2=Input(dtype='int32',shape=(20,),name='input_titles')

    embeddeding=Embedding(max_features,embedding_dims,dropout=0.3)

    encoding_1=embeddeding(seq1)
    encoding_2=embeddeding(seq2)

    filter_lengths = [1,3,6]


    def max_1d(X):
        return K.max(X, axis=1)
    convs1=[]
    convs2=[]
    for fl in filter_lengths:

        conv1=Convolution1D(nb_filter=nb_filter,
                        filter_length=fl,
                        border_mode='valid',
                        activation='relu',
                        subsample_length=1)(encoding_1)
        conv1=Lambda(max_1d, output_shape=(nb_filter,))(conv1)
        convs1.append(conv1)

        conv2=Convolution1D(nb_filter=nb_filter,
                        filter_length=fl,
                        border_mode='valid',
                        activation='relu',
                        subsample_length=1)(encoding_2)
        conv2=Lambda(max_1d, output_shape=(nb_filter,))(conv2)
        convs2.append(conv2)

    m=merge([*convs1,*convs2],mode='concat')
    m=Highway(activation='relu')(m)
    m=Highway(activation='relu')(m)
    m=Dropout(0.5)(m)
    hovedkategori_loss=Dense(labsHovedKat.shape[1],activation='sigmoid',name='hovedkategori')(m)

    m1=merge([hovedkategori_loss,m],mode='concat')
    underkategori_loss=Dense(labsUnderKat.shape[1],activation='sigmoid',name='underkategori')(m1)

    model=Model(input=[seq1,seq2],output=[hovedkategori_loss,underkategori_loss])
    model.compile(optimizer='adam',loss='binary_crossentropy',metrics={'hovedkategori':'accuracy','underkategori':'accuracy'})

Check that you are up-to-date with the master branch of Keras. You can update with: pip install git+git://github.com/fchollet/keras.git --upgrade --no-deps
If running on Theano, check that you are up-to-date with the master branch of Theano. You can update with: pip install git+git://github.com/Theano/Theano.git --upgrade --no-deps

About this issue

Original URL
State: closed
Created 8 years ago
Reactions: 15
Comments: 108 (10 by maintainers)

Links to this issue

python - Keras, TensorFlow : "TypeError: Cannot interpret feed_dict key as Tensor" - Stack Overflow

Commits related to this issue

more polish — committed to Akababa/Chess-Zero by Akababa 7 years ago
possible fix to Cannot interpret feed_dict key as Tensor https://github.com/keras-team/keras/issues/2397 — committed to SreehariRamMohan/Heart-Sounds-Deep-Learning by SreehariRamMohan 7 years ago
fix keras bug on Flask see https://github.com/keras-team/keras/issues/2397 — committed to easyandme/img-process by easyandme 6 years ago
Traffic Light Detection. Detection. (#7) ## What? Updated TLClassifier Class. Added models with weights for detection and classification of TL Lights. Added JSON model for TL Detection because of ... — committed to ooleksyuk/CarND-Capstone by ooleksyuk 6 years ago
Bug fix for async not working, see: https://github.com/keras-team/keras/issues/2397 — committed to KimmoHernborg/sis by deleted user 6 years ago
Workaround for bug with Keras w/ Tensorflow backend - https://github.com/keras-team/keras/issues/2397 - Also added ability to issue multiple socketio messages after finishing an rsync request — committed to uas-at-ucla/suas-2019 by benlimpa 6 years ago
Fixed issue of using ROS+TF. It is a synchronization issue explained in keras-team repo https://github.com/keras-team/keras/issues/2397 — committed to AIRobolab-unilu/qt-pepper-emotion-classification by deleted user 6 years ago
Fix Flask keras bug In order for TensorFlow to run nicely multi-threaded, we need to retain the default Graph context manually With thanks to https://github.com/keras-team/keras/issues/2397 — committed to fatal-exception/project by fatal-exception 6 years ago
Workaround for bug with Keras w/ Tensorflow backend - https://github.com/keras-team/keras/issues/2397 - Also added ability to issue multiple socketio messages after finishing an rsync request Forme... — committed to uas-at-ucla/suas-2019 by benlimpa 6 years ago
Workaround for bug with Keras w/ Tensorflow backend - https://github.com/keras-team/keras/issues/2397 - Also added ability to issue multiple socketio messages after finishing an rsync request — committed to uas-at-ucla/suas-2019 by benlimpa 6 years ago
Workaround for bug with Keras w/ Tensorflow backend - https://github.com/keras-team/keras/issues/2397 - Also added ability to issue multiple socketio messages after finishing an rsync request — committed to uas-at-ucla/suas-2019 by benlimpa 6 years ago
Initial version of web api serving our TF model Currently single-threaded because of an issue in Keras: https://github.com/keras-team/keras/issues/2397. Will need to refactor this to properly serve t... — committed to robertvunabandi/mit-smart-confessions-data by citostyle 6 years ago
Update run_keras_server.py Added change to fix "Tensor is not an element of this graph", find reference here https://github.com/keras-team/keras/issues/2397#issuecomment-306687500 — committed to githubwei/simple-keras-rest-api by githubwei 5 years ago
predict 병렬처리 실패, pool을 사용해 predict 하려하였으나 타 thread 상황에서 학습모델의 graph 가 제대로 인식되지 않는 듯 하다. https://github.com/keras-team/keras/issues/2397 해결책이 있으나 keras 학습모델에 대한 이해부족으로 적용 실패 — committed to kakao-leo-jung/JCP-DS-Server by kakao-leo-jung 5 years ago
Make the model work in a multi-thread environment. - This is mainly done by using `graph.as_default()`. - Detailed discussions can be found at: https://github.com/keras-team/keras/issues/2397#issue... — committed to hunterhector/forte by hunterhector 4 years ago
Content Rewriter Demo (#208) * Write URIs in multi pack serialization instead of paths. * Add rewriter example. * add an example to read from local pack. * some ontology * Update example,... — committed to asyml/forte by hunterhector 4 years ago

Most upvoted comments

I had this problem when doing inference in a different thread than where I loaded my model. Here’s how I fixed the problem:

Right after loading or constructing your model, save the TensorFlow graph:

graph = tf.get_default_graph()

In the other thread (or perhaps in an asynchronous event handler), do:

global graph
with graph.as_default():
    (... do inference here ...)

I learned about this from https://www.tensorflow.org/versions/r0.11/api_docs/python/framework.html#get_default_graph

+644

avital on Oct 19, 2016

avital’s solution works!

keras with tensorflow backend.

Details: global thread:

        self.model = load_model(model_path)
        self.model._make_predict_function()
        self.graph = tf.get_default_graph()

another thread:

        with self.graph.as_default():
            labels = self.model.predict(data)

+182

jglwiz on Jun 7, 2017

Same problem here. Seems to work fine normally. When deployed as a webservice using Flask, get this error.

+99

Nr90 on Aug 26, 2016

@Froskekongen could you describe how you fixed this in more detail? I’m having an exactly the same error however in a different program.

It seems to work when I do it manually in a REPL, however when I deploy it as a webservice it breaks.

+58

jstypka on May 9, 2016

@avital Thanks! It works well with tensorflow, but my model was built on keras. How to fix this problem on keras?

+36

George-Zhu on Apr 27, 2017

If you’re using Flask, setting DEBUG=Falsein my Flask app settings solved this problem for me.

+35

shafy on Oct 23, 2017

It appears this bug had nothing to do with either keras or tensorflow, but rather how async events were handled by the webserver I am using.

+33

Froskekongen on Apr 20, 2016

Running the flask app with debug=False and threaded=False resolved the issue for me.

+32

rishabhbhardwaj on Apr 2, 2018

I think the proper way to handle this issue when loading multiple models is by manually specifying different graph and session for each loaded model. The code should be like this

graph = tf.Graph()
with graph.as_default():
       session = tf.Session()
       with session.as_default():
          ## your load model code
          ## for instance
          model = load_model('model.h5')

And you need to save each graph and session value to some variable for later prediction use.

graph_var = graph
session_var = session

and you can use above variables for prediction mode

with graph_var.as_default():
     with session_var.as_default():
            model.predict(INPUT)

Hopefully this can help.

+31

jurukode on Apr 30, 2018

参考这篇文章： https://zhuanlan.zhihu.com/p/27101000 就是说，当你引用模型后，随后进行一次预测，后面再用到时，就不会报错。

translate: refer to this article: https://zhuanlan.zhihu.com/p/27101000

To solve this problem, you have to predict once after load model ,like

import numpy as np
from keras.models import load_model

model = load_model('model_example.h5')
print('testing model:', model.predict(np.zeros((1, 299, 299, 3))))
...

+25

zhudaoruyi on Dec 27, 2017

to flesh out @rishabhbhardwaj answer, app.run(debug=False, threaded=False) fixes it for me

+15

morenoh149 on Aug 6, 2018

Probably noting that model._make_predict_function() will make Keras play nice with threads, and the additional with graph.as_default(): is only needed when you’re using anything with callbacks (by the look of it).

Shouldn’t this kind of stuff just be handled by keras though? Calling a private function is by definition a hack. It would be great if:

model.compile() should also call model._make_predict_function() by default
calls to model.predict(), model.train() do their own graph context handling.

+15

cshenton on Jul 23, 2017

class Serve(Base):

def start_socket_server(self, port, model_path):

    # Load model files
    self.model = load_model(model_path)
    self.model._make_predict_function()
    self.graph = tf.get_default_graph()

	...
	some code
	...

    while True:
        t = threading.Thread(target=self.client_thread, args=(conn, ))
        t.setDaemon(True)
        t.start()

def client_thread(self, conn):

	...
	some code
	...

    while True:
		...
		some code
		...
		# predict
		with self.graph.as_default():
			labels = self.model.predict(feature_segments_padded)

+14

jglwiz on Jul 16, 2017

Running the webservice with gunicorn in sync mode solved the issue.

+13

rkempter on May 27, 2016

I had the same issue. Although my code was written in an asynchronous style, there was a single worker, so for practical purposes it was pretty synchronous. Calling model._make_predict_function() after load_model did the trick, but I also got the impression that load_model should have called this for me.

+11

joaospinto on Aug 1, 2017

For me neither calling the private _make_predict_function() nor the withgraph.as_default() approach worked. Instead (as @j-o-d-o mentions) if I clear the session at the end of each async task, the problem is fixed.

from keras import backend as K
K.clear_session()

+10

DomHudson on Dec 4, 2017

Reopening, since it actually seems to be a problem that should be handled.

Froskekongen on Aug 3, 2017

the same problem and i solve it. thanks @avital my solution is https://justttry.github.io/justttry.github.io/not-an-element-of-Tensor-graph/

justttry on Jun 7, 2017

Same problem (model.predtict breaking) for me too, but it worked when i switched to theano backend from tensflow.

AbhishekAshokDubey on Aug 16, 2016

I also have the same error under the tensorflow backend, however, it works using the theano backend.
@jstypka @Froskekongen Have you found a solution to fix it?

pxlong on May 13, 2016

Not sure if it’s relevant to the original question, but maybe it’ll be useful to others:

Based on this answer, the following resolved tf’s multithreading compatibility for me:

# on thread 1
session = tf.Session(graph=tf.Graph())
with session.graph.as_default():
    k.backend.set_session(session)
    model = k.models.load_model(filepath)

# on thread 2
with session.graph.as_default():
    k.backend.set_session(session)
    model.predict(x, **kwargs)

The novelty here is keeping both the Session and the Graph for other threads. The model is loaded in their “context” (instead of the default ones) and kept for other threads to use. (By default the model is loaded to the default Session and the default Graph) Another plus is that they’re kept in the same object - easier to handle.

eliadl on Aug 7, 2019

For people who don’t want to import tf you can use directly the backend like:

graph = K.get_session().graph

The rest is the same.

roccolocko on Nov 14, 2017

@shafy Thanks, your solution of setting debug=False worked in my flask service. I didn’t need to use tf.get_default_graph or any other solution.

ashkat on Nov 1, 2017

@shafy Thanks a lot! This is very useful information for everybody running keras on any Flask app…

tushuhei on Oct 28, 2017

I had this issue when running a model from a flask web application.

It turned out that I had configured it to run with gunicorn and gevent, something like this: gunicorn wsgi_search_service:app --bind 0.0.0.0:80 --config gunicorn_conf.py --workers 3 --worker-class gevent --reload

Removing --worker-class gevent from the above statement has resolved this problem.

I assume that gevent serializes data for its async operation - and as such, it has tried to serialize data related to the Keras model, which actually couldn’t be serialized.

vklap on Feb 12, 2018

After 3 hours of trying I also solved the issue thanks to @zhudaoruyi.

It turned out that this error occurs if the first call of model.predict is not on the same thread as the model instantiation.

Possible solution like @zhudaoruyi said is to call model.predict right after creating the model (on the same thread) or instantiate the model lazily right before the first prediction request (so the two calls will be on the same thread)

matesmajda on Jan 19, 2018

same here, same issue! Works fine in REPL, issues running it behind a webservice.

rkempter on May 27, 2016

If anyone is still struggling with Keras, here is what I’ve done:

from keras import backend
with backend.get_session().graph.as_default() as g:
    model = load_model(MODEL_PATH)
    prediction = prediction_function(model, data)

bartkowiaktomasz on Jul 17, 2018

For me the is only solvable if i load the model inside the flask @app.route POST method. Which means I reload the model on every request which is very inefficient. Loading the model as a global either in the beginning of the flask app script or as global in the main(), does not work.

Any ideas on how to solve this?

gustavz on Oct 10, 2019

I’m still having this issue and the recommended fixes do not work for me. The only thing that works is loading the model and making a prediction with each prediction request. I’m requesting that this issue be reopened as no actual solution has been proposed, just several workarounds that seem to work intermittently.

brandonschabell on Dec 28, 2018

When working with (async) Webservices and you get this error from time to time when loading and using a model on different requests, don’t forget K.clear_session()!

aeon0 on Nov 15, 2017

Amazing solution!! I also encounter this problem. In my way, I define the model in the model.py. On the other hand, I create the model instance in main.py Just do the revision in the main.py!!!

SunnerLi on Jun 5, 2017

@SiddhardhaSaran hi dear. i know what you mean. but in my case, on uwsgi even i setting one process one thread it still no work, so i use gunicorn. anyway thanks your good advice.

shaoeChen on Mar 11, 2019

@SiddhardhaSaran thanks for reply. but using lazy-apps=true will make your system overload, if your thread is more than one. now i use gunicorn, it work well to me.

shaoeChen on Mar 8, 2019

Agreed, this is not really resolved. The current best solution is @dyngts which is basically “manage the backend’s sessions yourself”. My opinion is that keras should provide an API to predict and train one or more models without having to import tensorflow or theano separately to deal with their session management, thread locals or equivalent internals used for managing graphs.

DomHudson on Nov 6, 2018

@bdnettleton you might want to consider my approach here https://github.com/keras-team/keras/issues/2397#issuecomment-385317242 which provide more isolated graph leveraging session

jurukode on Nov 6, 2018

This worked in the default flask development server, but it’s not working and the program is stuck/hangs at “predict” method in uwsgi + nginx with flask.

SubigyaPanta on Oct 2, 2018

avital’s solution works!

keras with tensorflow backend.

Details: global thread:

        self.model = load_model(model_path)
        self.model._make_predict_function()
        self.graph = tf.get_default_graph()

another thread:

        with self.graph.as_default():
            labels = self.model.predict(data)

When I try this solution I got this Error InvalidArgumentError (see above for traceback): You must feed a value for placeholder tensor ‘Placeholder_366’ with dtype float and shape [2] [[Node: Placeholder_366 = Placeholderdtype=DT_FLOAT, shape=[2], _device=“/job:localhost/replica:0/task:0/device:CPU:0”]]

Akhtar303nu on Sep 14, 2018

In Flask I had to move my Rasa Agent setup code (which is using Keras with Tensorflow backend to predict) to the @app.before_first_request and then store the agent in the app variable to fix this issue for my project. Thank for everybody contributing above, it helped me a lot.

zoltan-fedor on Aug 13, 2018

@shafy Disabling debug mode in flask works, thanks a lot!

clduan on Nov 28, 2017