google-cloud-python: Connection reset by peer (httplib2 issue)

I’ve been attempting to batch upload just over a million entities (split into groups of 100-500 at a time) into datastore but I keep running into a socket.error: [Errno 104] Connection reset by peer error. I’ve also been experiencing this when attempting to query and pull down all entities of a single kind.

Is there some throttling that should be happening that’s not? Any suggestions on how to better utilize the gcloud lib?

About this issue

  • Original URL
  • State: closed
  • Created 8 years ago
  • Comments: 20 (11 by maintainers)

Most upvoted comments

@mauliksoneji httplib2 was removed in PR #3674, more than two years ago. google-cloud-datastore 1.3.0 and later use requests instead.

See httplib2shim and the discussion in #1214 to just avoid using httplib2 all together

The main problem is likely that httplib2.Http keeps a cache