Stable-Diffusion-WebUI-TensorRT: SDXL: RuntimeError: Expected all tensors to be on the same device

Hi,

i successfully installed and configured this extension according to the installation instructions

“Generate Default Engines” went well and created the unet.

But when selecting it with base SDXL model an error occurs when generating the image:

RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cpu and cuda:0! (when checking argument for argument mat1 in method wrapper_CUDA_addmm)

Generating without the TensorRT unet still works fine.

System: Windows 10, RTX 4090, Nvidia Driver Version 545.84 WebUI version v1.6.0 (Running in Docker Container) • python: 3.10.9 • torch: 2.0.1+cu118 • xformers: 0.0.21.dev544 • gradio: 3.41.2 • checkpoint: e6bb9ea85b

Any idea what could cause this?

Complete Log: 2023-10-20 13:51:39 Activating unet: [TRT] base_sd_sd_xl_base_1.0_VAEFix 2023-10-20 13:51:39 Loading TensorRT engine: /stable-diffusion-webui/models/Unet-trt/base_sd_sd_xl_base_1.0_VAEFix_be9edd61_cc89_sample=1x4x96x96+2x4x128x128+8x4x128x128-timesteps=1+2+8-encoder_hidden_states=1x77x2048+2x77x2048+8x154x2048-y=1x2816+2x2816+8x2816.trt 2023-10-20 13:51:39 [I] Loading bytes from /stable-diffusion-webui/models/Unet-trt/base_sd_sd_xl_base_1.0_VAEFix_be9edd61_cc89_sample=1x4x96x96+2x4x128x128+8x4x128x128-timesteps=1+2+8-encoder_hidden_states=1x77x2048+2x77x2048+8x154x2048-y=1x2816+2x2816+8x2816.trt 2023-10-20 13:51:51 Profile 0: 2023-10-20 13:51:51 sample = [(1, 4, 96, 96), (2, 4, 128, 128), (8, 4, 128, 128)] 2023-10-20 13:51:51 timesteps = [(1,), (2,), (8,)] 2023-10-20 13:51:51 encoder_hidden_states = [(1, 77, 2048), (2, 77, 2048), (8, 154, 2048)] 2023-10-20 13:51:51 y = [(1, 2816), (2, 2816), (8, 2816)] 2023-10-20 13:51:51 latent = [(115), (115), (0)] 2023-10-20 13:51:51 0% 0/30 [00:00<?, ?it/s] 2023-10-20 13:51:52 *** Error completing request 2023-10-20 13:51:52 *** Arguments: ('task(5w0agwfsqv0h9gs)', 'a cat in a park', '', ['SDXL: Photographic'], 30, 'DPM++ 2M Karras', 1, 1, 7, 1024, 1024, False, 0.7, 2, '4x-UltraSharp', 0, 0, 0, 'Use same checkpoint', 'Use same sampler', '', '', [], <gradio.routes.Request object at 0x7fada8bfcf70>, 0, False, '', 0.8, -1, False, -1, 0, 0, 0, 0, False, False, {'ad_model': 'face_yolov8n.pt', 'ad_prompt': '', 'ad_negative_prompt': '', 'ad_confidence': 0.3, 'ad_mask_k_largest': 0, 'ad_mask_min_ratio': 0, 'ad_mask_max_ratio': 1, 'ad_x_offset': 0, 'ad_y_offset': 0, 'ad_dilate_erode': 4, 'ad_mask_merge_invert': 'None', 'ad_mask_blur': 4, 'ad_denoising_strength': 0.4, 'ad_inpaint_only_masked': True, 'ad_inpaint_only_masked_padding': 32, 'ad_use_inpaint_width_height': False, 'ad_inpaint_width': 512, 'ad_inpaint_height': 512, 'ad_use_steps': False, 'ad_steps': 28, 'ad_use_cfg_scale': False, 'ad_cfg_scale': 7, 'ad_use_checkpoint': False, 'ad_checkpoint': 'Use same checkpoint', 'ad_use_vae': False, 'ad_vae': 'Use same VAE', 'ad_use_sampler': False, 'ad_sampler': 'Euler a', 'ad_use_noise_multiplier': False, 'ad_noise_multiplier': 1, 'ad_use_clip_skip': False, 'ad_clip_skip': 1, 'ad_restore_face': False, 'ad_controlnet_model': 'None', 'ad_controlnet_module': 'inpaint_global_harmonious', 'ad_controlnet_weight': 1, 'ad_controlnet_guidance_start': 0, 'ad_controlnet_guidance_end': 1, 'is_api': ()}, {'ad_model': 'None', 'ad_prompt': '', 'ad_negative_prompt': '', 'ad_confidence': 0.3, 'ad_mask_k_largest': 0, 'ad_mask_min_ratio': 0, 'ad_mask_max_ratio': 1, 'ad_x_offset': 0, 'ad_y_offset': 0, 'ad_dilate_erode': 4, 'ad_mask_merge_invert': 'None', 'ad_mask_blur': 4, 'ad_denoising_strength': 0.4, 'ad_inpaint_only_masked': True, 'ad_inpaint_only_masked_padding': 32, 'ad_use_inpaint_width_height': False, 'ad_inpaint_width': 512, 'ad_inpaint_height': 512, 'ad_use_steps': False, 'ad_steps': 28, 'ad_use_cfg_scale': False, 'ad_cfg_scale': 7, 'ad_use_checkpoint': False, 'ad_checkpoint': 'Use same checkpoint', 'ad_use_vae': False, 'ad_vae': 'Use same VAE', 'ad_use_sampler': False, 'ad_sampler': 'Euler a', 'ad_use_noise_multiplier': False, 'ad_noise_multiplier': 1, 'ad_use_clip_skip': False, 'ad_clip_skip': 1, 'ad_restore_face': False, 'ad_controlnet_model': 'None', 'ad_controlnet_module': 'inpaint_global_harmonious', 'ad_controlnet_weight': 1, 'ad_controlnet_guidance_start': 0, 'ad_controlnet_guidance_end': 1, 'is_api': ()}, False, 'MultiDiffusion', False, True, 1024, 1024, 96, 96, 48, 4, 'None', 2, False, 10, 1, 1, 64, False, False, False, False, False, 0.4, 0.4, 0.2, 0.2, '', '', 'Background', 0.2, -1.0, False, 0.4, 0.4, 0.2, 0.2, '', '', 'Background', 0.2, -1.0, False, 0.4, 0.4, 0.2, 0.2, '', '', 'Background', 0.2, -1.0, False, 0.4, 0.4, 0.2, 0.2, '', '', 'Background', 0.2, -1.0, False, 0.4, 0.4, 0.2, 0.2, '', '', 'Background', 0.2, -1.0, False, 0.4, 0.4, 0.2, 0.2, '', '', 'Background', 0.2, -1.0, False, 0.4, 0.4, 0.2, 0.2, '', '', 'Background', 0.2, -1.0, False, 0.4, 0.4, 0.2, 0.2, '', '', 'Background', 0.2, -1.0, False, 3072, 192, True, True, True, False, <scripts.controlnet_ui.controlnet_ui_group.UiControlNetUnit object at 0x7fada8b870a0>, <scripts.controlnet_ui.controlnet_ui_group.UiControlNetUnit object at 0x7fada25f4d90>, <scripts.controlnet_ui.controlnet_ui_group.UiControlNetUnit object at 0x7fada9341180>, False, 1, 0.15, False, 'OUT', ['OUT'], 5, 0, 'Bilinear', False, 'Bilinear', False, 'Lerp', '', '', False, False, None, True, 'from modules.processing import process_images\n\np.width = 768\np.height = 768\np.batch_size = 2\np.steps = 10\n\nreturn process_images(p)', 2, False, False, 'positive', 'comma', 0, False, False, '', 1, '', [], 0, '', [], 0, '', [], True, False, False, False, 0, False, True, True, False, '#000000', False, 'Not set', True, True, '', '', '', '', '', 1.3, 'Not set', 'Not set', 1.3, 'Not set', 1.3, 'Not set', 1.3, 1.3, 'Not set', 1.3, 'Not set', 1.3, 'Not set', 1.3, 'Not set', 1.3, 'Not set', 1.3, 'Not set', False, 'None', None, None, False, None, None, False, None, None, False, 50, False, 4.0, '', 10.0, 'Linear', 3, False, 30.0, True, False, False, 0, 0.0, 'Lanczos', 1, True, 0, 0, 0.001, 75, 0.0, False, True, 'Illustration', 'svg', True, True, False, 0.5, False, 16, True, 16) {} 2023-10-20 13:51:52 Traceback (most recent call last): 2023-10-20 13:51:52 File "/stable-diffusion-webui/modules/call_queue.py", line 57, in f 2023-10-20 13:51:52 res = list(func(*args, **kwargs)) 2023-10-20 13:51:52 File "/stable-diffusion-webui/modules/call_queue.py", line 36, in f 2023-10-20 13:51:52 res = func(*args, **kwargs) 2023-10-20 13:51:52 File "/stable-diffusion-webui/modules/txt2img.py", line 55, in txt2img 2023-10-20 13:51:52 processed = processing.process_images(p) 2023-10-20 13:51:52 File "/stable-diffusion-webui/modules/processing.py", line 732, in process_images 2023-10-20 13:51:52 res = process_images_inner(p) 2023-10-20 13:51:52 File "/stable-diffusion-webui/extensions/sd-webui-controlnet/scripts/batch_hijack.py", line 42, in processing_process_images_hijack 2023-10-20 13:51:52 return getattr(processing, '__controlnet_original_process_images_inner')(p, *args, **kwargs) 2023-10-20 13:51:52 File "/stable-diffusion-webui/modules/processing.py", line 867, in process_images_inner 2023-10-20 13:51:52 samples_ddim = p.sample(conditioning=p.c, unconditional_conditioning=p.uc, seeds=p.seeds, subseeds=p.subseeds, subseed_strength=p.subseed_strength, prompts=p.prompts) 2023-10-20 13:51:52 File "/stable-diffusion-webui/modules/processing.py", line 1140, in sample 2023-10-20 13:51:52 samples = self.sampler.sample(self, x, conditioning, unconditional_conditioning, image_conditioning=self.txt2img_image_conditioning(x)) 2023-10-20 13:51:52 File "/stable-diffusion-webui/modules/sd_samplers_kdiffusion.py", line 235, in sample 2023-10-20 13:51:52 samples = self.launch_sampling(steps, lambda: self.func(self.model_wrap_cfg, x, extra_args=self.sampler_extra_args, disable=False, callback=self.callback_state, **extra_params_kwargs)) 2023-10-20 13:51:52 File "/stable-diffusion-webui/modules/sd_samplers_common.py", line 261, in launch_sampling 2023-10-20 13:51:52 return func() 2023-10-20 13:51:52 File "/stable-diffusion-webui/modules/sd_samplers_kdiffusion.py", line 235, in <lambda> 2023-10-20 13:51:52 samples = self.launch_sampling(steps, lambda: self.func(self.model_wrap_cfg, x, extra_args=self.sampler_extra_args, disable=False, callback=self.callback_state, **extra_params_kwargs)) 2023-10-20 13:51:52 File "/usr/local/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context 2023-10-20 13:51:52 return func(*args, **kwargs) 2023-10-20 13:51:52 File "/stable-diffusion-webui/repositories/k-diffusion/k_diffusion/sampling.py", line 594, in sample_dpmpp_2m 2023-10-20 13:51:52 denoised = model(x, sigmas[i] * s_in, **extra_args) 2023-10-20 13:51:52 File "/usr/local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl 2023-10-20 13:51:52 return forward_call(*args, **kwargs) 2023-10-20 13:51:52 File "/stable-diffusion-webui/modules/sd_samplers_cfg_denoiser.py", line 169, in forward 2023-10-20 13:51:52 x_out = self.inner_model(x_in, sigma_in, cond=make_condition_dict(cond_in, image_cond_in)) 2023-10-20 13:51:52 File "/usr/local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl 2023-10-20 13:51:52 return forward_call(*args, **kwargs) 2023-10-20 13:51:52 File "/stable-diffusion-webui/repositories/k-diffusion/k_diffusion/external.py", line 112, in forward 2023-10-20 13:51:52 eps = self.get_eps(input * c_in, self.sigma_to_t(sigma), **kwargs) 2023-10-20 13:51:52 File "/stable-diffusion-webui/repositories/k-diffusion/k_diffusion/external.py", line 138, in get_eps 2023-10-20 13:51:52 return self.inner_model.apply_model(*args, **kwargs) 2023-10-20 13:51:52 File "/stable-diffusion-webui/modules/sd_models_xl.py", line 37, in apply_model 2023-10-20 13:51:52 return self.model(x, t, cond) 2023-10-20 13:51:52 File "/usr/local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl 2023-10-20 13:51:52 return forward_call(*args, **kwargs) 2023-10-20 13:51:52 File "/stable-diffusion-webui/modules/sd_hijack_utils.py", line 17, in <lambda> 2023-10-20 13:51:52 setattr(resolved_obj, func_path[-1], lambda *args, **kwargs: self(*args, **kwargs)) 2023-10-20 13:51:52 File "/stable-diffusion-webui/modules/sd_hijack_utils.py", line 28, in __call__ 2023-10-20 13:51:52 return self.__orig_func(*args, **kwargs) 2023-10-20 13:51:52 File "/stable-diffusion-webui/repositories/generative-models/sgm/modules/diffusionmodules/wrappers.py", line 28, in forward 2023-10-20 13:51:52 return self.diffusion_model( 2023-10-20 13:51:52 File "/usr/local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl 2023-10-20 13:51:52 return forward_call(*args, **kwargs) 2023-10-20 13:51:52 File "/stable-diffusion-webui/repositories/generative-models/sgm/modules/diffusionmodules/openaimodel.py", line 984, in forward 2023-10-20 13:51:52 emb = self.time_embed(t_emb) 2023-10-20 13:51:52 File "/usr/local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl 2023-10-20 13:51:52 return forward_call(*args, **kwargs) 2023-10-20 13:51:52 File "/usr/local/lib/python3.10/site-packages/torch/nn/modules/container.py", line 217, in forward 2023-10-20 13:51:52 input = module(input) 2023-10-20 13:51:52 File "/usr/local/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl 2023-10-20 13:51:52 return forward_call(*args, **kwargs) 2023-10-20 13:51:52 File "/stable-diffusion-webui/extensions-builtin/Lora/networks.py", line 429, in network_Linear_forward 2023-10-20 13:51:52 return originals.Linear_forward(self, input) 2023-10-20 13:51:52 File "/usr/local/lib/python3.10/site-packages/torch/nn/modules/linear.py", line 114, in forward 2023-10-20 13:51:52 return F.linear(input, self.weight, self.bias) 2023-10-20 13:51:52 RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cpu and cuda:0! (when checking argument for argument mat1 in method wrapper_CUDA_addmm)

About this issue

Original URL
State: closed
Created 8 months ago
Reactions: 2
Comments: 21

Most upvoted comments

@boehmi1988 - I was initially battling with that same error of “RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cpu and cuda:0!” too. Just to be sure, are you using the latest DEV branch of Automatic1111’s SD? That seems to be required ATM.

arch1v1st on Oct 21, 2023