i915 Error: GUC: TLB invalidation Causes OS to Hang
Posted by No-Result-3830@reddit | linux | 21 comments
A number of users, including myself, have been experiencing an issue where the OS becomes unresponsive: nothing is clickable, but the mouse is still able to move. Rebooting the PC is required to restore functionality. This happens approximately once a day, often after waking from sleep. The journald error log is included below, as well as reports of this issue and attempted fixes from other users.
The maintainer of i915-sriov-dkms (https://github.com/strongtz/i915-sriov-dkms/issues/118) traced the fix of this issue to https://github.com/intel/linux-intel...1e5c99f666068e. The commit is summarized below:
The GuC firmware had defined the interface for Translation Look-Aside Buffer (TLB) invalidation. We should use this interface when invalidating the engine and GuC TLBs. Add additional functionality to intel_gt_invalidate_tlb, invalidating the GuC TLBs and falling back to GT invalidation when the GuC is disabled. The invalidation is done by sending a request directly to the GuC tlb_lookup that invalidates the table. The invalidation is submitted as a wait request and is performed in the CT event handler. This means we cannot perform this TLB invalidation path if the CT is not enabled. If the request isn't fulfilled in two seconds, this would constitute an error in the invalidation as that would constitute either a lost request or a severe GuC overload.
With this new invalidation routine, we can perform GuC-based GGTT invalidations. GuC-based GGTT invalidation is incompatible with MMIO invalidation so we should not perform MMIO invalidation when GuC-based GGTT invalidation is expected.
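As a rough illustration of the wait-with-timeout pattern the commit summary describes, here is a toy Python model. It is not the i915 code (the driver is written in C). The class and method names are invented for this sketch, and only the two-second budget comes from the summary above.

```python
import threading

# Two-second budget, as stated in the commit summary.
TLB_INVAL_TIMEOUT_S = 2.0


class PendingInvalidations:
    """Toy model (hypothetical names): each request gets a seqno and an Event
    that a response handler sets when the reply arrives."""

    def __init__(self):
        self._lock = threading.Lock()
        self._pending = {}
        self._next_seqno = 0

    def submit(self):
        """Register a new request and return its sequence number."""
        with self._lock:
            seqno = self._next_seqno
            self._next_seqno += 1
            self._pending[seqno] = threading.Event()
        return seqno

    def complete(self, seqno):
        """Called by the (modelled) response handler when a reply arrives."""
        with self._lock:
            event = self._pending.get(seqno)
        if event is not None:
            event.set()

    def wait(self, seqno, timeout=TLB_INVAL_TIMEOUT_S):
        """Return True if the reply arrived in time, False on timeout."""
        with self._lock:
            event = self._pending[seqno]
        arrived = event.wait(timeout)
        with self._lock:
            self._pending.pop(seqno, None)
        return arrived
```

In this model, a `wait()` that returns False corresponds to the "TLB invalidation response timed out for seqno N" lines in the log: the reply was either lost or arrived too late.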
What would be the best course of action to resolve this issue for regular fedora users, without rebuilding the kernel?
System Info
OS: Fedora Linux 42 (Workstation Edition)
Firmware Version: R2DET38W (1.23)
Kernel Version: Linux 6.14.11-300.fc42.x86_64
Windowing System: Wayland
CPU: Intel Core Ultra 7 155H
iGPU: Intel Arc™ graphics
GPU: NVIDIA RTX 500 Ada Generation Laptop GPU 4GB GDDR6 (Nvidia driver installed)
journald Error Log:
Jul 02 16:16:33 fedora kernel: i915 0000:00:02.0: [drm] ERROR GT0: GUC: TLB invalidation response timed out for seqno 85162
Jul 02 16:16:35 fedora kernel: i915 0000:00:02.0: [drm] ERROR GT0: GUC: TLB invalidation response timed out for seqno 85163
Jul 02 16:16:38 fedora kernel: i915 0000:00:02.0: [drm] ERROR GT0: GUC: TLB invalidation response timed out for seqno 85165
Jul 02 16:16:38 fedora kernel: i915 0000:00:02.0: [drm] ERROR GT0: GUC: TLB invalidation response timed out for seqno 85164
Jul 02 16:16:39 fedora kernel: Fence expiration time out i915-0000:00:02.0:gnome-shell[3281]:92f2!
Jul 02 16:16:40 fedora kernel: i915 0000:00:02.0: [drm] ERROR GT0: GUC: TLB invalidation response timed out for seqno 85167
Jul 02 16:16:40 fedora kernel: i915 0000:00:02.0: [drm] ERROR GT0: GUC: TLB invalidation response timed out for seqno 85166
Jul 02 16:16:42 fedora kernel: i915 0000:00:02.0: [drm] ERROR GT0: GUC: TLB invalidation response timed out for seqno 85168
Jul 02 16:16:42 fedora kernel: i915 0000:00:02.0: [drm] ERROR GT0: GUC: TLB invalidation response timed out for seqno 85169
Jul 02 16:16:44 fedora kernel: i915 0000:00:02.0: [drm] ERROR GT0: GUC: TLB invalidation response timed out for seqno 85170
Jul 02 16:16:46 fedora kernel: i915 0000:00:02.0: [drm] ERROR GT0: GUC: TLB invalidation response timed out for seqno 85171
Jul 02 16:16:47 fedora kernel: i915 0000:00:02.0: [drm] GPU HANG: ecode 12:0:00000000
Jul 02 16:16:47 fedora kernel: i915 0000:00:02.0: [drm] GT0: Resetting chip for stopped heartbeat on rcs0
Jul 02 16:16:47 fedora kernel: i915 0000:00:02.0: [drm] GT0: GuC firmware i915/mtl_guc_70.bin version 70.44.1
Jul 02 16:16:47 fedora kernel: i915 0000:00:02.0: [drm] GT0: GUC: submission enabled
Jul 02 16:16:47 fedora kernel: i915 0000:00:02.0: [drm] GT0: GUC: SLPC enabled
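For anyone who wants to check whether their own freezes match this pattern, here is a minimal Python sketch that scans kernel log lines for the messages shown above. The function name and the event labels are made up for illustration. The matched strings are taken from this log only, so other kernel versions may word them differently.

```python
import re
from collections import Counter

# Message text as it appears in the log above.
TLB_TIMEOUT_RE = re.compile(
    r"GUC: TLB invalidation response timed out for seqno (\d+)"
)


def summarize_i915_log(lines):
    """Return (timed-out seqnos, counts per event type) for an iterable of log lines."""
    seqnos = []
    events = Counter()
    for line in lines:
        match = TLB_TIMEOUT_RE.search(line)
        if match:
            seqnos.append(int(match.group(1)))
            events["tlb_timeout"] += 1
        elif "GPU HANG" in line:
            events["gpu_hang"] += 1
        elif "Resetting chip" in line:
            events["reset"] += 1
    return seqnos, events
```

One way to use it is to save the previous boot's kernel messages with `journalctl -k -b -1 > kernel.log` and pass `open("kernel.log")` to `summarize_i915_log`.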
Reports of this issue have been made around the web, including attempts to resolve the issue, but none were successful. They are documented below:
- Updating GRUB_CMDLINE_LINUX_DEFAULT with the parameters "i915.enable_guc=0", "i915.enable_fbc=0 i915.enable_psr=0", "intel_idle.max_cstate=1" and "i915.enable_dc=0", as well as removing all GNOME extensions (https://www.reddit.com/r/Fedora/comm...on_zenbook_14/)
- Disabling VT-d and changing to kernel version 6.13.5 on Ubuntu (https://askubuntu.com/questions/1534...ke-ack-request)
- User confirms it is occurring on both Wayland and X11 on System76 (https://www.reddit.com/r/System76/co...yland_freezes/)
- Setting intel_idle.max_cstate=1 (https://www.reddit.com/r/Ubuntu/comm...th_intel_core/)
- Lenovo Support also noted the issue with suggested fixes (https://forums.lenovo.com/topic/find...360254/6570223)
- User speculates the issue is related to QEMU, but a QEMU maintainer noted the issue is unrelated (https://github.com/intel/linux-intel-lts/issues/54, https://gitlab.com/qemu-project/qemu/-/issues/2722)
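Before concluding that one of the kernel parameters above "didn't help", it is worth confirming that it actually reached the running kernel. The sketch below is a small Python helper for that, assuming the standard Linux `/proc/cmdline` file. The function names and the watched list are illustrative, and the simple whitespace split does not handle quoted values containing spaces.

```python
# Parameters mentioned in the reports above.
WATCHED = (
    "i915.enable_guc",
    "i915.enable_fbc",
    "i915.enable_psr",
    "i915.enable_dc",
    "intel_idle.max_cstate",
)


def parse_cmdline(cmdline):
    """Split a kernel command line into {name: value}; bare flags map to None."""
    params = {}
    for token in cmdline.split():
        key, sep, value = token.partition("=")
        params[key] = value if sep else None
    return params


def watched_params(cmdline):
    """Return only the watched parameters that are present on the command line."""
    params = parse_cmdline(cmdline)
    return {name: params[name] for name in WATCHED if name in params}
```

On a live system this would be called as `watched_params(open("/proc/cmdline").read())`. An empty result suggests the bootloader configuration was edited but not regenerated, or that a different boot entry was used.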
theflakes@reddit
Same TLB issue here with Intel 255h and 140T iGPU running on Xe driver on Ubuntu 24.04 LTS.
Effective-Bowler7976@reddit
Same here. Fedora 43, TUXEDO InfinityBook Pro Intel Gen9.
BostonDrivingIsWorse@reddit
Same issue here. Commenting to follow.
m-jawad-b-khorasani@reddit
I have the same issue on a Dell XPS 9440 with Ubuntu 24.04.03!
No-Result-3830@reddit (OP)
See: https://gitlab.freedesktop.org/drm/i915/kernel/-/issues/14469
messiahua@reddit
Check out this issue for details and investigation of this problem: https://gitlab.freedesktop.org/drm/i915/kernel/-/issues/14469
Basic_Researcher1437@reddit
Hi, I am facing the same issue on my Zephyrus G16 with Fedora. The system randomly freezes and becomes unresponsive. Has anyone found a solution? Setting psr to 0 and guc to 0 or 2 didn't help.
No-Result-3830@reddit (OP)
Switching from i915 to xe appears to reduce the frequency of hangs for some users, but they still occur. There's currently a discussion going on here: https://discussion.fedoraproject.org/t/i915-error-guc-tlb-invalidation-causes-os-to-hang/157260/17
messiahua@reddit
I have the same problem with i915 (Ultra 7 155H) on ubuntu 25.04, 6.14.0-23-generic. Sometimes it unhangs, sometimes not.
zlice0@reddit
https://gitlab.com/freedesktop-mirror/drm-intel
Probably file a bug here if you can't find one; they're decently responsive.
"NVIDIA RTX 500 Ada Generation Laptop GPU 4GB GDDR6": the Nvidia 'bumblebee' and whatever combo with the iGPU has sucked for ages.
Maybe look up ways to disable Nvidia and make sure you're only using the Intel GPU?
zlice0@reddit
probably this? just saw hang / tlb https://gitlab.freedesktop.org/drm/i915/kernel/-/issues/14469
Ezmiller_2@reddit
Have you tried using the Nouveau driver? I'm not sure if that would help your situation or if it's a completely different beast.
No-Result-3830@reddit (OP)
Yes that’s the driver that originally came with Fedora I believe. Reverting back to Nouveau is next on my todo list, once the results of my current attempted fix are in.
Ezmiller_2@reddit
You'll know if it's working if you go open the Nvidia-settings app and it looks incomplete. I was using the Nouveau driver and didn't know it until I checked the Nvidia settings, and the hardware and driver info were absent. I could tell performance was different. Not terribly bad, but the vsync didn't work very well and scrolling on Firefox was terrible.
No-Result-3830@reddit (OP)
I uninstalled the Nvidia driver and reverted back to Nouveau, and the freezing problem still persists.
Ezmiller_2@reddit
Dang. I wonder if Mint would help you out in this situation.
Interject_@reddit
Have you tried using the xe kernel driver instead of i915? It's possible that it might have other issues as it's not the default, but Meteor Lake is the last target that defaults to i915; everything after it uses xe by default. Could be worth trying out: https://www.kernel.org/doc/html//next/gpu/rfc/xe.html#xe-platforms
No-Result-3830@reddit (OP)
Update: switched to xe and the problem is still occurring -- it may be a different problem because with xe, not even the mouse is movable when the system freezes.
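When comparing i915 and xe results, it helps to confirm which driver is actually bound to the iGPU after a reboot. This is a minimal Python sketch that reads the standard sysfs `driver` symlink. It assumes the iGPU sits at PCI address 0000:00:02.0, as in the log above, and the function name is made up for illustration.

```python
import os


def bound_driver(pci_address="0000:00:02.0", sysfs_root="/sys/bus/pci/devices"):
    """Return the name of the kernel driver bound to a PCI device, or None."""
    link = os.path.join(sysfs_root, pci_address, "driver")
    try:
        # The symlink points at the driver's directory; its last path
        # component is the driver name (for example "i915" or "xe").
        return os.path.basename(os.readlink(link))
    except OSError:
        # No such device, or no driver currently bound to it.
        return None
```

Calling `bound_driver()` with the defaults should return either `"i915"` or `"xe"` on the affected machines, which makes it clear which driver a given freeze report belongs to.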
No-Result-3830@reddit (OP)
Giving it a try now. Will report back if stable
ArtichokeRelevant211@reddit
https://docs.fedoraproject.org/en-US/quick-docs/bugzilla-file-a-bug/