Gentoo Forums
amdgpu ring gfx_high timeout causing freezing plasma/wayland
maiku
l33t


Joined: 24 Mar 2004
Posts: 605
Location: Escaping from NY

Posted: Mon Feb 03, 2025 6:57 pm    Post subject: amdgpu ring gfx_high timeout causing freezing plasma/wayland

Recently (I cannot identify what changed beforehand), gimp has been locking up kde (I guess wayland, 'cause it ain't X anymore) whenever I open gimp and pop open the file dialog. I use kde/plasma as my DE.

After it locks up, I can SSH into the machine. If I kill gimp it becomes a zombie process. I cannot stop xdm. Everything becomes a zombie. I can't reboot my machine, so I have to hard power down.
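
A note on terminology for anyone debugging the same symptom: a process that survives `kill -9` may be in uninterruptible sleep (state `D`, typically waiting inside a kernel driver) rather than a true zombie (state `Z`). The state letter in `/proc/<pid>/stat` tells the two apart. The following is only a minimal sketch, assuming a standard Linux `/proc` layout; `parse_stat` and `stuck_processes` are hypothetical helper names, not part of any tool mentioned in this thread.

```python
import os


def parse_stat(stat_text):
    """Split the contents of /proc/<pid>/stat into (pid, comm, state).

    comm is wrapped in parentheses and may itself contain spaces or
    parentheses, so split on the LAST ')' rather than on whitespace.
    """
    open_paren = stat_text.index("(")
    close_paren = stat_text.rindex(")")
    pid = int(stat_text[:open_paren])
    comm = stat_text[open_paren + 1:close_paren]
    state = stat_text[close_paren + 1:].split()[0]
    return pid, comm, state


def stuck_processes(proc_root="/proc"):
    """Return (pid, comm, state) for every process in state D or Z."""
    found = []
    try:
        entries = os.listdir(proc_root)
    except OSError:
        return found  # no /proc here (non-Linux system)
    for entry in entries:
        if not entry.isdigit():
            continue
        try:
            with open(os.path.join(proc_root, entry, "stat")) as handle:
                pid, comm, state = parse_stat(handle.read())
        except (OSError, ValueError):
            continue  # process exited mid-read, or unexpected format
        if state in ("D", "Z"):
            found.append((pid, comm, state))
    return found


if __name__ == "__main__":
    for pid, comm, state in stuck_processes():
        print(pid, state, comm)
```

Run over SSH while the desktop is frozen, this would list which processes are in `D` and which are in `Z`, which is useful detail to include in a bug report.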

How do I go about squashing this bug?

Quote:
media-gfx/gimp-2.10.38-r1:0/2::gentoo USE="aalib alsa heif mng postscript udev webp (-aqua) -debug -doc -gnome -jpeg2k -jpegxl -openexr -test -unwind -vector-icons -wmf -xpm" CPU_FLAGS_X86="mmx sse" 30,956 KiB


Code:
# emerge --info
Portage 3.0.66.1 (python 3.12.8-final-0, default/linux/amd64/23.0/desktop/plasma, gcc-13, glibc-2.40-r8, 6.6.74-gentoo x86_64)
=================================================================
System uname: Linux-6.6.74-gentoo-x86_64-AMD_Ryzen_5_2500U_with_Radeon_Vega_Mobile_Gfx-with-glibc2.40
KiB Mem:    31744112 total,  25870728 free
KiB Swap:   16777212 total,  16777212 free
Head commit of repository gentoo: e8f27f170f226d3a0cb5df5d3bdd3951ba8ae7fe

sh bash 5.2_p37
ld GNU ld (Gentoo 2.42 p6) 2.42.0
app-misc/pax-utils:        1.3.8::gentoo
app-shells/bash:           5.2_p37::gentoo
dev-build/autoconf:        2.13-r8::gentoo, 2.71-r7::gentoo, 2.72-r1::gentoo
dev-build/automake:        1.16.5-r2::gentoo
dev-build/cmake:           3.30.6::gentoo
dev-build/libtool:         2.5.4::gentoo
dev-build/make:            4.4.1-r100::gentoo
dev-build/meson:           1.6.1::gentoo
dev-java/java-config:      2.3.4::gentoo
dev-lang/perl:             5.40.0-r1::gentoo
dev-lang/python:           3.11.10_p1::gentoo, 3.12.8_p1::gentoo, 3.13.1_p1::gentoo
dev-lang/rust:             1.79.0-r100::gentoo, 1.80.1-r100::gentoo, 1.81.0-r100::gentoo
dev-lang/rust-bin:         1.79.0-r100::gentoo, 1.81.0-r101::gentoo, 1.82.0-r101::gentoo, 1.83.0::gentoo
llvm-core/clang:           18.1.8-r6::gentoo, 19.1.4::gentoo
llvm-core/lld:             18.1.8::gentoo, 19.1.4::gentoo
llvm-core/llvm:            18.1.8-r6::gentoo, 19.1.4::gentoo
sys-apps/baselayout:       2.17::gentoo
sys-apps/openrc:           0.55.1::gentoo
sys-apps/sandbox:          2.39::gentoo
sys-devel/binutils:        2.42-r2::gentoo, 2.43-r2::gentoo
sys-devel/binutils-config: 5.5.2::gentoo
sys-devel/gcc:             13.3.1_p20241025::gentoo, 14.2.1_p20241221::gentoo
sys-devel/gcc-config:      2.12.1::gentoo
sys-kernel/linux-headers:  6.6-r1::gentoo (virtual/os-headers)
sys-libs/glibc:            2.40-r8::gentoo
Repositories:

gentoo
    location: /usr/portage
    sync-type: git
    sync-uri: git://anongit.gentoo.org/repo/gentoo.git
    priority: -1000
    volatile: True

crossdev-rpi2
    location: /usr/local/portage-crossdev-rpi2
    masters: gentoo
    priority: 10
    volatile: True

ACCEPT_KEYWORDS="amd64"
ACCEPT_LICENSE="*"
CBUILD="x86_64-pc-linux-gnu"
CFLAGS="-march=native -O2 -pipe -fomit-frame-pointer"
CHOST="x86_64-pc-linux-gnu"
CONFIG_PROTECT="/etc /usr/lib64/libreoffice/program/sofficerc /usr/share/config /usr/share/gnupg/qualified.txt /var/bind"
CONFIG_PROTECT_MASK="/etc/ca-certificates.conf /etc/dconf /etc/env.d /etc/fonts/fonts.conf /etc/gconf /etc/gentoo-release /etc/php/apache2-php8.2/ext-active/ /etc/php/cgi-php8.2/ext-active/ /etc/php/cli-php8.2/ext-active/ /etc/php/fpm-php8.2/ext-active/ /etc/php/phpdbg-php8.2/ext-active/ /etc/revdep-rebuild /etc/sandbox.d"
CXXFLAGS="-march=native -O2 -pipe -fomit-frame-pointer"
DISTDIR="/tmp/portage/distfiles"
EMERGE_DEFAULT_OPTS="--autounmask=n"
ENV_UNSET="CARGO_HOME DBUS_SESSION_BUS_ADDRESS DISPLAY GDK_PIXBUF_MODULE_FILE GOBIN GOPATH PERL5LIB PERL5OPT PERLPREFIX PERL_CORE PERL_MB_OPT PERL_MM_OPT XAUTHORITY XDG_CACHE_HOME XDG_CONFIG_HOME XDG_DATA_HOME XDG_RUNTIME_DIR XDG_STATE_HOME"
FCFLAGS="-O2 -pipe"
FEATURES="assume-digests binpkg-docompress binpkg-dostrip binpkg-logs buildpkg-live config-protect-if-modified distlocks ebuild-locks fixlafiles ipc-sandbox merge-sync merge-wait multilib-strict network-sandbox news parallel-fetch pid-sandbox pkgdir-index-trusted preserve-libs protect-owned qa-unresolved-soname-deps sandbox sfperms strict unknown-features-warn unmerge-logs unmerge-orphans userfetch userpriv usersandbox usersync xattr"
FFLAGS="-O2 -pipe"
GENTOO_MIRRORS="http://distfiles.gentoo.org"
LANG="en_US"
LDFLAGS="-Wl,-O1 -Wl,--as-needed -Wl,-z,pack-relative-relocs"
LEX="flex"
LINGUAS="en ja"
MAKEOPTS="-j5"
PKGDIR="/var/cache/binpkgs"
PORTAGE_CONFIGROOT="/"
PORTAGE_RSYNC_OPTS="--recursive --links --safe-links --perms --times --omit-dir-times --compress --force --whole-file --delete --stats --human-readable --timeout=180 --exclude=/distfiles --exclude=/local --exclude=/packages --exclude=/.git"
PORTAGE_TMPDIR="/tmp"
SHELL="/bin/bash"
USE="3dnow 3dnowext X a52 aac aacplus aacs aalib accessibility acl acpi activities additions agg airplay alsa amd64 android anthy ao apache2 apm archive arts aspell atm audiofile autoipd avahi avi berkdb bluetooth bluray bmp branding bzip2 cairo cdda cddb cdparanoia cdr cet chardet chm cjk clang client connection-sharing corefonts cover crypt cups curl dbus declarative designer desktopglobe dhcp dhcpcd disable-watermark divx djvu dri dts dv dvd dvdnav dvdr dvdread eap-tls ebook elogind enca encode epub exif faac facebook fdk fftw flac fluidsynth font-server frontendonly ftp fts3 fuse game gd gdbm gif git glitz gme gmedia gold google gphoto2 grub gsm gstaudio gstreamer gstvideo gtk gui hal hbci hdri heif hwaccel ibus iconv icu id3 id3tag imagemagick imlib immqt-bc inotify ios ipod iptables ipv6 javascript jce joystick jpeg json kde kdenlive kf6compat kwallet ladspa lame lapacke lcms libass libcaca libnotify librtmp libsamplerate libtirpc live logrotate lz4 lzma lzo m17n-lib mad matroska mdnsresponder-compat melt messages mikmod minizip mjpeg mmap mms mmxext mng mobi mod modplug mono mount mp3 mp3tunes mp4 mpeg mppe-mppc msn mtp multilib music musicbrainz mysql mysqli ncurses networkmanager nfs njb nls nodroproot normalize notifier nptl nptlonly nsplugin ntfs nxclient ocr odbc ofa ofono-headset ogg oggvorbis ogm openal opencl opengl opus osmesa pam pango pcre pcre16 pcre32 pdf perl pic pipewire plasma pm-utils png pnp policykit postproc postscript printsupport pulseaudio qdoc qml qsui qt5 qt6 quicktime radiotap rar raw rdesktop rdp readline real realmedia reiserfs rpc rss rtc rustfmt samba screencast script sdl seccomp secure-delete semantic-desktop share sip sipim smpeg sndfile soprano sound source sox speex spell spice sql sqlite srtp sse3 ssl ssse3 staging startup-notification staticsocket subtitles subversion svg svn swfdec system-harfbuzz system-jpeg system-libevent system-libvpx system-sqlite systray taglib teamarena test-programs test-rust theora threads tiff 
timidity tk touchpad tray truetype tunepimp udev udisks unicode upcall upower usb v4l v4l2 vaapi vamp vcd vdpau vim vim-syntax virtfs visualization vlc vnc vorbis vpx vulkan wav wavpack wayland webchannel webkit webp widgets wifi winbind wma wmp wmv wps wxwidgets x264 x265 xattr xcb xcomposite xft xine xinerama xml xml2 xosd xv xvid zeroconf zip zlib" ABI_X86="64" ADA_TARGET="gcc_13" APACHE2_MODULES="authn_core authz_core socache_shmcb unixd actions alias auth_basic authn_anon authn_dbm authn_file authz_dbm authz_groupfile authz_host authz_owner authz_user autoindex cache cgi cgid dav dav_fs dav_lock deflate dir env expires ext_filter file_cache filter headers include info log_config logio mime mime_magic negotiation rewrite setenvif speling status unique_id userdir usertrack vhost_alias" CALLIGRA_FEATURES="karbon sheets words" COLLECTD_PLUGINS="df interface irq load memory rrdtool swap syslog" CPU_FLAGS_X86="mmxext sse2 ssse3 mmx sse sse3 aes avx avx2 sse4_1 sse4_2 f16c popcnt" ELIBC="glibc" GPSD_PROTOCOLS="ashtech aivdm earthmate evermore fv18 garmin garmintxt gpsclock greis isync itrax navcom oceanserver oncore rtcm104v2 rtcm104v3 sirf skytraq superstar2 tsip tripmate tnt ublox" GUILE_SINGLE_TARGET="3-0" GUILE_TARGETS="3-0" INPUT_DEVICES="mouse keyboard evdev synaptics joystick" KERNEL="linux" L10N="en-GB ja" LCD_DEVICES="bayrad cfontz glk hd44780 lb216 lcdm001 mtxorb text" LUA_SINGLE_TARGET="lua5-1" LUA_TARGETS="lua5-1" OFFICE_IMPLEMENTATION="libreoffice" PHP_TARGETS="php8-2" POSTGRES_TARGETS="postgres16" PYTHON_SINGLE_TARGET="python3_12" PYTHON_TARGETS="python3_12" QEMU_SOFTMMU_TARGETS="i386 x86_64" QEMU_USER_TARGETS="i386 x86_64" RUBY_TARGETS="ruby32" VIDEO_CARDS="amdgpu radeon radeonsi" XTABLES_ADDONS="quota2 psd pknock lscan length2 ipv4options ipp2p iface geoip fuzzy condition tarpit sysrq proto logmark ipmark dhcpmac delude chaos account"
Unset:  ADDR2LINE, AR, ARFLAGS, AS, ASFLAGS, CC, CCLD, CONFIG_SHELL, CPP, CPPFLAGS, CTARGET, CXX, CXXFILT, ELFEDIT, EXTRA_ECONF, F77FLAGS, FC, GCOV, GPROF, INSTALL_MASK, LC_ALL, LD, LFLAGS, LIBTOOL, MAKE, MAKEFLAGS, NM, OBJCOPY, OBJDUMP, PORTAGE_BINHOST, PORTAGE_BUNZIP2_COMMAND, PORTAGE_COMPRESS, PORTAGE_COMPRESS_FLAGS, PORTAGE_RSYNC_EXTRA_OPTS, PYTHONPATH, RANLIB, READELF, RUSTFLAGS, SIZE, STRINGS, STRIP, YACC, YFLAGS

_________________
Michael


Last edited by maiku on Wed Feb 05, 2025 2:51 pm; edited 2 times in total
maiku
l33t


Joined: 24 Mar 2004
Posts: 605
Location: Escaping from NY

Posted: Tue Feb 04, 2025 2:07 pm

Notably, last night I had Firefox and Thunderbird lock up on me 4 times. I can get back to a terminal, but I can't kill Firefox or Thunderbird; they become zombies and I can't reboot.

I've noticed consistent freezing and lock-ups since I upgraded to plasma with Wayland. However, the lockups eventually resolve after a few seconds to a minute (yes, it's incredibly annoying).

Not sure where to go on this one. dmesg doesn't show much. /var/log/messages shows a bunch of garbage:

Code:
Feb  3 22:57:44 test dbus-daemon[4054]: [system] Activating service name='org.kde.powerdevil.backlighthelper' requested by ':1.13' (uid=1000 pid=5472 comm="/usr/libexec/org_kde_powerdevil") (using servicehelper)
Feb  3 22:57:44 test org.kde.powerdevil.backlighthelper[6365]: Detected locale "C" with character encoding "ANSI_X3.4-1968", which is not UTF-8.
Qt depends on a UTF-8 locale, and has switched to "C.UTF-8" instead.
If this causes problems, reconfigure your locale. See the locale(1) manual
for more information.
Feb  3 22:57:44 test dbus-daemon[4054]: [system] Successfully activated service 'org.kde.powerdevil.backlighthelper'
Feb  3 22:57:53 test sm-notify[4741]: DNS resolution of Mikes-Mac-mini.knightsoftheround failed; retrying later
^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@
^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@
^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@
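
The `^@` runs above are how pagers and editors display NUL bytes. One common explanation, which is an assumption here and not confirmed anywhere in this thread, is that the hard power-off left the tail of the log file allocated but never written. A minimal sketch for reading such a file with the padding stripped (`strip_nul_padding` is a made-up helper name):

```python
def strip_nul_padding(raw):
    """Return (text, nul_count) for a bytes blob read from a log file.

    NUL bytes are counted and dropped; anything that is not valid UTF-8
    is replaced rather than raising, since a damaged log may be truncated
    mid-character.
    """
    nul_count = raw.count(b"\x00")
    cleaned = raw.replace(b"\x00", b"")
    return cleaned.decode("utf-8", errors="replace"), nul_count


if __name__ == "__main__":
    sample = b"Feb  3 22:57:53 test sm-notify[4741]: ...\n\x00\x00\x00\x00"
    text, count = strip_nul_padding(sample)
    print(f"dropped {count} NUL bytes")
    print(text, end="")
```

For a real log, read it in binary mode (`open(path, "rb").read()`) before passing it in, so the NUL bytes are not mangled by text decoding first.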



Code:
[ebuild   R    ] www-client/firefox-128.6.0:esr::gentoo  USE="X clang dbus gmp-autoupdate hwaccel jumbo-build pulseaudio system-av1 system-harfbuzz system-icu system-jpeg system-libevent system-libvpx system-webp telemetry wayland wifi -debug -eme-free -gnome-shell -hardened -jack -libproxy -openh264 -pgo (-selinux) -sndio -system-png -wasm-sandbox" L10N="en-GB ja -ach -af -an -ar -ast -az -be -bg -bn -br -bs -ca -ca-valencia -cak -cs -cy -da -de -dsb -el -en-CA -eo -es-AR -es-CL -es-ES -es-MX -et -eu -fa -ff -fi -fr -fur -fy -ga -gd -gl -gn -gu -he -hi -hr -hsb -hu -hy -ia -id -is -it -ka -kab -kk -km -kn -ko -lij -lt -lv -mk -mr -ms -my -nb -ne -nl -nn -oc -pa -pl -pt-BR -pt-PT -rm -ro -ru -sc -sco -si -sk -skr -sl -son -sq -sr -sv -szl -ta -te -th -tl -tr -trs -uk -ur -uz -vi -xh -zh-CN -zh-TW" LLVM_SLOT="19 -17 -18" 551,338 KiB
[ebuild   R    ] mail-client/thunderbird-128.6.0::gentoo  USE="X clang dbus hwaccel pulseaudio rust-extensions system-av1 system-harfbuzz system-jpeg system-libevent system-librnp system-libvpx system-webp wayland wifi -debug -eme-free -hardened -jack -libproxy -lto (-pgo) (-selinux) -sndio (-system-icu) -system-png" L10N="en-GB ja -af -ar -ast -be -bg -br -ca -cak -cs -cy -da -de -dsb -el -en-CA -es-AR -es-ES -es-MX -et -eu -fi -fr -fy -ga -gd -gl -he -hr -hsb -hu -id -is -it -ka -kab -kk -ko -lt -lv -ms -nb -nl -nn -pa -pl -pt-BR -pt-PT -rm -ro -ru -sk -sl -sq -sr -sv -th -tr -uk -uz -vi -zh-CN -zh-TW" LLVM_SLOT="19 -17 -18" 661,105 KiB


Fixed a long line. -- Banana
_________________
Michael
Hu
Administrator


Joined: 06 Mar 2007
Posts: 23095

Posted: Tue Feb 04, 2025 2:39 pm

Is there anything in dmesg after this happens?
maiku
l33t


Joined: 24 Mar 2004
Posts: 605
Location: Escaping from NY

Posted: Tue Feb 04, 2025 2:54 pm

Hu wrote:
Is there anything in dmesg after this happens?
Nothing related to any error. Just normal messages. It looks like this:

Code:
[   17.733646] kwin_wayland[5184]: memfd_create() called without MFD_EXEC or MFD_NOEXEC_SEAL set
[   19.370531] sched: RT throttling activated
[   21.999824] usb 1-2: reset high-speed USB device number 3 using xhci_hcd
[   37.290677] input: MX Anywhere 2S Keyboard as /devices/virtual/misc/uhid/0005:046D:B01A.0004/input/input15
[   37.290880] input: MX Anywhere 2S Mouse as /devices/virtual/misc/uhid/0005:046D:B01A.0004/input/input16
[   37.291075] hid-generic 0005:046D:B01A.0004: input,hidraw3: BLUETOOTH HID v0.03 Keyboard [MX Anywhere 2S] on 70:c9:4e:57:e8:0a

_________________
Michael
maiku
l33t


Joined: 24 Mar 2004
Posts: 605
Location: Escaping from NY

Posted: Wed Feb 05, 2025 2:27 pm

Today the crash produced results. I opened kdenlive and its open-file dialog (which seems to be the trigger) and got this when I SSHed in and ran dmesg:

Code:
[   18.177534] kwin_wayland[5092]: memfd_create() called without MFD_EXEC or MFD_NOEXEC_SEAL set
[   23.210888] usb 3-2: reset high-speed USB device number 3 using xhci_hcd
[  318.367390] input: MX Anywhere 2S Keyboard as /devices/virtual/misc/uhid/0005:046D:B01A.0004/input/input16
[  318.367614] input: MX Anywhere 2S Mouse as /devices/virtual/misc/uhid/0005:046D:B01A.0004/input/input17
[  318.367876] hid-generic 0005:046D:B01A.0004: input,hidraw3: BLUETOOTH HID v0.03 Keyboard [MX Anywhere 2S] on 70:c9:4e:57:e8:0a
[  688.220745] usb 1-2: USB disconnect, device number 3
[  688.802984] usb 3-2: USB disconnect, device number 3
[ 1957.334226] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_high timeout, signaled seq=122038, emitted seq=122040
[ 1957.334568] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process kwin_wayland pid 5092 thread kwin_wayla:cs0 pid 5223
[ 1957.334905] amdgpu 0000:03:00.0: amdgpu: GPU reset begin!
[ 1960.849012] amdgpu 0000:03:00.0: amdgpu: failed to write reg 28b4 wait reg 28c6
[ 1963.101537] sched: RT throttling activated
[ 1971.754582] Bluetooth: hci0: Opcode 0x200c failed: -110
[ 1971.754592] Bluetooth: hci0: Unable to disable scanning: -110
[ 1971.754595] Bluetooth: hci0: disable scanning failed: -110
[ 1971.754599] Bluetooth: hci0: start background scanning failed: -110
[ 1975.258374] Bluetooth: hci0: command tx timeout
[ 1992.251978] clocksource: Long readout interval, skipping watchdog check: cs_nsec: 1140535519 wd_nsec: 1140538811
[ 2113.033903] Bluetooth: hci0: ACL packet for unknown connection handle 1
[ 2113.033918] Bluetooth: hci0: ACL packet for unknown connection handle 1
[ 2113.033922] Bluetooth: hci0: ACL packet for unknown connection handle 1
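
The `signaled seq` / `emitted seq` pair in the timeout line can be read as fence sequence numbers: the gap between them suggests how many submissions on the ring were still outstanding when the timeout fired. That reading is an interpretation, not something stated in this thread. A small sketch that pulls the fields out of such a line (the function name and the `pending` key are made up for illustration):

```python
import re

# Matches the amdgpu job-timeout message as it appears in the dmesg above.
TIMEOUT_RE = re.compile(
    r"ring (?P<ring>\S+) timeout, "
    r"signaled seq=(?P<signaled>\d+), emitted seq=(?P<emitted>\d+)"
)


def parse_ring_timeout(line):
    """Return ring name and sequence numbers from a timeout line, or None."""
    match = TIMEOUT_RE.search(line)
    if match is None:
        return None
    signaled = int(match["signaled"])
    emitted = int(match["emitted"])
    return {
        "ring": match["ring"],
        "signaled": signaled,
        "emitted": emitted,
        # Submissions emitted to the ring but not yet signaled as complete.
        "pending": emitted - signaled,
    }


if __name__ == "__main__":
    line = (
        "[ 1957.334226] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* "
        "ring gfx_high timeout, signaled seq=122038, emitted seq=122040"
    )
    print(parse_ring_timeout(line))
```

For the line in this post, that gives ring `gfx_high` with two submissions pending.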


If I had to guess, I'd say the amdgpu errors happened first, then the Bluetooth errors.
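
The ordering does not have to be a guess: the bracketed dmesg timestamps are seconds since boot, so the first amdgpu and first Bluetooth errors can be compared directly. A minimal sketch using a few lines copied from the output above (`first_seen` is a hypothetical helper, and matching on a plain substring is a simplification):

```python
import re

# "[ 1957.334226] message" -> seconds since boot, message text
STAMP_RE = re.compile(r"^\[\s*(?P<seconds>\d+\.\d+)\]\s*(?P<message>.*)$")


def first_seen(lines, needle):
    """Return the earliest timestamp of a line whose message contains
    `needle`, or None if no line matches."""
    stamps = []
    for line in lines:
        match = STAMP_RE.match(line)
        if match and needle in match["message"]:
            stamps.append(float(match["seconds"]))
    return min(stamps) if stamps else None


SAMPLE = [
    "[ 1957.334226] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_high timeout, signaled seq=122038, emitted seq=122040",
    "[ 1957.334905] amdgpu 0000:03:00.0: amdgpu: GPU reset begin!",
    "[ 1971.754582] Bluetooth: hci0: Opcode 0x200c failed: -110",
    "[ 1975.258374] Bluetooth: hci0: command tx timeout",
]

gpu = first_seen(SAMPLE, "amdgpu")
bluetooth = first_seen(SAMPLE, "Bluetooth")
print(f"amdgpu first at {gpu:.6f}s, Bluetooth first at {bluetooth:.6f}s")
print(f"gap: {bluetooth - gpu:.3f}s")
```

In this log the first amdgpu error (1957.33 s) precedes the first Bluetooth error (1971.75 s) by roughly 14 seconds, which matches the guess above.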

Is this a compositor error? How do I disable that in the new plasma?
_________________
Michael
Hu
Administrator


Joined: 06 Mar 2007
Posts: 23095

Posted: Wed Feb 05, 2025 3:05 pm

In my opinion, this cannot be a compositor bug, though the compositor may be integral in triggering the bug. amdgpu is a kernel driver, and barring a bug, the kernel should not get into this bad state, no matter how badly the compositor might misuse the kernel interfaces for accessing the graphics card.
maiku
l33t


Joined: 24 Mar 2004
Posts: 605
Location: Escaping from NY

Posted: Wed Feb 05, 2025 3:15 pm

Hu wrote:
In my opinion, this cannot be a compositor bug, though the compositor may be integral in triggering the bug. amdgpu is a kernel driver, and barring a bug, the kernel should not get into this bad state, no matter how badly the compositor might misuse the kernel interfaces for accessing the graphics card.


What do you think would be a good fix for this? Maybe I'll try cleaning my fans and updating my BIOS.
_________________
Michael
All times are GMT