View previous topic :: View next topic |
Author |
Message |
KShots Guru
Joined: 09 Oct 2003 Posts: 595 Location: Florida
|
Posted: Wed Jan 10, 2024 9:43 pm Post subject: |
|
|
Under 6.6.3, my root filesystem (under ZFS, an out-of-tree kernel module) would work for a minute or so, then start timing out on all operations. In order for ZFS to even try to work, I had to hard unmask it (using **, not ~amd64) as all earlier versions did not support beyond 6.5.x. Given I still had an OOPS even at that late a version, I didn't feel it merited much further testing. _________________ Life without passion is death in disguise |
|
Back to top |
|
|
KShots Guru
Joined: 09 Oct 2003 Posts: 595 Location: Florida
|
Posted: Mon Jan 22, 2024 9:26 pm Post subject: |
|
|
Update from kernel 6.6.13 - still has an oops. ZFS kernel modules are available (version 2.2 is compatible with up to kernel 6.6), but are unstable. Under 6.6.13, I get a display for a few seconds, then it flickers on and off (off for 1 minute, on for 2-3 seconds) until rebooted. Reverted back to 6.1.74, which reverted the problem. _________________ Life without passion is death in disguise |
|
Back to top |
|
|
KShots Guru
Joined: 09 Oct 2003 Posts: 595 Location: Florida
|
Posted: Sat Mar 16, 2024 7:12 pm Post subject: |
|
|
Still have problems with 6.6.21 - works for a minute or two, then the screens turn off. Tried switching to a console (ctrl-alt-f7), and got a blinking text cursor on the built-in display... and I guess a frame from the framebuffer on the primary monitor showing the gnome desktop. Only thing I could do from there was reboot. Reverted again back to 6.1.74. _________________ Life without passion is death in disguise |
|
Back to top |
|
|
logrusx Advocate
Joined: 22 Feb 2018 Posts: 2201
|
Posted: Sat Mar 16, 2024 7:35 pm Post subject: |
|
|
I had hangs on resume from S3 sleep while on battery on 6.6, which seems to be resolved on 6.8. You could try it. I hope it really works this time.
Generally - a problematic driver. A lot of changes, regressions, new issues...
Best Regards,
Georgi |
|
Back to top |
|
|
KShots Guru
Joined: 09 Oct 2003 Posts: 595 Location: Florida
|
Posted: Wed Mar 20, 2024 12:12 am Post subject: |
|
|
Unfortunately, 6.8 isn't an option yet - no ZFS support beyond 6.7 as of yet, so I can't mount my root filesystem. _________________ Life without passion is death in disguise |
|
Back to top |
|
|
logrusx Advocate
Joined: 22 Feb 2018 Posts: 2201
|
Posted: Wed Mar 20, 2024 6:07 am Post subject: |
|
|
I decided to stay with 6.1 for the time being. I'm quite unhappy how kernel development of stable lines goes. It doesn't look stable.
Best Regards,
Georgi |
|
Back to top |
|
|
KShots Guru
Joined: 09 Oct 2003 Posts: 595 Location: Florida
|
Posted: Fri May 03, 2024 9:54 pm Post subject: |
|
|
ZFS released a module compatible with 6.8 kernels this week, so I figured I'd give a shot using the latest 6.8 kernel - 6.8.9...
... Still no good. Same kernel panic on loading, and the screens flicker a few time then give up entirely. I was able to login remotely and snag a dmesg showing the kernel panic (note that the EFI framebuffer doesn't work with the amdgpu module either at this point, it flickers a couple of times on the console and gives up). Noteworthy in the dmesg output was this bit, which doesn't appear in 6.1.x kernels:
Code: | [ 32.024744] amdgpu 0000:03:00.0: [drm] Adding stream 00000000a7a76a87 to context failed with err 2!
[ 32.024895] amdgpu 0000:03:00.0: [drm] Adding stream 00000000a7a76a87 to context failed with err 2!
[ 32.025028] amdgpu 0000:03:00.0: [drm] Adding stream 00000000a7a76a87 to context failed with err 2! | I also see a gnome crash just above these: Code: | [ 31.655286] gnome-shell[3846]: segfault at 55be94f53ac1 ip 00007f3b9b878ccd sp 00007f3b61fe9fb0 error 4 in libc.so.6[7f3b9b85d000+15e000] likely on CPU 20 (core 10, socket 0)
[ 31.655296] Code: 84 e4 74 59 e8 44 48 fe ff 48 8b 5d 00 49 89 c6 48 85 db 75 15 eb 36 66 0f 1f 44 00 00 48 8b 5d 08 48 83 c5 08 48 85 db 74 23 <44> 3a 23 75 ee 4c 89 f2 48 89 de 4c 89 ef e8 10 49 fe ff 85 c0 75 | Definately not seeing an improvement here, and fully unusable in this state compared to 6.1.x
EDIT:
Relevant dmesg output below for amdgpu: Code: | rich@succubus ~ $ grep -i "drm\|amdgpu\|03:00.0" dmesg-6.8.9.log
[ 0.861268] pci 0000:03:00.0: [1002:744c] type 00 class 0x030000 PCIe Legacy Endpoint
[ 0.861286] pci 0000:03:00.0: BAR 0 [mem 0xff800000000-0xffbffffffff 64bit pref]
[ 0.861298] pci 0000:03:00.0: BAR 2 [mem 0xffc00000000-0xffc0fffffff 64bit pref]
[ 0.861308] pci 0000:03:00.0: BAR 4 [io 0x2000-0x20ff]
[ 0.861316] pci 0000:03:00.0: BAR 5 [mem 0xc1000000-0xc10fffff]
[ 0.861325] pci 0000:03:00.0: ROM [mem 0xfffe0000-0xffffffff pref]
[ 0.861341] pci 0000:03:00.0: BAR 0: assigned to efifb
[ 0.861375] pci 0000:03:00.0: PME# supported from D1 D2 D3hot D3cold
[ 0.861392] pci 0000:03:00.0: 126.024 Gb/s available PCIe bandwidth, limited by 16.0 GT/s PCIe x8 link at 0000:00:01.1 (capable of 252.048 Gb/s with 16.0 GT/s PCIe x16 link)
[ 1.090373] pci 0000:03:00.0: vgaarb: setting as boot VGA device
[ 1.090373] pci 0000:03:00.0: vgaarb: bridge control possible
[ 1.090373] pci 0000:03:00.0: vgaarb: VGA device added: decodes=io+mem,owns=none,locks=none
[ 1.216067] pci 0000:03:00.0: ROM [mem 0xfffe0000-0xffffffff pref]: can't claim; no compatible bridge window
[ 1.216097] pci 0000:03:00.0: ROM [mem 0xc1120000-0xc113ffff pref]: assigned
[ 1.216639] pci 0000:03:00.1: D0 power state depends on 0000:03:00.0
[ 1.217745] pci 0000:03:00.0: Adding to iommu group 0
[ 3.121457] ACPI: bus type drm_connector registered
[ 3.787655] [drm] amdgpu kernel modesetting enabled.
[ 3.792521] amdgpu: Virtual CRAT table created for CPU
[ 3.802361] amdgpu: Topology: Add CPU node
[ 3.807734] [drm] initializing kernel modesetting (IP DISCOVERY 0x1002:0x744C 0x1028:0x0C4C 0xCF).
[ 3.817628] [drm] register mmio base: 0xC1000000
[ 3.817935] [drm] register mmio size: 1048576
[ 3.822964] [drm] add ip block number 0 <soc21_common>
[ 3.823267] [drm] add ip block number 1 <gmc_v11_0>
[ 3.823566] [drm] add ip block number 2 <ih_v6_0>
[ 3.823864] [drm] add ip block number 3 <psp>
[ 3.824164] [drm] add ip block number 4 <smu>
[ 3.824458] [drm] add ip block number 5 <dm>
[ 3.824750] [drm] add ip block number 6 <gfx_v11_0>
[ 3.825044] [drm] add ip block number 7 <sdma_v6_0>
[ 3.825332] [drm] add ip block number 8 <vcn_v4_0>
[ 3.825620] [drm] add ip block number 9 <jpeg_v4_0>
[ 3.825906] [drm] add ip block number 10 <mes_v11_0>
[ 3.826302] amdgpu 0000:03:00.0: amdgpu: Fetched VBIOS from VFCT
[ 3.826592] amdgpu: ATOM BIOS: 113-BRT121633-001
[ 3.827186] Loading firmware: amdgpu/psp_13_0_0_sos.bin
[ 3.827511] Loading firmware: amdgpu/psp_13_0_0_ta.bin
[ 3.827825] Loading firmware: amdgpu/smu_13_0_0.bin
[ 3.828134] Loading firmware: amdgpu/dcn_3_2_0_dmcub.bin
[ 3.828445] Loading firmware: amdgpu/gc_11_0_0_pfp.bin
[ 3.828749] amdgpu 0000:03:00.0: amdgpu: CP RS64 enable
[ 3.829034] Loading firmware: amdgpu/gc_11_0_0_me.bin
[ 3.829346] Loading firmware: amdgpu/gc_11_0_0_rlc.bin
[ 3.829896] Loading firmware: amdgpu/gc_11_0_0_mec.bin
[ 3.830201] Loading firmware: amdgpu/gc_11_0_0_imu.bin
[ 3.830493] Loading firmware: amdgpu/sdma_6_0_0.bin
[ 3.830780] [drm] VCN(0) encode/decode are enabled in VM mode
[ 3.831053] [drm] VCN(1) encode/decode are enabled in VM mode
[ 3.831324] Loading firmware: amdgpu/vcn_4_0_0.bin
[ 3.831620] amdgpu 0000:03:00.0: [drm:jpeg_v4_0_early_init [amdgpu]] JPEG decode is enabled in VM mode
[ 3.832039] Loading firmware: amdgpu/gc_11_0_0_mes_2.bin
[ 3.832335] Loading firmware: amdgpu/gc_11_0_0_mes1.bin
[ 3.832718] amdgpu 0000:03:00.0: vgaarb: deactivate vga console
[ 3.832721] amdgpu 0000:03:00.0: amdgpu: Trusted Memory Zone (TMZ) feature not supported
[ 3.832743] amdgpu 0000:03:00.0: amdgpu: MEM ECC is not presented.
[ 3.832746] amdgpu 0000:03:00.0: amdgpu: SRAM ECC is not presented.
[ 3.832755] [drm] vm size is 262144 GB, 4 levels, block size is 9-bit, fragment size is 9-bit
[ 3.832762] amdgpu 0000:03:00.0: amdgpu: VRAM: 16368M 0x0000008000000000 - 0x00000083FEFFFFFF (16368M used)
[ 3.832766] amdgpu 0000:03:00.0: amdgpu: GART: 512M 0x00007FFF00000000 - 0x00007FFF1FFFFFFF
[ 3.832775] [drm] Detected VRAM RAM=16368M, BAR=16384M
[ 3.832778] [drm] RAM width 256bits GDDR6
[ 3.832945] [drm] amdgpu: 16368M of VRAM memory ready
[ 3.832948] [drm] amdgpu: 31996M of GTT memory ready.
[ 3.832964] [drm] GART: num cpu pages 131072, num gpu pages 131072
[ 3.833015] [drm] PCIE GART of 512M enabled (table at 0x0000008000900000).
[ 3.833329] [drm] Loading DMUB firmware via PSP: version=0x07002900
[ 3.833449] [drm] Found VCN firmware Version ENC: 1.19 DEC: 7 VEP: 0 Revision: 0
[ 3.833459] amdgpu 0000:03:00.0: amdgpu: Will use PSP to load VCN firmware
[ 4.005924] [drm] reserve 0x1300000 from 0x83fc000000 for PSP TMR
[ 4.223966] amdgpu 0000:03:00.0: amdgpu: RAP: optional rap ta ucode is not available
[ 4.223971] amdgpu 0000:03:00.0: amdgpu: SECUREDISPLAY: securedisplay ta ucode is not available
[ 4.224001] amdgpu 0000:03:00.0: amdgpu: smu driver if version = 0x0000003d, smu fw if version = 0x0000003f, smu fw program = 0, smu fw version = 0x004e7900 (78.121.0)
[ 4.224008] amdgpu 0000:03:00.0: amdgpu: SMU driver if version not matched
[ 4.431348] amdgpu 0000:03:00.0: amdgpu: SMU is initialized successfully!
[ 4.431760] [drm] Display Core v3.2.266 initialized on DCN 3.2
[ 4.431764] [drm] DP-HDMI FRL PCON supported
[ 4.433826] [drm] DMUB hardware initialized: version=0x07002900
[ 4.738264] [drm] PSR support 1, DC PSR ver 0, sink PSR ver 1 DPCD caps 0xa su_y_granularity 0
[ 5.406166] [drm] kiq ring mec 3 pipe 1 q 0
[ 5.412245] [drm] VCN decode and encode initialized successfully(under DPG Mode).
[ 5.413091] amdgpu 0000:03:00.0: [drm:jpeg_v4_0_hw_init [amdgpu]] JPEG decode initialized successfully.
[ 5.534735] amdgpu: HMM registered 16368MB device memory
[ 5.535476] kfd kfd: amdgpu: Allocated 3969056 bytes on gart
[ 5.535491] kfd kfd: amdgpu: Total number of KFD nodes to be created: 1
[ 5.535516] amdgpu: Virtual CRAT table created for GPU
[ 5.535742] amdgpu: Topology: Add dGPU node [0x744c:0x1002]
[ 5.535746] kfd kfd: amdgpu: added device 1002:744c
[ 5.535759] amdgpu 0000:03:00.0: amdgpu: SE 6, SH per SE 2, CU per SH 8, active_cu_number 72
[ 5.535766] amdgpu 0000:03:00.0: amdgpu: ring gfx_0.0.0 uses VM inv eng 0 on hub 0
[ 5.535769] amdgpu 0000:03:00.0: amdgpu: ring comp_1.0.0 uses VM inv eng 1 on hub 0
[ 5.535773] amdgpu 0000:03:00.0: amdgpu: ring comp_1.1.0 uses VM inv eng 4 on hub 0
[ 5.535776] amdgpu 0000:03:00.0: amdgpu: ring comp_1.2.0 uses VM inv eng 6 on hub 0
[ 5.535780] amdgpu 0000:03:00.0: amdgpu: ring comp_1.3.0 uses VM inv eng 7 on hub 0
[ 5.535783] amdgpu 0000:03:00.0: amdgpu: ring comp_1.0.1 uses VM inv eng 8 on hub 0
[ 5.535786] amdgpu 0000:03:00.0: amdgpu: ring comp_1.1.1 uses VM inv eng 9 on hub 0
[ 5.535789] amdgpu 0000:03:00.0: amdgpu: ring comp_1.2.1 uses VM inv eng 10 on hub 0
[ 5.535793] amdgpu 0000:03:00.0: amdgpu: ring comp_1.3.1 uses VM inv eng 11 on hub 0
[ 5.535796] amdgpu 0000:03:00.0: amdgpu: ring sdma0 uses VM inv eng 12 on hub 0
[ 5.535799] amdgpu 0000:03:00.0: amdgpu: ring sdma1 uses VM inv eng 13 on hub 0
[ 5.535803] amdgpu 0000:03:00.0: amdgpu: ring vcn_unified_0 uses VM inv eng 0 on hub 8
[ 5.535806] amdgpu 0000:03:00.0: amdgpu: ring vcn_unified_1 uses VM inv eng 1 on hub 8
[ 5.535809] amdgpu 0000:03:00.0: amdgpu: ring jpeg_dec uses VM inv eng 4 on hub 8
[ 5.535813] amdgpu 0000:03:00.0: amdgpu: ring mes_kiq_3.1.0 uses VM inv eng 14 on hub 0
[ 5.537086] [drm] ring gfx_32768.1.1 was added
[ 5.537444] [drm] ring compute_32768.2.2 was added
[ 5.537749] [drm] ring sdma_32768.3.3 was added
[ 5.537805] [drm] ring gfx_32768.1.1 ib test pass
[ 5.537879] [drm] ring compute_32768.2.2 ib test pass
[ 5.537916] [drm] ring sdma_32768.3.3 ib test pass
[ 5.539354] amdgpu 0000:03:00.0: amdgpu: Using BOCO for runtime pm
[ 5.539942] [drm] Initialized amdgpu 3.57.0 20150101 for 0000:03:00.0 on minor 0
[ 5.566318] fbcon: amdgpudrmfb (fb0) is primary device
[ 5.566651] [drm] DSC precompute is not needed.
[ 5.568150] amdgpu 0000:03:00.0: [drm] fb0: amdgpudrmfb frame buffer device
[ 20.712414] WARNING: CPU: 25 PID: 622 at drivers/gpu/drm/amd/amdgpu/../display/dc/dsc/dcn20/dcn20_dsc.c:272 dsc2_disable+0x101/0x180 [amdgpu]
[ 20.713003] Modules linked in: zfs(PO) spl(O) dm_crypt hid_logitech_hidpp hid_logitech_dj hid_generic usbhid amdgpu video amdxcp i2c_algo_bit drm_ttm_helper crc32_pclmul ttm crc32c_intel polyval_clmulni agpgart polyval_generic drm_exec ghash_clmulni_intel sha512_ssse3 gpu_sched sha256_ssse3 drm_suballoc_helper sha1_ssse3 drm_buddy rtsx_pci_sdmmc mmc_core aesni_intel ucsi_acpi drm_display_helper nvme crypto_simd ccp xhci_pci i2c_hid_acpi typec_ucsi drm_kms_helper i2c_hid nvme_core roles rtsx_pci rng_core cryptd xhci_hcd sha1_generic mfd_core cec hwmon typec drm wmi i2c_core dm_mod dax efivarfs
[ 20.715059] Workqueue: events_highpri dm_irq_work_func [amdgpu]
[ 20.715590] RIP: 0010:dsc2_disable+0x101/0x180 [amdgpu]
[ 20.721508] ? dsc2_disable+0x101/0x180 [amdgpu]
[ 20.722474] ? dsc2_disable+0x101/0x180 [amdgpu]
[ 20.724642] ? dsc2_disable+0x101/0x180 [amdgpu]
[ 20.725152] ? dsc2_disable+0x101/0x180 [amdgpu]
[ 20.725657] link_set_dsc_on_stream+0x400/0x470 [amdgpu]
[ 20.726595] ? dm_helpers_dp_write_dsc_enable+0x282/0x700 [amdgpu]
[ 20.727132] link_set_dsc_enable+0x7a/0x90 [amdgpu]
[ 20.727670] link_set_dpms_off+0x193/0x730 [amdgpu]
[ 20.728582] ? dcn32_apply_idle_power_optimizations+0x22f/0x250 [amdgpu]
[ 20.729498] ? dm_read_reg_func+0x30/0xa0 [amdgpu]
[ 20.730026] dc_commit_state_no_check+0xa4c/0xdd0 [amdgpu]
[ 20.730532] dc_commit_streams+0x247/0x3b0 [amdgpu]
[ 20.731040] amdgpu_dm_atomic_commit_tail+0x437/0x38e0 [amdgpu]
[ 20.732705] ? drm_atomic_helper_setup_commit+0x201/0x730 [drm_kms_helper]
[ 20.735683] ? amdgpu_dm_plane_helper_prepare_fb+0x176/0x2e0 [amdgpu]
[ 20.736527] ? amdgpu_dm_plane_helper_prepare_fb+0x1b8/0x2e0 [amdgpu]
[ 20.736978] commit_tail+0x8c/0x120 [drm_kms_helper]
[ 20.737333] drm_atomic_helper_commit+0x111/0x140 [drm_kms_helper]
[ 20.737679] drm_atomic_commit+0x8e/0xc0 [drm]
[ 20.738027] ? __pfx___drm_printfn_info+0x10/0x10 [drm]
[ 20.738367] drm_client_modeset_commit_atomic+0x1fe/0x240 [drm]
[ 20.738701] drm_client_modeset_commit_locked+0x51/0x150 [drm]
[ 20.739350] drm_client_modeset_commit+0x1c/0x40 [drm]
[ 20.739681] __drm_fb_helper_restore_fbdev_mode_unlocked+0x80/0xc0 [drm_kms_helper]
[ 20.740007] drm_fb_helper_hotplug_event+0xdc/0xf0 [drm_kms_helper]
[ 20.740331] drm_client_dev_hotplug+0x95/0xf0 [drm]
[ 20.740669] handle_hpd_irq_helper+0x154/0x170 [amdgpu]
[ 22.039233] amdgpu 0000:03:00.0: [drm] Adding stream 000000002d8248d6 to context failed with err 2!
[ 22.227406] systemd[1]: Starting Load Kernel Module drm...
[ 22.247726] systemd[1]: modprobe@drm.service: Deactivated successfully.
[ 22.247809] systemd[1]: Finished Load Kernel Module drm.
[ 22.872344] snd_hda_intel 0000:03:00.1: bound 0000:03:00.0 (ops amdgpu_dm_audio_component_bind_ops [amdgpu])
[ 27.120825] amdgpu 0000:03:00.0: [drm] Adding stream 00000000eb63c5f4 to context failed with err 2!
[ 32.024744] amdgpu 0000:03:00.0: [drm] Adding stream 00000000a7a76a87 to context failed with err 2!
[ 32.024895] amdgpu 0000:03:00.0: [drm] Adding stream 00000000a7a76a87 to context failed with err 2!
[ 32.025028] amdgpu 0000:03:00.0: [drm] Adding stream 00000000a7a76a87 to context failed with err 2!
[ 32.400322] amdgpu 0000:03:00.0: [drm] Adding stream 00000000eb63c5f4 to context failed with err 2!
[ 52.617929] amdgpu 0000:03:00.0: [drm] Adding stream 000000005de5b06a to context failed with err 2!
[ 83.168051] amdgpu 0000:03:00.0: [drm] Adding stream 00000000c8f1093b to context failed with err 2! |
_________________ Life without passion is death in disguise |
|
Back to top |
|
|
logrusx Advocate
Joined: 22 Feb 2018 Posts: 2201
|
Posted: Sat May 04, 2024 5:26 am Post subject: |
|
|
Why don't you file a kernel bug?
Best Regards,
Georgi |
|
Back to top |
|
|
KShots Guru
Joined: 09 Oct 2003 Posts: 595 Location: Florida
|
Posted: Sat May 04, 2024 7:20 pm Post subject: |
|
|
I think I will at this point... but I also think it will get no attention as I am the only linux user with an RX 7900M - not a whole lot of demand there. I'll open one up and post a link here in this topic in case anyone does a search for it.
EDIT: Linked _________________ Life without passion is death in disguise |
|
Back to top |
|
|
KShots Guru
Joined: 09 Oct 2003 Posts: 595 Location: Florida
|
Posted: Mon Jul 15, 2024 8:35 pm Post subject: |
|
|
So... a few months down the road, no responses on the bug report. Pretty sure nobody's interested in supporting this configuration.
That said... This afternoon I managed to upgrade to 6.6.38... but only via Plasma & Wayland... Gnome & Wayland still blanks out after a few seconds and never comes back. I still get a lot of kernel error messages under dmesg, which are the same as what was captured in the bug report above... I'm about ready to give up on this laptop. _________________ Life without passion is death in disguise |
|
Back to top |
|
|
|