Commits · 5728d064190e169f1a42381bd7e5fc4d411f3188 · Kirill Smelkov / linux

09 Nov, 2022 40 commits

drm/nouveau/fb: handle sysmem flush page from common code · 5728d064

Ben Skeggs authored Jun 01, 2022

- also executes pre-DEVINIT, so early boot is able to DMA sysmem
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

5728d064

drm/nouveau/flcn: new code to load+boot simple HS FWs (VPR scrubber) · 0e44c217

Ben Skeggs authored Jun 01, 2022

Adds the start of common interfaces to load and boot the HS binaries
provided by NVIDIA that enable the usage of GR.

ACR already handles most of this, but it's very much tied into ACR's
init process, and there's other code that could benefit from reusing
a lot of this stuff too (ie. VBIOS DEVINIT/PreOS, VPR scrubber).

The VPR scrubber code is fairly independent, and a good first target.

- adds better debug output to fw loading process, to ease bring-up/debug

v2:
- whitespace, 0->false
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

0e44c217

drm/nouveau/flcn: rework falcon reset · f15cde64

Ben Skeggs authored Jun 01, 2022

Mostly preparation to fit in Ampere changes, but should result in reset
sequences a lot closer to RM's, and perhaps help out with the issues we
sometimes see reported in this area.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

f15cde64

drm/nouveau/sec2: switch to newer style interrupt handler · c7c0aac7
Ben Skeggs authored Jun 01, 2022
```
Ampere.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>
```
c7c0aac7

drm/nouveau/sec2: unload RTOS before tearing down WPR · 3b330f08

Ben Skeggs authored Jun 01, 2022

Reset regs won't be available on Ampere while SEC2 RTOS is running, and
we're apparently supposed to be doing this on earlier GPUs too.

v2:
- fixed some excessive indentation
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

3b330f08

drm/nouveau/pmu/gm20b,gp10b: boot RTOS from PMU init · a9d90860

Ben Skeggs authored Jun 01, 2022

Cleanup before falcon changes.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

a9d90860

drm/nouveau/pmu: move init() falcon reset to non-nvfw code · ccdc0431

Ben Skeggs authored Jun 01, 2022

Cleanup before falcon changes.

- fixes (attempt at?) reset of pmu while rtos is running, on gm20b

v2:
- remove extra whitespace
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

ccdc0431

drm/nouveau/pmu: move preinit() falcon reset to devinit · b7f44ef7

Ben Skeggs authored Jun 01, 2022

Cleanup before falcon changes.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

b7f44ef7

drm/nouveau/gsp: add funcs · 74f9dcb0

Ben Skeggs authored Jun 01, 2022

Ampere.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

74f9dcb0

drm/nouveau/fifo/ga100-: initial support · 7f4f35ea

Ben Skeggs authored Jun 01, 2022

- replaces the hacked-up version that existed solely to support TTM

v2. remove earlier hack preventing use of non-stall intr for fences
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

7f4f35ea

drm/nouveau/ce/ga100-: initial support · 05d271c3

Ben Skeggs authored Jun 01, 2022

- replaces the hacked-up version that existed solely to support TTM
- noop until the next commit, adding proper support for ampere host

v2. fixup for ga103 early merge
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

05d271c3

drm/nouveau/fifo: add new channel classes · 06db7fde

Ben Skeggs authored Jun 01, 2022

Exposes a bunch of the new features that became possible as a result
of the earlier commits.  DRM will build on this in the future to add
support for features such as SCG ("async compute") and multi-device
rendering, as part of the work necessary to be able to write a half-
decent vulkan driver - finally.

For the moment, this just crudely ports DRM to the API changes.

- channel class interfaces now the same for all HW classes
- channel group class exposed (SCG)
- channel runqueue selector exposed (SCG)
- channel sub-device id control exposed (multi-device rendering)
- channel names in logging will reflect creating process, not fd owner
- explicit USERD allocation required by VOLTA_CHANNEL_GPFIFO_A and newer
- drm is smarter about determining the appropriate channel class to use
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

06db7fde

drm/nouveau/fifo: add new engine object handling · 7ac29332

Ben Skeggs authored Jun 01, 2022

Simplifies the GPU-specific code, completing the switch to newer HALs.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

7ac29332

drm/nouveau/fifo: add new engine context handling · 8ab849d6

Ben Skeggs authored Jun 01, 2022

Builds on the context tracking that was added earlier.

- marks engine context PTEs as 'priv' where possible
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

8ab849d6

drm/nouveau/fifo: add RAMFC info to nvkm_chan_func · 3647c53b

Ben Skeggs authored Jun 01, 2022

- adds support for specifying SUBDEVICE_ID for channel
- rounds non-power-of-two GPFIFO sizes down, rather than up
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

3647c53b

drm/nouveau/fifo: add USERD info to nvkm_chan_func · fbe9f433

Ben Skeggs authored Jun 01, 2022

And use it to cleanup multiple implementations of almost the same thing.

- prepares for non-polled / client-provided USERD
- only zeroes relevant "registers", rather than entire USERD
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

fbe9f433

drm/nouveau/fifo: add RAMIN info to nvkm_chan_func · d3e7a439

Ben Skeggs authored Jun 01, 2022

Currently provided by {chan,dma,gpfifo}*.c, and those are going away.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

d3e7a439

drm/nouveau/fifo: add common runlist control · b084fff2

Ben Skeggs authored Jun 01, 2022

- less dependence on waiting for runlist updates, on GPUs that allow it
- supports runqueue selector in RAMRL entries
- completes switch to common runl/cgrp/chan topology info
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

b084fff2

drm/nouveau/fifo: add common channel recovery · 4d60100a

Ben Skeggs authored Jun 01, 2022

That sure was fun to untangle.

- handled per-runlist, rather than globally
- more straight-forward process in general
- various potential SW/HW races have been fixed
- fixes lockdep issues that were present in >=gk104's prior implementation
- volta recovery now actually stands a chance of working
- volta/turing waiting for PBDMA idle before engine reset
- turing using hw-provided TSG info for CTXSW_TIMEOUT
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

4d60100a

drm/nouveau/fifo: kill channel on NV_PPBDMA_INTR_1_CTXNOTVALID · 0b1bb129
Ben Skeggs authored Jun 01, 2022
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>
```
0b1bb129

drm/nouveau/fifo: kill channel on a selection of PBDMA errors · 520db040

Ben Skeggs authored Jun 01, 2022

A bunch of these can be handled in such a way that the channel can
continue, however, any of these are a pretty decent sign something
has gone horribly wrong, and the safest option is to disable the
channel.

This is a bit of a hack, we will want to handle these individually
and dump relevant debug info for each at some point.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

520db040

drm/nouveau/fifo: add chan/cgrp preempt() · acff9415

Ben Skeggs authored Jun 01, 2022

Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

acff9415

drm/nouveau/fifo: add chan start()/stop() · 67059b9f

Ben Skeggs authored Jun 01, 2022

- nvkm_chan_error() built on top, stops channel and sends 'killed' event
- removes an odd double-bashing of channel enable regs on kepler and up
- pokes doorbell on turing and up, after enabling channel
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

67059b9f

drm/nouveau/fifo: add chan bind()/unbind() · 62742b5e

Ben Skeggs authored Jun 01, 2022

- stops programming (non-existent) runl id field on bind(), from maxwell
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

62742b5e

drm/nouveau/fifo: add runlist block()/allow() · 3a6bc9c2

Ben Skeggs authored Jun 01, 2022

Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

3a6bc9c2

drm/nouveau/fifo: add runlist wait() · 4a492fd5

Ben Skeggs authored Jun 01, 2022

- adds g8x/turing registers, which were missing before
- switches fermi to polled wait, like later hw (see: 4f2fc25c...)
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

4a492fd5

drm/nouveau/fifo: add new engine context tracking · f48dd293

Ben Skeggs authored Jun 01, 2022

Channel groups have somewhat more complicated requirements than what we
currently support. An engine context is shared between all channels in
a channel group, VEID/subctx support (later) brings per-VEID components,
and we need to track an individual channel's engine context pointers.

This commit adds the structures and refcounting to support the above,
wrapping the prior implementation for the moment.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

f48dd293

drm/nouveau/fifo: add new channel lookup interfaces · c358f538

Ben Skeggs authored Jun 01, 2022

- supports per-runlist CHIDs
- channel group lock held across reference, rather than global lock

v2:
- remove unnecessary parenthesis
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

c358f538

drm/nouveau/fifo: merge mmu fault handlers together · e43c872c

Ben Skeggs authored Jun 01, 2022

After updating GF100 implementation from the GK104/TU102 ones, and using
the new runlist/engine topology info, all three handlers become (almost)
identical.

- there's a temporary kludge to call through to the HW-specific recovery
- engine fault mapping info determined at load time, not on every fault
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

e43c872c

drm/nouveau/fifo: move PBDMA intr to runq · 923f1ff5

Ben Skeggs authored Jun 01, 2022

- merges gf100/gk104- NV_PFIFO_INTR_0_PBDMA and NV_PPBDMA_INTR_0 code
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

923f1ff5

drm/nouveau/fifo: move PBDMA init to runq · 87c86024

Ben Skeggs authored Jun 01, 2022

- bumps pbdma timeout to value RM uses on newer HW
- bumps fb timeout to max from boot default
- one/both of these greatly improves stability on // piglit runs
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

87c86024

drm/nouveau/fifo: program NV_PFIFO_FB_TIMEOUT on init · 324176e7

Ben Skeggs authored Oct 03, 2022

NVGPU and RM both program this value.

Fixes a bunch of random hangs running parallel piglit.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

324176e7

drm/nouveau/fifo: tidy global PBDMA init · 965c41d9
Ben Skeggs authored Oct 03, 2022
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
965c41d9

drm/nouveau/fifo: tidy up non-stall intr handling · d67f3b96

Ben Skeggs authored Jun 01, 2022

- removes a layer of indirection in the intr handling
- prevents non-stall ctrl racing with unknown intrs
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

d67f3b96

drm/nouveau/fifo: use explicit intr interfaces · 2fc71a05

Ben Skeggs authored Jun 01, 2022

More control, and shallower call-chain to get to the point.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

2fc71a05

drm/nouveau/fifo: use runlist engine info to lookup engine classes · 0fc72ee9
Ben Skeggs authored Jun 01, 2022
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>
```
0fc72ee9

drm/nouveau/fifo: add cgrp, have all channels be part of one · 468fae7b

Ben Skeggs authored Jun 01, 2022

Engine context tracking will move to nvkm_cgrp in later commits, so we
create SW-only channel groups on HW without support for them.

- switches to nvkm_chid for TSG/channel ID allocation
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

468fae7b

drm/nouveau/fifo: expose per-runlist CHID information · eb39c613

Ben Skeggs authored Jun 01, 2022

DRM uses this to setup fence-related items.

- nouveau_chan.runlist will always be "0" for the moment, not an issue
  as GPUs prior to ampere have system-wide channel IDs,
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

eb39c613

drm/nouveau/fifo: expose runlist topology info on all chipsets · 6de12538

Ben Skeggs authored Jun 01, 2022

Previously only available from Kepler onwards.

- also fixes the info() queries causing fifo init()/fini() unnecessarily
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

6de12538

drm/nouveau/fifo: add common runlist/engine topology · d94470e9

Ben Skeggs authored Jun 01, 2022

Creates an nvkm_runl for each runlist on the GPU, and an nvkm_engn for
each engine that is reachable from a runlist.

- basically what gk104- already does, but extended to all chips
- adds per-runlist CHID allocators (Ampere)
- splits g98/gt2xx out from g84 (different target engines)
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
Reviewed-by: Lyude Paul <lyude@redhat.com>

d94470e9