Commits · 79ef5dca5e5cd5a33662d64c927c1b9786d4edee · nexedi / linux

03 Nov, 2015 40 commits

drm/nouveau: bump patchlevel to indicate availability of abi16/nvif interop · 79ef5dca
Ben Skeggs authored Nov 03, 2015
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
79ef5dca
drm/nouveau/abi16: implement limited interoperability with usif/nvif · 2621a416
Ben Skeggs authored Nov 03, 2015
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
2621a416

drm/nouveau/abi16: introduce locked variant of nouveau_abi16_get() · 786a57ef

Ben Skeggs authored Nov 03, 2015

USIF already takes the client mutex, but will need access to ABI16 data
in order to provide some limited interoperability.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

786a57ef

drm/nouveau/abi16: remove unused argument from nouveau_abi16_get() · 09433f24
Ben Skeggs authored Nov 03, 2015
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
09433f24

drm/nouveau/pci: enable c800 magic for Medion Erazer X7827 · b41c4857

Ilia Mirkin authored Oct 31, 2015

Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=91557Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

b41c4857

drm/nouveau/pci: enable c800 magic for Lenovo Y510P · a2736b07

Ilia Mirkin authored Oct 27, 2015

Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=70354#c75Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

a2736b07

drm/nouveau/pll/gk104: fix PLL instability due to bad configuration with gddr5 · 78eaf335

Karol Herbst authored Aug 16, 2015

This patch uses an approach closer to the nvidia driver to configure
both PLLs for high gddr5 memory clocks (usually above 2400MHz)

Previously nouveau used the one PLL as it was used for the lower clocks
and just adjusted the second PLL to get as close as possible to the
requested clock.  This means for my card, that I got a 4050 MHz clock
although 4008 MHz was requested.

Now the driver iterates over a list of PLL configuration also used by
the nvidia driver and then adjust the second PLL to get near the
requested clock.  Also it hold to some restriction I found while
analyzing the PLL configurations

This won't fix all gddr5 high clock issues itself, but it should be
fine on hybrid gpu systems as found on many laptops these days.  Also
switching while normal desktop usage should be a lot more stable than
before.

v2: move the pll code into ramgk104
Signed-off-by: Karol Herbst <nouveau@karolherbst.de>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

78eaf335

drm/nouveau/clk/g84: Enable reclocking for GDDR3 G94-G200 · 0d42743d

Roy Spliet authored Sep 30, 2015

Your milage may vary, as it's only been tested on a single G94 and one G96.
Signed-off-by: Roy Spliet <rspliet@eclipso.eu>
Tested-by: Pierre Moreau <pierre.morrow@free.fr>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

0d42743d

drm/nouveau/bus/hwsq: Implement VBLANK waiting heuristic · 271c2766

Roy Spliet authored Sep 30, 2015

Avoids waiting for VBLANKS that never arrive on headless or otherwise
unconventional set-ups. Strategy taken from MEMX.
Signed-off-by: Roy Spliet <rspliet@eclipso.eu>
Tested-by: Pierre Moreau <pierre.morrow@free.fr>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

271c2766

drm/nouveau/fb/ramnv50: Script changes for G94 and up · 4d9faafa

Roy Spliet authored Sep 30, 2015

10053c is not even read on some cards, and I have no idea exactly what the
criteria are. Likely NVIDIA pre-scans the VBIOS and in their driver disables
all features that are never used. The practical effect should be the same
as this implementation though.
Signed-off-by: Roy Spliet <rspliet@eclipso.eu>
Tested-by: Pierre Moreau <pierre.morrow@free.fr>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

4d9faafa

drm/nouveau/fb/ramnv50: Deal with cards without timing entries · 797eb6ed

Roy Spliet authored Sep 30, 2015

Like Pierre's G94. We might want to structure Kepler similarly in a follow-up.
Signed-off-by: Roy Spliet <rspliet@eclipso.eu>
Tested-by: Pierre Moreau <pierre.morrow@free.fr>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

797eb6ed

drm/nouveau/fb/ramnv50: Voltage GPIOs · 1cf688dd

Roy Spliet authored Sep 30, 2015

Does not seem to be necessary for NVA0, hence untested by me.
Signed-off-by: Roy Spliet <rspliet@eclipso.eu>
Tested-by: Pierre Moreau <pierre.morrow@free.fr>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

1cf688dd

drm/nouveau/fb/ramgt215: Restructure r111100 calculation for DDR2 · 0b0b78cd

Roy Spliet authored Sep 30, 2015

Seems to be mostly equal to DDR3 on < GT218, should improve stability for
DDR2 reclocks.
Signed-off-by: Roy Spliet <rspliet@eclipso.eu>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

0b0b78cd

drm/nouveau/fb/ramgt215: Change FBVDD/Q when BIOS asks for it · ef6e8f4c
Roy Spliet authored Sep 30, 2015
```
Signed-off-by: Roy Spliet <rspliet@eclipso.eu>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
ef6e8f4c

drm/nouveau/fb/ramgt215: Transform GPIO ramfuc method from FBVREF-specific to generic · e0a37f85

Roy Spliet authored Sep 30, 2015

In preparation of changing FBVDDQ, as observed on at least one GDDR3 card.
While at it, adhere to func.log[1] properly for consistency.
Signed-off-by: Roy Spliet <rspliet@eclipso.eu>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

e0a37f85

drm/nouveau/bios/rammap: Identify DLLoff for >= GF100 · b4f2bf33
Roy Spliet authored Sep 30, 2015
```
Signed-off-by: Roy Spliet <rspliet@eclipso.eu>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
b4f2bf33

drm/nouveau/pci: Handle 5-bit and 8-bit tag field · 5d5b43f5

Pierre Moreau authored Oct 03, 2015

If the hardware supports extended tag field (8-bit ones), then enable it.

This is usually done by the VBIOS, but not on some MBPs (see fdo#86537).

In case extended tag field is not supported, 5-bit tag field is used which
limits the possible number of requests to 32. Apparently bits 7:0 of
0x08841c stores some number of outstanding requests, so cap it to 32 if
extended tag is unsupported.

Fixes: fdo#86537

v2: Restrict changes to chipsets >= 0x84
v3:
  * Add nvkm_pci_mask to pci.h
  * Mask bit 8 before setting it
v4:
  * Rename `add` argument of nvkm_pci_mask to `value`
  * Move code from nvkm_pci_init to g84_pci_init and remove PCIe and chipset
    checks
v5:
  * Rebase code on latest PCI structure
  * Restore PCIe check
  * Fix namings in nvkm_pci_mask
  * Rephrase part of the commit message
Signed-off-by: Pierre Moreau <pierre.morrow@free.fr>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

5d5b43f5

drm/nouveau/disp,pm: constify nvkm_object_func structures · 354a2249

Julia Lawall authored Oct 11, 2015

These nvkm_object_func structures are never modified.  All other
nvkm_object_func structures are declared as const.

Done with the help of Coccinelle.
Signed-off-by: Julia Lawall <Julia.Lawall@lip6.fr>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

354a2249

drm/nouveau/gr: add FERMI_COMPUTE_B class to GF110+ · 2e69a5cd

Ilia Mirkin authored Oct 07, 2015

GF110+ supports both the A and B compute classes, make sure to accept
both.
Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

2e69a5cd

drm/nouveau/gr: document mp error 0x10 · 3988f645

Ilia Mirkin authored Oct 07, 2015

NVIDIA provided the documentation for mp error 0x10, INVALID_ADDR_SPACE,
which apparently happens when trying to use an atomic operation on
local or shared memory (instead of global memory).
Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

3988f645

drm/nouveau: fix memory leak · bad4274a

Sudip Mukherjee authored Sep 11, 2015

If pm_runtime_get_sync() we were going to "out" but we missed freeing
vma.
Signed-off-by: Sudip Mukherjee <sudip@vectorindia.org>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

bad4274a

drm/nouveau: remove unused function · 4458c563

Sudip Mukherjee authored Sep 02, 2015

coverity.com reported that memset was using a buffer of size 0, on
checking the code it turned out that the function was not being used. So
remove it.
Signed-off-by: Sudip Mukherjee <sudip@vectorindia.org>
Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

4458c563

drm/nouveau/pmu/gk107: enable PGOB codepaths · 3c9aca31

Ben Skeggs authored Oct 01, 2015

Reported to be needed as per fdo#70354 comment #61.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

3c9aca31

drm/nouveau/pmu/gk104: check fuse to determine presence of PGOB · 560f989f

Ben Skeggs authored Oct 01, 2015

Not 100% confirmed, but seems to match from the few boards I've looked
at so far.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

560f989f

drm/nouveau/pci: prepare for chipset-specific initialisation tasks · 779d16aa
Ben Skeggs authored Oct 01, 2015
```
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
779d16aa

drm/nouveau/pci/nv46: attempt to fix msi, and re-enable by default · c4266a9c

Ben Skeggs authored Oct 01, 2015

Was not able to obtain a trace of NVRM due to kernel version annoyances,
however, experimentally confirmed that the WAR we use on NV50/G8x boards
works here too.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

c4266a9c

drm/nouveau/pci/g94: split implementation from nv40 · b31505c4

Ben Skeggs authored Oct 01, 2015

An upcoming patch will implement functionality that we don't use on any
NV40 chipset.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

b31505c4

drm/nouveau/pci/g84: split implementation from nv50 · 3e55b53b

Ben Skeggs authored Oct 01, 2015

An upcoming patch will implement functionality that we don't use on the
original NV50.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

3e55b53b

drm/nouveau/ibus/gf100: increase wait timeout to avoid read faults · b6afa265

Samuel Pitoiset authored Sep 24, 2015

Increase clock timeout of some unknown engines in order to avoid failure
at high gpcclk rate.

This fixes IBUS read faults on my GF119 when reclocking is manually
enabled. Note that memory reclocking is completely broken and NvMemExec
has to be disabled to allow core clock reclocking only.
Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

b6afa265

drm/nouveau/gm204/6: add voltage control using the new gk104 volt class · 24580d1c

Martin Peres authored Sep 16, 2015

I got confirmation that we can read and change the voltage with the same code.
The divider is also computed correctly on the gm204 we got our hands on.

Thanks to Yoshimo on IRC for executing the tests on his gm204!
Signed-off-by: Martin Peres <martin.peres@free.fr>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

24580d1c

drm/nouveau/gm107: add voltage control using the new gk104 volt class · dc47700f

Martin Peres authored Sep 09, 2015

Let's ignore the other desktop Maxwells until I get my hands on one and confirm
that we still can change the voltage.
Signed-off-by: Martin Peres <martin.peres@free.fr>

dc47700f

drm/nouveau/volt/gk104: add support for pwm and gpio modes · 1531dbbb

Martin Peres authored Sep 09, 2015

Most Keplers actually use the GPIO-based voltage management instead of the new
PWM-based one. Use the GPIO mode as a fallback as it already gracefully handles
the case where no GPIOs exist.

All the Maxwells seem to use the PWM method though.

v2:
 - Do not forget to commit the PWM configuration change!
Signed-off-by: Martin Peres <martin.peres@free.fr>

1531dbbb

drm/nouveau/volt: add support for non-vid-based voltage controllers · 4c58a05b

Martin Peres authored Sep 09, 2015

This patch is not ideal but it definitely beats a rewrite of the current
interface and is very self-contained.
Signed-off-by: Martin Peres <martin.peres@free.fr>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

4c58a05b

drm/nouveau/bios/volt: add support for pwm-based volt management · 7bddeba9
Martin Peres authored Sep 09, 2015
```
Signed-off-by: Martin Peres <martin.peres@free.fr>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>
```
7bddeba9

drm/nouveau/ttm: set the DMA mask for platform devices · b31cf78b

Alexandre Courbot authored Sep 04, 2015

So far the DMA mask was not set for platform devices, which limited them
to a 32-bit physical space. Allow dma_set_mask() to be called for
non-PCI devices, and also take the IOMMU bit into account since it could
restrict the physically addressable space.
Signed-off-by: Alexandre Courbot <acourbot@nvidia.com>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

b31cf78b

drm/nouveau/ttm: convert to DMA API · 524883bb

Alexandre Courbot authored Sep 04, 2015

The pci_dma_* functions are now superseeded in the kernel by the DMA
API. Make the conversion to this more generic API.
Signed-off-by: Alexandre Courbot <acourbot@nvidia.com>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

524883bb

drm/nouveau/instmem/gk20a: make use of the IOMMU bit · 68b56653

Alexandre Courbot authored Sep 04, 2015

Use the IOMMU bit specified in platform data instead of hardcoding it to
the bit used by current Tegra GPUs.
Signed-off-by: Alexandre Courbot <acourbot@nvidia.com>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

68b56653

drm/nouveau/platform: allow to specify the IOMMU bit · e396ecd1

Alexandre Courbot authored Sep 04, 2015

Current Tegra code taking advantage of the IOMMU assumes a hardcoded
value for the IOMMU bit. Make it a platform property instead for
flexibility.

v2 (Ben Skeggs): remove nvkm dependence on drm structures
Signed-off-by: Alexandre Courbot <acourbot@nvidia.com>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

e396ecd1

drm/nouveau/instmem/gk20a: use direct CPU access · 69c49382

Alexandre Courbot authored Sep 04, 2015

The Great Nouveau Refactoring Take II brought us a lot of goodness,
including acquire/release methods that are called before and after an
instobj is modified. These functions can be used as synchronization
points to manage CPU/GPU coherency if we modify an instobj using the
CPU.

This patch replaces the legacy and slow PRAMIN access for gk20a instmem
with CPU mappings and writes. A LRU list is used to unmap unused
mappings after a certain threshold (currently 1MB) of mapped instobjs is
reached. This allows mappings to be reused most of the time.

Accessing instobjs using the CPU requires to maintain the GPU L2 cache,
which we do in the acquire/release functions. This triggers a lot of L2
flushes/invalidates, but most of them are performed on an empty cache
(and thus return immediately), and overall context setup performance
greatly benefits from this (from 250ms to 160ms on Jetson TK1 for a
simple libdrm program).

Making L2 management more explicit should allow us to grab some more
performance in the future.
Signed-off-by: Alexandre Courbot <acourbot@nvidia.com>
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

69c49382

drm/nouveau: remove unnecessary usage of object handles · fcf3f91c

Ben Skeggs authored Sep 04, 2015

No longer required in a lot of cases, as objects are identified over NVIF
via an alternate mechanism since the rework.
Signed-off-by: Ben Skeggs <bskeggs@redhat.com>

fcf3f91c