Commits · d2ead3eaf8a4bf92129eda69189ce18a6c1cc8bd · Kirill Smelkov / linux

14 Dec, 2012 6 commits

drm/radeon/kms: add evergreen/cayman CS parser for async DMA (v2) · d2ead3ea

Alex Deucher authored Dec 13, 2012

Allows us to use the DMA ring from userspace.
DMA doesn't have a good NOP packet in which to embed the
reloc idx, so userspace has to add a reloc for each
buffer used and order them to match the command stream.

v2: fix address bounds checking
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

d2ead3ea

drm/radeon/kms: add 6xx/7xx CS parser for async DMA (v2) · cf4ccd01

Alex Deucher authored Nov 18, 2011

Allows us to use the DMA ring from userspace.
DMA doesn't have a good NOP packet in which to embed the
reloc idx, so userspace has to add a reloc for each
buffer used and order them to match the command stream.

v2: fix address bounds checking, reloc indexing
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

cf4ccd01

drm/radeon: fix htile buffer size computation for command stream checker · 4ac0533a

Jerome Glisse authored Dec 13, 2012

Fix the size computation of the htile buffer.
Signed-off-by: Jerome Glisse <jglisse@redhat.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

4ac0533a

drm/radeon: fix fence locking in the pageflip callback · 9af20792

Daniel Vetter authored Dec 11, 2012

We need to hold bdev->fence_lock while grabbing a reference to
the fence, to prevent concurrent clearing/changing of the
ttm_bo->sync_obj field.
Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

9af20792

drm/radeon: make indirect register access concurrency-safe · 2c385151

Daniel Vetter authored Dec 02, 2012

With the new per-crtc locking mutliple set-cursor calls could happen
in parallel. Out of sheer paranoia I've opted for an irqsave spinlock.
But if there's indeed an access from interrupt contexts to these regs
it's already broken with the old code, so this can likely just be
reduced to a normal spinlock. Otoh the pageflip completion happens
from the vblank irq handler ...
Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

2c385151

drm/radeon: add W|RREG32_IDX for MM_INDEX|DATA based mmio accesss · 2ef9bdfe

Daniel Vetter authored Dec 02, 2012

Just refactoring to make the next patche simpler. Now all indirect register
access in the new modesetting driver should go through the r100_mm_(w|r)reg
fucntions.

RADEON_READ_MM from the old driver seems to be totally unused, so just kill
it.
Signed-off-by: Daniel Vetter <daniel.vetter@ffwll.ch>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

2ef9bdfe

13 Dec, 2012 1 commit

Merge branch 'drm-next-3.8' of git://people.freedesktop.org/~agd5f/linux into drm-next · 9add1ac3

Dave Airlie authored Dec 13, 2012

* 'drm-next-3.8' of git://people.freedesktop.org/~agd5f/linux:
  drm/radeon: fix fence driver for dma ring when wb is disabled
  drm/radeon/si: add VM CS checker support for CP DMA
  drm/radeon/cayman: add VM CS checker support for CP DMA
  drm/radeon: add support for CP DMA packet to evergreen CS checker
  drm/radeon: add support for CP DMA packet to r6xx/r7xx CS checker
  drm/radeon: add register headers for CP DMA on r6xx-SI
  drm/radeon: improve mc_stop/mc_resume on r5xx-r7xx
  drm/radeon: fix amd afusion gpu setup aka sumo v2
  drm/radeon: do not move bo to different placement at each cs

9add1ac3

12 Dec, 2012 9 commits

drm/radeon: fix fence driver for dma ring when wb is disabled · 86a1881d

Jerome Glisse authored Dec 12, 2012

The dma ring can't write to register thus have to write to memory
its fence value. This ensure that it doesn't try to use scratch
register for dma ring fence driver.

Should fix:
https://bugs.freedesktop.org/show_bug.cgi?id=58166Signed-off-by: Jerome Glisse <jglisse@redhat.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>

86a1881d

drm/radeon/si: add VM CS checker support for CP DMA · 5aa709be

Alex Deucher authored Dec 03, 2012

Need to verify for copies involving registers.
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

5aa709be

drm/radeon/cayman: add VM CS checker support for CP DMA · 94e014ee
Alex Deucher authored Dec 03, 2012
```
Need to verify for copies involving registers.
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
```
94e014ee
drm/radeon: add support for CP DMA packet to evergreen CS checker · 8770b86b
Alex Deucher authored Dec 03, 2012
```
Currently only memory and GDS transfers are allowed.
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
```
8770b86b
drm/radeon: add support for CP DMA packet to r6xx/r7xx CS checker · 6830f585
Alex Deucher authored Dec 03, 2012
```
Currently only memory to memory transfers are allowed.
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
```
6830f585
drm/radeon: add register headers for CP DMA on r6xx-SI · b997a8ba
Alex Deucher authored Dec 03, 2012
```
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
```
b997a8ba

drm/radeon: improve mc_stop/mc_resume on r5xx-r7xx · 6253e4c7

Alex Deucher authored Dec 12, 2012

Along the same lines of what was done for evergreen+
in the last kernel.
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

6253e4c7

drm/radeon: fix amd afusion gpu setup aka sumo v2 · bd25f078

Jerome Glisse authored Dec 11, 2012

Set the proper number of tile pipe that should be a multiple of
pipe depending on the number of se engine.

Fix:
https://bugs.freedesktop.org/show_bug.cgi?id=56405
https://bugs.freedesktop.org/show_bug.cgi?id=56720

v2: Don't change sumo2
Signed-off-by: Jerome Glisse <jglisse@redhat.com>
Cc: stable@vger.kernel.org
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>

bd25f078

drm/radeon: do not move bo to different placement at each cs · d025e9e2

Jerome Glisse authored Nov 29, 2012

The bo creation placement is where the bo will be. Instead of trying
to move bo at each command stream let this work to another worker
thread that will use more advance heuristic.

agd5f: remove leftover unused variable
Signed-off-by: Jerome Glisse <jglisse@redhat.com>
Reviewed-by: Alex Deucher <alexander.deucher@amd.com>

d025e9e2

10 Dec, 2012 18 commits

Merge branch 'drm-next-3.8' of git://people.freedesktop.org/~agd5f/linux into drm-next · a636a982

Dave Airlie authored Dec 11, 2012

Alex writes:
"adds support for the
asynchronous DMA engines on r6xx-SI.  These engines are used
for ttm bo moves and VM page table updates currently.  They
could also be exposed via the CS ioctl for userspace use,
but I haven't had a chance to add proper CS checker patches
for them yet.  These patches have been tested extensively
internally for months, so they should be pretty solid."

* 'drm-next-3.8' of git://people.freedesktop.org/~agd5f/linux:
  drm/radeon: use DMA engine for VM page table updates on SI
  drm/radeon: add dma engine support for vm pt updates on si (v2)
  drm/radeon: use DMA engine for VM page table updates on cayman/TN
  drm/radeon: add dma engine support for vm pt updates on ni (v5)
  drm/radeon: use async dma for ttm buffer moves on 6xx-SI
  drm/radeon/kms: add support for dma rings to radeon_test_moves()
  drm/radeon/kms: Add initial support for async DMA on SI
  drm/radeon/kms: Add initial support for async DMA on cayman/TN
  drm/radeon/kms: Add initial support for async DMA on evergreen
  drm/radeon/kms: Add initial support for async DMA on r6xx/r7xx

a636a982

drm/radeon: use DMA engine for VM page table updates on SI · bf66a786

Alex Deucher authored Oct 22, 2012

DMA engine has special packets to facilitate this and it also keeps
the 3D engine free for other things.
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

bf66a786

drm/radeon: add dma engine support for vm pt updates on si (v2) · deab48f1

Alex Deucher authored Oct 22, 2012

Async DMA has a special packet for contiguous pt updates
which saves overhead.

v2: rebase
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

deab48f1

drm/radeon: use DMA engine for VM page table updates on cayman/TN · 33e54678

Alex Deucher authored Oct 22, 2012

DMA engine has special packets to facilitate this and it also keeps
the 3D engine free for other things.
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

33e54678

drm/radeon: add dma engine support for vm pt updates on ni (v5) · 3b6b59b6

Alex Deucher authored Oct 22, 2012

Async DMA has a special packet for contiguous pt updates
which saves overhead.

v2: leave the CP method enabled for now as doing the updates
in the DMA rings is not working properly yet.

v3: update for 2 level pts

v4: rebase

v5: drop pte/pde packet.  doesn't seem to work on NI.
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

3b6b59b6

drm/radeon: use async dma for ttm buffer moves on 6xx-SI · 2d6cc729
Alex Deucher authored Jul 20, 2012
```
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
```
2d6cc729
drm/radeon/kms: add support for dma rings to radeon_test_moves() · 009ee7a0
Alex Deucher authored Jun 04, 2012
```
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
```
009ee7a0

drm/radeon/kms: Add initial support for async DMA on SI · 8c5fd7ef

Alex Deucher authored Dec 04, 2012

Pretty much the same as cayman.  Some changes to the copy
packets.
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

8c5fd7ef

drm/radeon/kms: Add initial support for async DMA on cayman/TN · f60cbd11

Alex Deucher authored Dec 04, 2012

There are 2 async DMA engines on cayman, one at 0xd000 and
one at 0xd800.  The programming interface is the same as
evergreen however there are some changes to the commands
for using vmids.
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

f60cbd11

drm/radeon/kms: Add initial support for async DMA on evergreen · 233d1ad5

Alex Deucher authored Dec 04, 2012

Pretty similar to 6xx/7xx except the count field increased in the
packet header and the max IB size increased.
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

233d1ad5

drm/radeon/kms: Add initial support for async DMA on r6xx/r7xx · 4d75658b

Alex Deucher authored Sep 27, 2012

Uses the new multi-ring infrastucture.  6xx/7xx has a single
async DMA ring.
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

4d75658b

drm/ttm: remove no_wait_reserve, v3 · 97a875cb

Maarten Lankhorst authored Nov 28, 2012

All items on the lru list are always reservable, so this is a stupid
thing to keep. Not only that, it is used in a way which would
guarantee deadlocks if it were ever to be set to block on reserve.

This is a lot of churn, but mostly because of the removal of the
argument which can be nested arbitrarily deeply in many places.

No change of code in this patch except removal of the no_wait_reserve
argument, the previous patch removed the use of no_wait_reserve.

v2:
 - Warn if -EBUSY is returned on reservation, all objects on the list
   should be reservable. Adjusted patch slightly due to conflicts.
v3:
 - Focus on no_wait_reserve removal only.
Signed-off-by: Maarten Lankhorst <maarten.lankhorst@canonical.com>
Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com>
Signed-off-by: Dave Airlie <airlied@redhat.com>

97a875cb

drm/ttm: cope with reserved buffers on lru list in ttm_mem_evict_first, v2 · e7ab2019

Maarten Lankhorst authored Nov 28, 2012

Replace the goto loop with a simple for each loop, and only run the
delayed destroy cleanup if we can reserve the buffer first.

No race occurs, since lru lock is never dropped any more. An empty list
and a list full of unreservable buffers both cause -EBUSY to be returned,
which is identical to the previous situation, because previously buffers
on the lru list were always guaranteed to be reservable.

This should work since currently ttm guarantees items on the lru are
always reservable, and reserving items blockingly with some bo held
are enough to cause you to run into a deadlock.

Currently this is not a concern since removal off the lru list and
reservations are always done with atomically, but when this guarantee
no longer holds, we have to handle this situation or end up with
possible deadlocks.
Signed-off-by: Maarten Lankhorst <maarten.lankhorst@canonical.com>
Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com>
Signed-off-by: Dave Airlie <airlied@redhat.com>

e7ab2019

drm/ttm: cope with reserved buffers on swap list in ttm_bo_swapout, v2 · 2b7b3ad2

Maarten Lankhorst authored Nov 28, 2012

Replace the while loop with a simple for each loop, and only run the
delayed destroy cleanup if we can reserve the buffer first.

No race occurs, since lru lock is never dropped any more. An empty list
and a list full of unreservable buffers both cause -EBUSY to be returned,
which is identical to the previous situation.
Signed-off-by: Maarten Lankhorst <maarten.lankhorst@canonical.com>
Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com>
Signed-off-by: Dave Airlie <airlied@redhat.com>

2b7b3ad2

drm/ttm: call ttm_bo_cleanup_refs with reservation and lru lock held, v3 · 85b144f8

Maarten Lankhorst authored Nov 29, 2012

By removing the unlocking of lru and retaking it immediately, a race is
removed where the bo is taken off the swap list or the lru list between
the unlock and relock. As such the cleanup_refs code can be simplified,
it will attempt to call ttm_bo_wait non-blockingly, and if it fails
it will drop the locks and perform a blocking wait, or return an error
if no_wait_gpu was set.

The need for looping is also eliminated, since swapout and evict_mem_first
will always follow the destruction path, no new fence is allowed
to be attached. As far as I can see this may already have been the case,
but the unlocking / relocking required a complicated loop to deal with
re-reservation.

Changes since v1:
 - Simplify no_wait_gpu case by folding it in with empty ddestroy.
 - Hold a reservation while calling ttm_bo_cleanup_memtype_use again.
Changes since v2:
 - Do not remove bo from lru list while waiting
Signed-off-by: Maarten Lankhorst <maarten.lankhorst@canonical.com>
Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com>
Signed-off-by: Dave Airlie <airlied@redhat.com>

85b144f8

drm/radeon: allow move_notify to be called without reservation · 6ed9ccb4

Maarten Lankhorst authored Nov 28, 2012

The few places that care should have those checks instead.
This allows destruction of bo backed memory without a reservation.
It's required for being able to rework the delayed destroy path,
as it is no longer guaranteed to hold a reservation before unlocking.

However any previous wait is still guaranteed to complete, and it's
one of the last things to be done before the buffer object is freed.
Signed-off-by: Maarten Lankhorst <maarten.lankhorst@canonical.com>
Signed-off-by: Dave Airlie <airlied@redhat.com>

6ed9ccb4

drm/ttm: change fence_lock to inner lock · 4154f051

Maarten Lankhorst authored Nov 28, 2012

This requires changing the order in ttm_bo_cleanup_refs_or_queue to
take the reservation first, as there is otherwise no race free way to
take lru lock before fence_lock.
Signed-off-by: Maarten Lankhorst <maarten.lankhorst@canonical.com>
Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com>
Signed-off-by: Dave Airlie <airlied@redhat.com>

4154f051

Merge branch 'drm-next-3.8' of git://people.freedesktop.org/~agd5f/linux into drm-next · 1a1494de

Dave Airlie authored Dec 08, 2012

Alex writes:
Pretty minor -next pull request.  We some additional new bits waiting
internally for release.  Hopefully Monday we can get at least some of
them out.  The others will probably take a few more weeks.

Highlights of the current request:
- ELD registers for passing audio information to the sound hardware
- Handle GPUVM page faults more gracefully
- Misc fixes

Merge radeon test
* 'drm-next-3.8' of git://people.freedesktop.org/~agd5f/linux: (483 commits)
  drm/radeon: bump driver version for new info ioctl requests
  drm/radeon: fix eDP clk and lane setup for scaled modes
  drm/radeon: add new INFO ioctl requests
  drm/radeon/dce32+: use fractional fb dividers for high clocks
  drm/radeon: use cached memory when evicting for vram on non agp
  drm/radeon: add a CS flag END_OF_FRAME
  drm/radeon: stop page faults from hanging the system (v2)
  drm/radeon/dce4/5: add registers for ELD handling
  drm/radeon/dce3.2: add registers for ELD handling
  radeon: fix pll/ctrc mapping on dce2 and dce3 hardware
  Linux 3.7-rc7
  powerpc/eeh: Do not invalidate PE properly
  Revert "drm/i915: enable rc6 on ilk again"
  ALSA: hda - Fix build without CONFIG_PM
  of/address: sparc: Declare of_iomap as an extern function for sparc again
  PM / QoS: fix wrong error-checking condition
  bnx2x: remove redundant warning log
  vxlan: fix command usage in its doc
  8139cp: revert "set ring address before enabling receiver"
  MPI: Fix compilation on MIPS with GCC 4.4 and newer
  ...

Conflicts:
	drivers/gpu/drm/exynos/exynos_drm_encoder.c
	drivers/gpu/drm/exynos/exynos_drm_fbdev.c
	drivers/gpu/drm/nouveau/core/engine/disp/nv50.c

1a1494de

08 Dec, 2012 6 commits

drm/radeon: bump driver version for new info ioctl requests · 71bfe916
Alex Deucher authored Dec 07, 2012
```
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
```
71bfe916

drm/radeon: fix eDP clk and lane setup for scaled modes · 93927f9c

Alex Deucher authored Dec 04, 2012

Need to use the adjusted mode since we are sending native
timing and using the scaler for non-native modes.
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
Reviewed-by: Jerome Glisse <jglisse@redhat.com>
cc: stable@vger.kernel.org

93927f9c

drm/radeon: add new INFO ioctl requests · 2e1a7674

Alex Deucher authored Dec 04, 2012

Add requests to get the number of shader engines (SE) and
the number of SH per SE.  These are needed for geometry
and tesselation shaders in the 3D driver as well as setting
up PA_SC_RASTER_CONFIG on SI asics.
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

2e1a7674

drm/radeon/dce32+: use fractional fb dividers for high clocks · a02dc74b

Alex Deucher authored Nov 13, 2012

Fixes flickering with some high res montiors.
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
CC: stable@vger.kernel.org

a02dc74b

drm/radeon: use cached memory when evicting for vram on non agp · 0d0b3e74

Jerome Glisse authored Nov 28, 2012

Force the use of cached memory when evicting from vram on non agp
hardware. Also force write combine on agp hw. This is to insure
the minimum cache type change when allocating memory and improving
memory eviction especialy on pci/pcie hw.
Signed-off-by: Jerome Glisse <jglisse@redhat.com>

0d0b3e74

drm/radeon: add a CS flag END_OF_FRAME · 57f57083

Marek Olšák authored Dec 02, 2012

No version bump is required because setting the flag on older DRM has
no effect.

This only reserves the bit and doesn't use it. I assume we will use it
for buffer eviction heuristics.
Signed-off-by: Marek Olšák <maraeo@gmail.com>

57f57083