Commit 06f75b80 authored by Robert Bradshaw's avatar Robert Bradshaw

Suggest disabling wraparound rather than unsigned indices.

parent 681da667
...@@ -227,7 +227,8 @@ The array lookups are still slowed down by two factors: ...@@ -227,7 +227,8 @@ The array lookups are still slowed down by two factors:
... ...
cimport cython cimport cython
@cython.boundscheck(False) # turn of bounds-checking for entire function @cython.boundscheck(False) # turn off bounds-checking for entire function
@cython.wraparound(False) # turn off negative index wrapping for entire function
def naive_convolve(np.ndarray[DTYPE_t, ndim=2] f, np.ndarray[DTYPE_t, ndim=2] g): def naive_convolve(np.ndarray[DTYPE_t, ndim=2] f, np.ndarray[DTYPE_t, ndim=2] g):
... ...
...@@ -237,32 +238,9 @@ and in the worst case corrupt data). It is possible to switch bounds-checking ...@@ -237,32 +238,9 @@ and in the worst case corrupt data). It is possible to switch bounds-checking
mode in many ways, see :ref:`compiler-directives` for more mode in many ways, see :ref:`compiler-directives` for more
information. information.
Negative indices are dealt with by ensuring Cython that the indices will be Also, we've disabled the check to wrap negative indices (e.g. g[-1] giving
positive, by casting the variables to unsigned integer types (if you do have the last value). As with disabling bounds checking, bad things will happen
negative values, then this casting will create a very large positive value if we try to actually use negative indices with this disabled.
instead and you will attempt to access out-of-bounds values). Casting is done
with a special ``<>``-syntax. The code below is changed to use either
unsigned ints or casting as appropriate::
...
cdef int s, t # changed
cdef unsigned int x, y, v, w # changed
cdef int s_from, s_to, t_from, t_to
cdef DTYPE_t value
for x in range(xmax):
for y in range(ymax):
s_from = max(smid - x, -smid)
s_to = min((xmax - x) - smid, smid + 1)
t_from = max(tmid - y, -tmid)
t_to = min((ymax - y) - tmid, tmid + 1)
value = 0
for s in range(s_from, s_to):
for t in range(t_from, t_to):
v = <unsigned int>(x - smid + s) # changed
w = <unsigned int>(y - tmid + t) # changed
value += g[<unsigned int>(smid - s), <unsigned int>(tmid - t)] * f[v, w] # changed
h[x, y] = value
...
The function call overhead now starts to play a role, so we compare the latter The function call overhead now starts to play a role, so we compare the latter
two examples with larger N: two examples with larger N:
...@@ -310,4 +288,3 @@ There is some speed penalty to this though (as one makes more assumptions ...@@ -310,4 +288,3 @@ There is some speed penalty to this though (as one makes more assumptions
compile-time if the type is set to :obj:`np.ndarray`, specifically it is compile-time if the type is set to :obj:`np.ndarray`, specifically it is
assumed that the data is stored in pure strided mode and not in indirect assumed that the data is stored in pure strided mode and not in indirect
mode). mode).
Markdown is supported
0%
or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment