Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Support
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
C
cython
Project overview
Project overview
Details
Activity
Releases
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Issues
0
Issues
0
List
Boards
Labels
Milestones
Merge Requests
0
Merge Requests
0
Analytics
Analytics
Repository
Value Stream
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Create a new issue
Commits
Issue Boards
Open sidebar
Kirill Smelkov
cython
Commits
70c5f657
Commit
70c5f657
authored
Jul 29, 2011
by
Mark Florisson
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
Update OpenMP docs
parent
ffbdf766
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
45 additions
and
20 deletions
+45
-20
docs/src/userguide/parallelism.rst
docs/src/userguide/parallelism.rst
+28
-20
tests/run/sequential_parallel.pyx
tests/run/sequential_parallel.pyx
+17
-0
No files found.
docs/src/userguide/parallelism.rst
View file @
70c5f657
...
...
@@ -20,12 +20,14 @@ currently supports OpenMP, but later on more backends might be supported.
Thread-locality and reductions are automatically inferred for variables.
If you assign to a variable, it becomes lastprivate, meaning that the
If you assign to a variable
in a prange block
, it becomes lastprivate, meaning that the
variable will contain the value from the last iteration. If you use an
inplace operator on a variable, it becomes a reduction, meaning that the
values from the thread-local copies of the variable will be reduced with
the operator and assigned to the original variable after the loop. The
index variable is always lastprivate.
Variables assigned to in a parallel with block will be private and unusable
after the block, as there is no concept of a sequentially last value.
The ``schedule`` is passed to OpenMP and can be one of the following:
...
...
@@ -88,31 +90,31 @@ currently supports OpenMP, but later on more backends might be supported.
buffers used by a prange. A contained prange will be a worksharing loop
that is not parallel, so any variable assigned to in the parallel section
is also private to the prange. Variables that are private in the parallel
block are una
ltered
after the parallel block.
block are una
vailable
after the parallel block.
Example with thread-local buffers::
from cython.parallel import *
from cython.stdlib cimport abort
from cython.parallel import *
from libc.stdlib cimport abort, malloc, free
cdef Py_ssize_t
i, n = 100
cdef int * local_buf
cdef size_t size = 10
cdef Py_ssize_t idx,
i, n = 100
cdef int * local_buf
cdef size_t size = 10
with nogil, parallel
:
local_buf =
malloc(sizeof(int) * size)
if local_buf == NULL:
abort()
with nogil, parallel()
:
local_buf = <int *>
malloc(sizeof(int) * size)
if local_buf == NULL:
abort()
# populate our local buffer in a sequential loop
for i
in range(size):
local_buf[i] = i * 2
# populate our local buffer in a sequential loop
for idx
in range(size):
local_buf[i] = i * 2
# share the work using the thread-local buffer(s)
for i in prange(n, schedule='guided'):
func(local_buf)
# share the work using the thread-local buffer(s)
for i in prange(n, schedule='guided'):
func(local_buf)
free(local_buf)
free(local_buf)
Later on sections might be supported in parallel blocks, to distribute
code sections of work among threads.
...
...
@@ -135,7 +137,7 @@ enable OpenMP. For gcc this can be done as follows in a setup.py::
"hello",
["hello.pyx"],
extra_compile_args=['-fopenmp'],
libraries=['go
mp'],
extra_link_args=['-fopen
mp'],
)
setup(
...
...
@@ -158,7 +160,7 @@ particular order::
from cython.parallel import prange
def
func(Py_ssize_t n):
cdef int
func(Py_ssize_t n):
cdef Py_ssize_t i
for i in prange(n, nogil=True):
...
...
@@ -173,6 +175,12 @@ particular order::
In the example above it is undefined whether an exception shall be raised,
whether it will simply break or whether it will return 2.
Nested Parallelism
==================
Nested parallelism is currently disabled due to a bug in gcc 4.5 [#]_. However,
you can freely call functions with parallel sections from a parallel section.
.. rubric:: References
.. [#] http://www.openmp.org/mp-documents/spec30.pdf
.. [#] http://gcc.gnu.org/bugzilla/show_bug.cgi?id=49897
tests/run/sequential_parallel.pyx
View file @
70c5f657
...
...
@@ -688,3 +688,20 @@ def test_parallel_with_gil_continue_unnested():
sum
+=
i
print
sum
cdef
int
inner_parallel_section
()
nogil
:
cdef
int
j
,
sum
=
0
for
j
in
prange
(
10
):
sum
+=
j
return
sum
def
outer_parallel_section
():
"""
>>> outer_parallel_section()
450
"""
cdef
int
i
,
sum
=
0
for
i
in
prange
(
10
,
nogil
=
True
):
sum
+=
inner_parallel_section
()
return
sum
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment