Commits · 47fdda3f522399695107ef42f7803d7b25b9e1bf · Kirill Smelkov / gevent

25 Apr, 2018 1 commit
- lock: fix typo · 47fdda3f
  Florian Margaine authored Apr 25, 2018
```
It just bothers me so much whenever I hit it, sorry
```
  47fdda3f
23 Apr, 2018 6 commits

Merge pull request #1190 from gevent/cython-tracer · ab5272c5
Jason Madden authored Apr 23, 2018
```
Compile the monitor greenlet tracer with Cython
```
ab5272c5

Compile the monitor greenlet tracer with Cython. · aa911ed0

Jason Madden authored Apr 23, 2018

This makes things 54% faster. In fact, the monitor tracer is now faster than a trivial tracer implemented in python (settrace(lambda e, a: None)).

+-------------------+-----------------+-----------------------------+
| Benchmark         | 37_bench_tracer | 37_bench_tracer_cython_opt3 |
+===================+=================+=============================+
| monitor tracer    | 1.62 us         | 739 ns: 2.20x faster (-54%) |
+-------------------+-----------------+-----------------------------+
| max switch tracer | 3.06 us         | 874 ns: 3.50x faster (-71%) |
+-------------------+-----------------+-----------------------------+
| hub switch tracer | 2.16 us         | 815 ns: 2.66x faster (-62%) |
+-------------------+-----------------+-----------------------------+

Not significant (2): no tracer; trivial tracer

aa911ed0

Move the greenlet tracers to their own file and compile with cython. · 4231079a

Jason Madden authored Apr 23, 2018

Unoptimized still makes them 25% faster:

+-------------------+-----------------+------------------------------+
| Benchmark         | 37_bench_tracer | 37_bench_tracer_first_cython |
+===================+=================+==============================+
| trivial tracer    | 792 ns          | 786 ns: 1.01x faster (-1%)   |
+-------------------+-----------------+------------------------------+
| monitor tracer    | 1.62 us         | 1.24 us: 1.31x faster (-24%) |
+-------------------+-----------------+------------------------------+
| max switch tracer | 3.06 us         | 1.89 us: 1.62x faster (-38%) |
+-------------------+-----------------+------------------------------+
| hub switch tracer | 2.16 us         | 1.68 us: 1.29x faster (-22%) |
+-------------------+-----------------+------------------------------+

4231079a

Add benchmark for greenlet tracers. [skip ci] · 596773a1

Jason Madden authored Apr 23, 2018

Current numbers on 3.7b3:

no tracer: Mean +- std dev: 414 ns +- 10 ns
trivial tracer: Mean +- std dev: 792 ns +- 16 ns
monitor tracer: Mean +- std dev: 1.62 us +- 0.12 us
max switch tracer: Mean +- std dev: 3.06 us +- 0.12 us
hub switch tracer: Mean +- std dev: 2.16 us +- 0.04 us

596773a1

Merge pull request #1189 from felixonmars/patch-2 · 58754ba8
Jason Madden authored Apr 22, 2018
```
Fix a typo [skip ci]
```
58754ba8
Fix a typo · 9633ecc9
Felix Yan authored Apr 23, 2018

9633ecc9

20 Apr, 2018 3 commits

Merge pull request #1187 from gevent/static-libuv-patch · c733e399
Jason Madden authored Apr 20, 2018
```
Make the libuv run QUEUE part of the loop.
```
c733e399

Make the libuv run QUEUE part of the loop. · 0df38a9b

Jason Madden authored Apr 20, 2018

Benchmarking (link in the email) showed that malloc/free had substantial and widely varying overhead.

I didn't really see much of a difference in the gevent benchmarks, but I didn't run them all. However, if any patch gets upstreamed, it will probably be something like this.

The link referenced in the email contains the discussion on the libuv mailing list.

0df38a9b

Make the monitor thread survive a fork. (#1186) · d5942f40

Jason Madden authored Apr 20, 2018

* Make the monitor thread survive a fork.

* Add coverage for the new get_process.

* psutil is not on windows, move the test to a protected location.

* No, really.

d5942f40

19 Apr, 2018 7 commits
- Merge pull request #1184 from gevent/issue1182 · 3b12c0e8
  Jason Madden authored Apr 19, 2018
```
Add gevent.util.assert_switches 
```
  3b12c0e8
- Merge branch 'master' of https://github.com/gevent/gevent · 5f33d756
  Jason Madden authored Apr 19, 2018
  
  5f33d756
- Add gevent.util.assert_switches · 56cfd118
  Jason Madden authored Apr 19, 2018
```
Based on refactoring existing code in _monitor.py

Flexibly allows checking for any switching, switches that take longer
than a set amount of time, or switches that exceed a time limit
between getting back to the hub.

Fixes #1182.
```
  56cfd118
- Merge pull request #1183 from gevent/libuv-1.20.1 · f187fd05
  Jason Madden authored Apr 19, 2018
```
Update to libuv 1.20.1
```
  f187fd05
- Update to libuv 1.20.1 · 3324dab8
  Jason Madden authored Apr 19, 2018
  
  3324dab8
- Priorities: libuv doesnt have them [skip ci] · 46a133db
  Jason Madden authored Apr 19, 2018
  
  46a133db
- Add loop implementation docs. Fixes #1060. [skip ci] · bc184e2f
  Jason Madden authored Apr 19, 2018
  
  bc184e2f
18 Apr, 2018 5 commits

Fill in whatsnew_1_3.rst [skip ci] · 72c428f3
Jason Madden authored Apr 18, 2018

72c428f3
Add document about monitoring/debugging. [skip ci] · 643ee9fc
Jason Madden authored Apr 18, 2018

643ee9fc
Merge pull request #1181 from gevent/libuv120 · 43848b60
Jason Madden authored Apr 18, 2018
```
Update libuv to 1.20
```
43848b60
Update libuv to 1.20 · 3a599958
Jason Madden authored Apr 18, 2018

3a599958

More socket benchmarks [skip ci] · 7a0a9585

Jason Madden authored Apr 18, 2018

We're more competitive using a forked process:

.....................
gevent socketpair sendall greenlet: Mean +- std dev: 256 ms +- 6 ms
.....................
native socketpair sendall thread: Mean +- std dev: 44.4 ms +- 1.5 ms
.....................
WARNING: the benchmark result may be unstable
* the standard deviation (7.31 ms) is 13% of the mean (57.9 ms)
* the maximum (89.0 ms) is 54% greater than the mean (57.9 ms)

Try to rerun the benchmark with more runs, values and/or loops.
Run 'python -m perf system tune' command to reduce the system jitter.
Use perf stats, perf dump and perf hist to analyze results.
Use --quiet option to hide these warnings.

gevent socketpair sendall fork: Mean +- std dev: 57.9 ms +- 7.3 ms
.....................
native socketpair sendall fork: Mean +- std dev: 44.6 ms +- 2.0 ms
.....................
native udp sendto: Mean +- std dev: 30.6 ms +- 1.1 ms
.....................
gevent udp sendto: Mean +- std dev: 37.5 ms +- 2.7 ms

(python 3.7)

7a0a9585

16 Apr, 2018 4 commits

Documenting libuv and 1.3 updates [skip ci] · 6e2ab023

Jason Madden authored Apr 16, 2018

This addresses the first few points of #1060.

We now have a basic documentation for what a loop looks like, and we
have documented watcher.close()

Hub is better documented (only the parts that should be documented are
documented).

get/setswitchinterval is documnted.

Several xrefs are fixed up.

6e2ab023

Sigh. Pip changed the semantics of -U in pip 10. go back to the old way so... · 383c1c19

Jason Madden authored Apr 16, 2018

Sigh. Pip changed the semantics of -U in pip 10. go back to the old way so we're actually testing changing deps; if it breaks we need to know.

383c1c19

Merge pull request #1176 from gevent/issue1172 · 6766b088
Jason Madden authored Apr 16, 2018
```
Use /dev/fd|/proc/self/fd to get open FDs to close in Popen
```
6766b088

Add unit testing for the new close_fd functions using mocks · ccc47e33

Jason Madden authored Apr 16, 2018

Should get them to complete coverage.

Also move our test dependencies into the standard 'test' extra and
install mock (and futures) only on Python 2.7.

ccc47e33

15 Apr, 2018 7 commits

Use /dev/fd|/proc/self/fd to get open FDs to close in Popen · 461d6966

Jason Madden authored Apr 15, 2018

If those aren't available, use the old brute-force approach. This is
closer to what CPython does in its C implementation, and is much
faster.

We don't have to worry about the async signal safe stuff the C code
does because, guess what, we're running Python code here already
anyway, so much of it could wind up doing something that's not
actually safe anyway. Oh well.

Since we depend on Python 3.4 and above now, we can rely on the
CLOEXEC flag being set by default and not have to manually check
everything.

This speeds up 2.7 (close_fds defaults to *false* there, so the
default case doesn't change):

| Benchmark                 | 27_bench_subprocess | 27_bench_subprocess_dirfd     |
+---------------------------+---------------------+-------------------------------+
| spawn native no close_fds | 1.81 ms             | 1.79 ms: 1.01x faster (-1%)   |
| spawn gevent no close_fds | 2.11 ms             | 2.20 ms: 1.04x slower (+4%)   |
| spawn native close_fds    | 31.0 ms             | 30.2 ms: 1.03x faster (-3%)   |
| spawn gevent close_fds    | 31.6 ms             | 2.56 ms: 12.31x faster (-92%) |

And it really speeds up 3.7 (close_fds defaults to *true* there, so
the default case is much faster, and the non-default case is even better):

| Benchmark                 | 37_bench_subprocess | 37_bench_subprocess_dirfd     |
+---------------------------+---------------------+-------------------------------+
| spawn native no close_fds | 1.34 ms             | 1.27 ms: 1.06x faster (-6%)   |
| spawn gevent no close_fds | 117 ms              | 3.05 ms: 38.27x faster (-97%) |
| spawn native close_fds    | 1.36 ms             | 1.30 ms: 1.04x faster (-4%)   |
| spawn gevent close_fds    | 32.5 ms             | 3.34 ms: 9.75x faster (-90%)  |

Fixes #1172

461d6966

Add benchmark for using subprocess.Popen to create /usr/bin/true [skip ci] · b166aaf8

Jason Madden authored Apr 15, 2018

In response to #1172

The following numbers are for my machine on macOS 10.13.3 with MAXFD
of 50000.

Python 2.7:

.....................
spawn native no close_fds: Mean +- std dev: 1.81 ms +- 0.04 ms
.....................
spawn gevent no close_fds: Mean +- std dev: 2.11 ms +- 0.08 ms
.....................
spawn native close_fds: Mean +- std dev: 31.0 ms +- 0.7 ms
.....................
spawn gevent close_fds: Mean +- std dev: 31.6 ms +- 0.6 ms

Notice that the times when close_fd=True (not the default on 2.7) are
about the same. 2.7 uses the same Python loop we do to close all the
fds.

Now 3.7:

.....................
spawn native no close_fds: Mean +- std dev: 1.34 ms +- 0.04 ms
.....................
spawn gevent no close_fds: Mean +- std dev: 117 ms +- 2 ms
.....................
spawn native close_fds: Mean +- std dev: 1.36 ms +- 0.03 ms
.....................
spawn gevent close_fds: Mean +- std dev: 32.5 ms +- 0.4 ms

Notice that gevent is *much* slower when we *don't* close the fds.
This is because, starting in Python 3.4, close_fds defaults to true,
and when it's false we have to check os.get_inheritable() for each fd
before we close it. gevent performs the same as it did on Python 2.7
when closing fds, but the native implementation is much faster due to
the C optimizations outlined in #1172---it turns out they apply to BSD
and Apple platforms in addition to Linux, although they're not async
safe.

Now, the C code does the opposite for inheritable handles: it
explicitly calls make_inheritable() for the ones it wants to keep and
lets the OS close the others with CLOEXEC. We could probably do that
too; the slow down for this case counts as a regression, I think.

b166aaf8

Reorganize docs. · b7c9a1df

Jason Madden authored Apr 15, 2018

Move the API reference to its own (sectioned) page to clean up the main page. Break up the massive 'gevent' module page into more digestable parts and add more xrefs.

b7c9a1df

Some minor documentation cleanup for intro and basics. Prepping for some re-org. · 712b69b4
Jason Madden authored Apr 15, 2018

712b69b4
Explicitly publish and link to examples so they're consistent with a given release. · 35348992
Jason Madden authored Apr 15, 2018
```
[skip ci]
```
35348992
Add PEP345 project URLs since warehouse displays them now. · f6c662a8
Jason Madden authored Apr 15, 2018

f6c662a8
Merge pull request #1174 from gevent/cffi-win · e2aef582
Jason Madden authored Apr 15, 2018
```
Use environment markers to install CFFI on windows so the libuv backend can really be default
```
e2aef582

14 Apr, 2018 7 commits
- Don't try to install cffi on pypy via setup.py since that seems to break things. · 7fdfbc3b
  Jason Madden authored Apr 14, 2018
  
  7fdfbc3b
- We do need requests. · f084e4f3
  Jason Madden authored Apr 14, 2018
  
  f084e4f3
- Try to install less stuff: cryptography doesn't build on windows · c516aebd
  Jason Madden authored Apr 14, 2018
  
  c516aebd
- Another pass at pip 10 · 0790ac2d
  Jason Madden authored Apr 14, 2018
  
  0790ac2d
- Still no go with pip 10. Try to simplify and build psutil. · 54f9f43e
  Jason Madden authored Apr 14, 2018
  
  54f9f43e
- Use environment markers to install cffi on windows. · 14544dae
  Jason Madden authored Apr 14, 2018
```
Also try working around all the failed builds that pip 10 created: https://ci.appveyor.com/project/denik/gevent/build/1.0.1667/job/9jarsdxb6y0x2smd
```
  14544dae
- I actually built those wheels with Cython 28.2. · 38b9f54f
  Jason Madden authored Apr 14, 2018
```
[skip ci]
```
  38b9f54f