Commits · 15928944a1886272df0a720c750f3073c1b87109 · Boxiang Sun / Pyston

24 Jun, 2015 1 commit
- Try again to make weakref1 more robust · 15928944
  Kevin Modzelewski authored Jun 24, 2015
  
  15928944
23 Jun, 2015 8 commits
- Get rid of runtime/capi.h · 59479580
  Kevin Modzelewski authored Jun 23, 2015
  
  59479580
- Move these PySequence_* methods into abstract.cpp · 0809576c
  Kevin Modzelewski authored Jun 23, 2015
  
  0809576c
- Move most descriptor-related code into descr.cpp · 7f876c25
  Kevin Modzelewski authored Jun 23, 2015
```
I think it now contains the same types that CPython's
descrobject.c handles.
```
  7f876c25
- Merge pull request #631 from kmod/perf4 · df9ec460
  Kevin Modzelewski authored Jun 23, 2015
```
more rewriting
```
  df9ec460
- Don't bother propagating garbage, just pass NULL · 73fc53c9
  Kevin Modzelewski authored Jun 23, 2015
  
  73fc53c9
- llvm::DenseMap is unsuitable with arbitrary integer keys · e3650a48
  Kevin Modzelewski authored Jun 23, 2015
```
It needs to reserve some special key values that the caller
has to promise to never insert into the map.  For integers that's
-1 and -2... which we were using and then got some random data back.
```
  e3650a48
- Merge pull request #632 from toshok/mmap-larger-regions · 9415d7c1
  Kevin Modzelewski authored Jun 22, 2015
```
don't mmap such small regions
```
  9415d7c1
- don't mmap such small regions - linux clusters page faults so the larger the... · cdfbb8ee
  Chris Toshok authored Jun 23, 2015
```
don't mmap such small regions - linux clusters page faults so the larger the regions we mmap the better, and larger regions don't cost us anything
```
  cdfbb8ee
22 Jun, 2015 9 commits

Add a somewhat-rewritten case to callFunc · 45ffeafe

Kevin Modzelewski authored Jun 21, 2015

ie in cases that we couldn't rewrite all of the argument conversion,
emit a rewrite that's just a call directly to callFunc, so we can skip
the earlier parts of the dispatching.

I think there is more we can do here though; there are also more places
that this kind of approach would be applicable.

45ffeafe

Rewrite some simple cases that use varargs/kwargs · f71b8b75
Kevin Modzelewski authored Jun 20, 2015

f71b8b75
Rewrite some more builtin-type constructors · 084ab6ca
Kevin Modzelewski authored Jun 20, 2015

084ab6ca
Try to make some more weakref tests more robust · fa7b5e3b
Kevin Modzelewski authored Jun 22, 2015

fa7b5e3b
Merge pull request #630 from kmod/gc_fixes · 06a07a2c
Kevin Modzelewski authored Jun 22, 2015
```
fix two minor bugs
```
06a07a2c

Support buggy callers of PyString_GET_SIZE · 5592f785

Kevin Modzelewski authored Jun 22, 2015

In CPython, PyString_GET_SIZE and PyUnicode_GET_SIZE happen to have
the same implementation, so it is ok for callers to be wrong about
which one they call.

For us, previous to this commit, PyString_GET_SIZE would call PyString_Size,
which has different behavior than PyString_GET_SIZE for non-strings.
For unicode objects it means that we would return the size of the encoded
representation of the string rather than the number of unicode characters.
It also means that PyString_GET_SIZE can fail for us and return -1,
which it never can for CPython, so callers get away with not
checking the return value.

5592f785

Switch this to use the safer root registration form · 1f434147

Kevin Modzelewski authored Jun 22, 2015

Doesn't usually matter, but if we crank up the collection frequency
then it's possible to get a collection between the allocation
and the registration.

1f434147

Merge pull request #628 from kmod/perf3 · 6c273fbe
Kevin Modzelewski authored Jun 22, 2015
```
rewrite wrapperdescriptors
```
6c273fbe

Add additional args-in-place check right before jumps · 4d4eeeab

Kevin Modzelewski authored Jun 22, 2015

We would check that they're in the right place as part of
restoreArgs (ie putting them in the right place), but then we
would do some other work (loading the value to check, loading
the constant) which could potentially spill registers.

So add an additional check right before we emit the actual
jump instruction.  I'm surprised this never failed.

4d4eeeab

20 Jun, 2015 16 commits

Have the tester print out failed stats · c121fa51

Kevin Modzelewski authored Jun 20, 2015

I think the issue though was that I had tried to optimize the
heuristic for "can this be a patchpoint" to be too aggressive and
we were getting false negatives on the release build (but not the
debug build).

c121fa51

Make the TRACE_LOG helpers available to other files · 7529462c
Kevin Modzelewski authored Jun 20, 2015
```
and rename them to make it more clear that it's GC-related.
```
7529462c

Zero out 'globals' in the ASTInterpreter constructor · 085c18b9

Kevin Modzelewski authored Jun 20, 2015

Maybe the GC should zero-initialize everything, but it doesn't,
so we would get garbage values in that field.

085c18b9

Add support for rewriting wrapperdescriptors · 9d5b5a63

Kevin Modzelewski authored Jun 20, 2015

Involves a couple changes:
- have the rewriter treat certain callsites as non-mutations
- add special cases for wrapperdescr objects

9d5b5a63

Rename some of the stattimers · f8b16dbd

Kevin Modzelewski authored Jun 20, 2015

I renamed all the "about to enter jitted code" and "about to enter the interpreter"
stats to "in_jitted_code" and "in_interpreter", respectively; I don't think the
exact entry point ends up mattering that much.

A lot of stuff is showing up as "in_jitted_code"; I tried to find some of it using
the new itimer helper, and put some separate timers on those.

f8b16dbd

sum() was failing its ics by running out of instruction space · fa30790c
Kevin Modzelewski authored Jun 20, 2015

fa30790c

Some more debugging for parsing issue · a1d9091d

Kevin Modzelewski authored Jun 20, 2015

Seeing some corrupted-pyc-file issues on travis-ci,
that I can't reproduce locally.  Add some more debugging output
for when it happens again.

a1d9091d

Helper for pinning down where time in a stattimer happens · f1d03e81

Kevin Modzelewski authored Jun 12, 2015

If you define INVESTIGATE_STAT_TIMER to the name of the timer you want
to investigate, we will set an itimer that raises a SIGTRAP if you are
in that particular timer, but ignores the signal otherwise. There's no
tooling on top of it, but just running that inside gdb is already helpful.

f1d03e81

Merge commit 'pr/624' · cf2c2828
Kevin Modzelewski authored Jun 20, 2015
```
Conflicts:
	src/codegen/unwinding.cpp
```
cf2c2828
Merge pull request #626 from kmod/perf2 · 767a5193
Kevin Modzelewski authored Jun 19, 2015
```
make a couple more places successfully rewrite
```
767a5193

Don't crash in a couple rewriter cases · 2c93ce12

Kevin Modzelewski authored Jun 12, 2015

- if we try guarding after a mutation
- if we use all of our scratch space

Now, just set a "failed" flag internally and which prevents committing.

The motivation for the first part is trying to get rewrite calls to tp_getattro;
if the rewrite is from getattr then it will succeed, but if it comes from
callattr then we will want to do some more guards after the tp_getattro.  We
could try to pass that state around, but for now just use the 'failed' approach.

2c93ce12

Can rewrite some extra callattr edgecases · b5d994be

Kevin Modzelewski authored Jun 20, 2015

We don't usually call callattr with null_on_nonexistent,
but we do for __hasattr__ checking.  We can rewrite those
to just do the guards and then return NULL.

b5d994be

Rewrite calls to astInterpretFunction · cb5f86e6
Kevin Modzelewski authored Jun 20, 2015
```
kind of hacky but I think it's ok for now.
```
cb5f86e6

Add more special-cases to nonzero · 2dcbe804

Kevin Modzelewski authored Jun 20, 2015

We could also add more general rewriting, but
- these new special cases catch something like
  95% of the cases that we weren't rewriting
- these special cases are faster than doing the generic
  nonzerno mechanism (looking up the attribute, etc)

It'd be nice if we could get to the point that the generic
rewrites we'd create would be as good as the hand-crafted ones,
but that would require knowing that we don't need to guard on
constant classes, and then inlining within rewrites.

2dcbe804

Merge pull request #625 from kmod/perf · 5294816c
Kevin Modzelewski authored Jun 19, 2015
```
Some more perf hunting
```
5294816c
Rewrite calls to BoxedMethodDescriptors · acdd2cd8
Kevin Modzelewski authored Jun 19, 2015
```
ie one of the common entrypoints to capi code.
```
acdd2cd8

19 Jun, 2015 6 commits

Fix rewriter bug when instancemethod is on a non-function · e9661371

Kevin Modzelewski authored Jun 19, 2015

ie when it's on a builtin method.  We assumed at some point
that we wouldn't need to look at the function object, but now
that we can rewrite method_cls calls, that's not true.

e9661371

Add native tp_descr_get for function_cls and wrapperdescr_cls · a20bcabd
Kevin Modzelewski authored Jun 19, 2015

a20bcabd
Merge pull request #623 from kmod/stats · 85f8fff2
Kevin Modzelewski authored Jun 19, 2015
```
Playing with stattimers
```
85f8fff2
add a comment to tester.py so that everyone's test runs will regenerate expected caches · e7355582
Chris Toshok authored Jun 19, 2015

e7355582
use None as the sentinel value for traceback chains · b30c50ae
Chris Toshok authored Jun 19, 2015

b30c50ae

Add the concept of stattimer "avoidability" · 2cbc3c4a

Kevin Modzelewski authored Jun 19, 2015

ie roll up all the time into the most "avoidable" reason that we were doing it.
For example, if we are doing something like calling slot_tp_getttro on a builtin
type (very avoidable), roll up all the subsequent time (runtimeCall, etc) into
the slot_tp_getattro timer.  But if we call runtimeCall where we couldn't avoid it
(ex from the interpreter), log that separately.

Not sure how helpful it will be but for this specific investigation it seems to
somewhat work.  The idea of the "avoidability" is definitely pretty specific to
the type of work that you are thinking of doing; the numbers I put in are for
investigating slowpaths.

Also, remove all the timers that we have on specific runtime functions (ex: listMul).
I think we'll need another strategy for those.

2cbc3c4a