Commits · 9aac061c68930cf23338534e651e4e5387d0197e · Boxiang Sun / Pyston

19 Feb, 2015 1 commit
- add tp_getattr support, and most of PyNumber_Int's behavior · 9aac061c
  Chris Toshok authored 9 years ago
  
  9aac061c
18 Feb, 2015 2 commits
- Teach len() how to rewrite itself · ee7cf48d
  Marius Wachtler authored 9 years ago
  
  -15% for fasta.py
  
  ee7cf48d
- Allow rewriting 1-arg calls to type() · a4722ed0
  Kevin Modzelewski authored 9 years ago
  
  a4722ed0
13 Feb, 2015 3 commits

Cleanup: add settable OSR/reopt thresholds, and get rid of tier 2 · 7cf92757

Previously it was:
tier 0: ast interpreter
tier 1: llvm, no speculations, no llvm opts
tier 2: llvm, w/ speculations, no llvm opts
tier 3: llvm, w/ speculations, w/ llvm opts

tier 2 seemed pretty useless, and very little would stay in it.  Also,
OSR would always skip from tier 1 to tier 3.

Separately, add configurable OSR/reopt thresholds.  This is mostly for the
sake of tests, where we can set lower limits and force OSR/reopts to happen.

7cf92757

move getreversed from objmodel.cpp to builtin_module/builtins.cpp · 280146ff
Chris Toshok authored 9 years ago

280146ff
add reversed() builtin function, as well as compatible __reversed__ methods on list/xrange. · b131d40c
Chris Toshok authored 9 years ago

b131d40c

12 Feb, 2015 4 commits

Have to reset more state in PyEval_ReInitThreads · bf7b9b47

Kevin Modzelewski authored 9 years ago

We were having problems with spawning subprocesses from threads,
since the children would inherit the "wait for another thread to acquire
the GIL" flag, but would not inherit the thread that would actually
try to acquire the GIL; this would make the child hang.

bf7b9b47

Assert that our return-type annotations are correct · 078c5bfa
Kevin Modzelewski authored 9 years ago

And fix a bunch of places where they weren't.

078c5bfa
Use the ICInvalidator to invalidate embedded references to BoxedFunctions when they get freed · ee247d3e
Kevin Modzelewski authored 9 years ago

ee247d3e

Only rewrite IC slots that are not currently being executed · aff2d9a4

Kevin Modzelewski authored 9 years ago

Before, we tried to do it based on whether the rewrite we were
adding was "compatible" with whatever was already in there.  But we
weren't really doing this and there were a lot of limitations with this
method anyway.

aff2d9a4

11 Feb, 2015 1 commit

Scan ht_name for heap types · 038eb9de

Kevin Modzelewski authored 9 years ago

We weren't scanning it as part of the gc handler, so eventually it would
get collected and then __name__ would break.  (un)fortunately, in the
cases that would use it, it looks like it would always get replaced by
another string (the only thing being allocated out of that size bin?),
so it would look like the __name__ was just wrong.

Anyway, this should fix interp2.py

038eb9de

10 Feb, 2015 1 commit

Let object.__new__ take kwargs · 1427169a

Kevin Modzelewski authored 9 years ago

Also, improve rewriting to still be able to rewrite object construction.
For now, be able to rewrite the case where a function takes kwargs but
the kwargs dict is empty.

Also, add an even faster path to typeCallInternal.  This is partially
obviated by the improved rewriting, but we might as well keep it.

1427169a

07 Feb, 2015 3 commits
- give all CLFunctions a ParamNames (renamed from ArgNames) including builtin ones · 5de99b04
  Travis Hance authored 9 years ago
  
  5de99b04
- A number of random changes from trying to run tester.py · 8d247e34
  Kevin Modzelewski authored 9 years ago
  
  8d247e34
- implement set for getset descriptors, and setting for type __name__ · 3dbee69b
  Travis Hance authored 9 years ago
  
  3dbee69b
06 Feb, 2015 3 commits

Basic 'traceback' object support · e22da6f3

Kevin Modzelewski authored 9 years ago

Previously we were just passing around a vector<> of LineInfos; now, they
get encapsulated in a BoxedTraceback object. This has a couple benefits:
1) they can participate in the existing sys.exc_info passing+handling
2) we can enable a basic form of the traceback module.

2 means that we can finally test our traceback support, since I was constantly
fixing one issue only to break it in another place.

1 means that we now generate the right traceback for the current exception!
Before this change, the traceback we would generate was determined using a different
system than the exc_info-based exception raising, so sometimes they would diverge
and be horribly confusing.

There's a pretty big limitation with the current implementation: our tracebacks
don't span the right stack frames. In CPython, a traceback spans the stack frames
between the raise and the catch, but in Pyston the traceback includes all stack frames.
It's not easy to provide this behavior, since the tracebacks are supposed to get updated
as they get rethrown through each stack frame.

We could do some complicated stuff in irgen to make sure this happens. I think the better
but more complicated approach is for us to create the custom exception unwinder we've been
wanting. This would let us add custom traceback-handling support as we unwound the stack.

Another limitation is that tracebacks are supposed to automatically include a reference
to the entire frame stack (tb.tb_frame.f_back.f_back.f_back....). In Pyston, we're not
automatically generating those frame objects, so we would either need to do that and take
a perf hit, or (more likely?) generate the frame objects on-demand when they're needed.

It's not really clear that they're actually needed for traceback objects, so I implemented
a different traceback object API and changed the traceback.py library, under the assumption
that almost no one actually deals with the traceback object internals.
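
To make the CPython behavior referred to above concrete, here is a small Python 3 sketch (the function names `inner`, `middle`, `outer` and `caller` are made up for illustration). It shows that a CPython traceback covers only the frames between the catch and the raise, not the frames further up the stack:

```python
import sys
import traceback


def inner():
    raise ValueError("boom")


def middle():
    inner()


def outer():
    try:
        middle()
    except ValueError:
        # The traceback starts at the frame that catches the exception
        # and ends at the frame that raised it.
        tb = sys.exc_info()[2]
        return [frame.name for frame in traceback.extract_tb(tb)]


def caller():
    # This frame is above the catch, so it is not part of the traceback.
    return outer()


frames = caller()
print(frames)
```

A traceback that included all stack frames, as described for Pyston at this commit, would additionally list `caller` and the module-level frame.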

e22da6f3

builtin functions · 128cb7c7
Travis Hance authored 9 years ago

128cb7c7

Migrate to the CPython file object format · b1e0937b

Kevin Modzelewski authored 9 years ago

Trying to patch up our file support so that we match CPython's behavior and
functionality more closely; this is the first step.

b1e0937b

05 Feb, 2015 3 commits

getset descriptors, __name__ for type and functions · ee548408
Travis Hance authored 9 years ago

ee548408
Allow patching capi calls of the form 'math.sqrt' · 2df05571
Kevin Modzelewski authored 9 years ago

2df05571

Give CApiFunctions their own custom 'internal callable' · 874ea0ee

Kevin Modzelewski authored 9 years ago

The 'internal callable' (bad name, sorry) is what defines how
the arguments get mapped to the parameters, and potentially also does
rewriting.

By providing a custom internal callable, we can make use of special knowledge
about how C API functions work.  In particular, we can skip the allocation
of the args + kwargs objects when we are calling an object with the METH_O
signature.

This patch includes rewriting support, though we don't currently allow
rewriting CAPI functions as part of callattrs.

874ea0ee

04 Feb, 2015 1 commit

Intern most codegen strings · 325dbfeb

Kevin Modzelewski authored 9 years ago

Most importantly, intern all the strings we put into the AST* nodes.
(the AST_Module* owns them)

This should save us some memory, but it also improves performance pretty
substantially since now we can do string comparisons very cheaply.  Performance
of the interpreter tier is up by something like 30%, and JIT-compilation times
are down as well (though not by as much as I was hoping).

The overall effect on perf is more muted since we tier out of the interpreter
pretty quickly; to see more benefit, we'll have to retune the OSR/reopt thresholds.

For better or worse (mostly better IMO), the interned-ness is encoded in the type
system, and things will not automatically convert between an InternedString and
a std::string.  It means that this diff is quite large, but it also makes it a lot
more clear where we are making our string copies or have other room for optimization.
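
A rough Python sketch of the idea, not the actual C++ implementation: `StringPool` and this Python `InternedString` class are hypothetical stand-ins. The pool owns one object per distinct string, so equality becomes an identity check, and there is no implicit conversion to or from a plain string:

```python
class InternedString:
    """Hypothetical wrapper; equality is identity, not a character compare."""

    __slots__ = ("_text",)

    def __init__(self, text):
        self._text = text

    def __eq__(self, other):
        if not isinstance(other, InternedString):
            # No automatic conversion from plain strings.
            return NotImplemented
        return self is other

    def __hash__(self):
        return id(self)

    def text(self):
        # Converting back to a plain string is always explicit.
        return self._text


class StringPool:
    """Hypothetical owner of the interned strings (one pool per module)."""

    def __init__(self):
        self._table = {}

    def get(self, text):
        interned = self._table.get(text)
        if interned is None:
            interned = InternedString(text)
            self._table[text] = interned
        return interned


pool = StringPool()
a = pool.get("x")
b = pool.get("x")
c = pool.get("y")
print(a is b, a == c, a == "x")
```

Because mixing the two types never compares equal silently, every place that needs a plain string has to call `text()` explicitly, which mirrors the point above about making string copies visible.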

325dbfeb

02 Feb, 2015 2 commits

Remove function versions that fail their speculations · b4094e4e

Kevin Modzelewski authored 9 years ago

The goal is to not continually call functions that deopt every time,
since the deopt is expensive.

Right now the threshold is simple: if a function deopts 4 (configurable)
times, then mark that function version as invalid and force a recompilation
on the next call.
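
The threshold rule can be sketched as follows. This is an illustrative Python model only; `FunctionVersion`, `record_deopt` and `version_for_call` are hypothetical names, not Pyston's API:

```python
DEOPT_THRESHOLD = 4  # configurable; 4 is the value mentioned above


class FunctionVersion:
    """Hypothetical stand-in for one compiled, speculative function version."""

    def __init__(self, name):
        self.name = name
        self.deopt_count = 0
        self.valid = True


def record_deopt(version, threshold=DEOPT_THRESHOLD):
    """Count one deopt; invalidate the version once the threshold is hit."""
    version.deopt_count += 1
    if version.deopt_count >= threshold:
        version.valid = False
    return version.valid


def version_for_call(version, recompile):
    """Return a usable version, recompiling if the current one was invalidated."""
    if not version.valid:
        return recompile(version.name)
    return version


v = FunctionVersion("f")
results = [record_deopt(v) for _ in range(4)]
fresh = version_for_call(v, lambda name: FunctionVersion(name))
print(results, fresh is v)
```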

b4094e4e

Basic (new) deopt support · 25ac9de4

Kevin Modzelewski authored 9 years ago

Old deopt worked by compiling two copies of every BB, one with
speculations and one without, and stitching the two together.
This has a number of issues:
- doubles the amount of code LLVM has to jit
- can't ever get back on the optimized path
- doesn't support 'deopt if branch taken'
- horrifically complex
- doesn't support deopt from within try blocks

We actually ran into that last issue (see test from previous commit).  So
rather than wade in and try to fix old-deopt, just start switching to new-deopt.

(new) deopt works by using the frame introspection features, gathering up all
the locals, and passing them to the interpreter.

25ac9de4

29 Jan, 2015 1 commit
- Simple 'subprocess' support · dc9d0adc
  Kevin Modzelewski authored 9 years ago
  
  dc9d0adc
27 Jan, 2015 1 commit
- Mark enumerate_cls as safe for type call rewriting · 9ec28ecd
  Marius Wachtler authored 9 years ago
  
  Reduces slowpath_typecall from 40000 to 30 for spectral_norm
  
  9ec28ecd
24 Jan, 2015 1 commit

Implement Python's name mangling · 636aca17

Kevin Modzelewski authored 9 years ago

i.e., names that begin with two underscores but don't end in two underscores have
the classname added to them.

Do this early on in the pipeline so that all the analyses operate post-mangling.

The implementation is kind of hacky and I couldn't think of a good way to make it
super systematic; there may be more cases I missed.
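
For reference, this is the language rule being implemented, shown in plain Python (the `Account` class is a made-up example):

```python
class Account:
    def __init__(self):
        self.__balance = 10       # mangled: stored as _Account__balance
        self.__dunder__ = "kept"  # ends in two underscores: not mangled

    def balance(self):
        # Inside the class body, __balance is rewritten to _Account__balance.
        return self.__balance


acct = Account()
names = sorted(vars(acct))
print(names)
```

Outside the class, the attribute is only reachable under its mangled name, `acct._Account__balance`.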

636aca17

22 Jan, 2015 7 commits
- Fix a regression -- __dict__ should be checked before calling __getattr__ · 3e4896eb
  Kevin Modzelewski authored 9 years ago
  
  3e4896eb
- I think this test is still working fine, but needs to have one of the limits bumped · b2a5d2b3
  Kevin Modzelewski authored 9 years ago
  
  b2a5d2b3
- Add support for dir looking at a class's bases · b68b8da7
  Kevin Modzelewski authored 9 years ago
  
  b68b8da7
- Basic collections support · 6abd5698
  Kevin Modzelewski authored 10 years ago
  
  6abd5698
- Don't set __str__ to __repr__, that breaks subclasses · 7aaaf4bf
  Kevin Modzelewski authored 10 years ago
  
  7aaaf4bf
- Add continue/break out of finally, sys.exc_clear · 28f055ef
  Kevin Modzelewski authored 9 years ago
  
  28f055ef
- Very basic tryfinally support · dc7a6171
  Kevin Modzelewski authored 10 years ago
  
  Doesn't support continue/break/return yet
  
  dc7a6171
21 Jan, 2015 1 commit

Change from throwing a Box* to an ExcInfo triple · 44b63a61

Kevin Modzelewski authored 10 years ago

ExcInfo is a triple of exc_type, exc_value, exc_traceback -
analogous to Python's sys.exc_info().  Previously, we were just
throwing exc_value.

I still don't understand all the rules for when type(exc_value)
is not necessarily exc_type.  But this also makes it easier to
pass exc_traceback around, and should make it possible to make
our handling more consistent.

This commit just changes the runtime; the generated code currently
still expects a Box* to be thrown and will crash.
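
As a reminder of the Python-level triple that ExcInfo mirrors, here is a short Python 3 example (the helper `current_exc_info` is hypothetical). In this simple case the type of the value matches `exc_type`; the cases where they differ are not covered here:

```python
import sys


def current_exc_info():
    try:
        raise KeyError("missing")
    except KeyError:
        # (exc_type, exc_value, exc_traceback) for the exception being handled
        return sys.exc_info()


exc_type, exc_value, exc_traceback = current_exc_info()
print(exc_type.__name__, repr(exc_value), exc_traceback is not None)
```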

44b63a61

11 Jan, 2015 2 commits

Make separate getiter() and getPystonIter() · 78dfdfed

Kevin Modzelewski authored 10 years ago

Sometimes you want the wrapping behavior and sometimes you don't --
the builtin iter() function was calling getiter, but shouldn't
be doing this wrapping.

78dfdfed

Support StopIteration-based for loop iteration · a14ebbec

Kevin Modzelewski authored 10 years ago

At runtime, we can detect if something supports the new Pyston iteration
protocol (__hasnext__) or not.

Statically, when we lower a for loop, we assume that the iterator supports
the new protocol, and we can't check it.

This commit adds a check to getiter() which wraps a Python-style iterator
with an IterWrapper which adds the __hasnext__ method.
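
A minimal Python sketch of what such a wrapper could look like. The prefetching strategy, the `next` method name, and the helpers `getiter_with_wrapper` and `run_for_loop` are assumptions for illustration, not the real implementation:

```python
class IterWrapper:
    """Adapts a StopIteration-based iterator to a __hasnext__-style protocol."""

    def __init__(self, iterator):
        self._iterator = iterator
        self._buffered = []      # holds at most one prefetched value
        self._exhausted = False

    def __hasnext__(self):
        if self._buffered:
            return True
        if self._exhausted:
            return False
        try:
            # Prefetch one value so we can answer without raising.
            self._buffered.append(next(self._iterator))
        except StopIteration:
            self._exhausted = True
            return False
        return True

    def next(self):
        if not self.__hasnext__():
            raise StopIteration
        return self._buffered.pop()


def getiter_with_wrapper(obj):
    """Hypothetical helper: wrap only iterators that lack __hasnext__."""
    it = iter(obj)
    if hasattr(it, "__hasnext__"):
        return it
    return IterWrapper(it)


def run_for_loop(obj):
    """How a lowered for loop might drive the protocol."""
    out = []
    it = getiter_with_wrapper(obj)
    while it.__hasnext__():
        out.append(it.next())
    return out


print(run_for_loop([1, 2, 3]), run_for_loop(x * x for x in range(3)))
```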

a14ebbec

10 Jan, 2015 3 commits

Add the NONZERO bytecode and generate it appropriately · 4b36c835
Kevin Modzelewski authored 10 years ago

4b36c835
Fall back on __len__ if __nonzero__ doesn't exist · 7c751f48
Kevin Modzelewski authored 10 years ago

7c751f48
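
A small illustration of the fallback named in this commit title. It is written for Python 3, which spells the truth hook `__bool__` where Python 2 uses `__nonzero__`; the `Bag` class is hypothetical:

```python
class Bag:
    """Defines __len__ only, so truth testing falls back to the length."""

    def __init__(self, items):
        self.items = list(items)

    def __len__(self):
        return len(self.items)


empty_is_true = bool(Bag([]))
full_is_true = bool(Bag([1, 2]))
print(empty_is_true, full_is_true)
```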

Support extension classes that specify tp_dictoffset · 5b2747b1

Kevin Modzelewski authored 10 years ago

I think there are a couple ways we can go about this.  It looks like
extensions just specify tp_dictoffset if they want their instances
to have instance attributes, but don't usually actually look at the
dict or really make use of the fact that it's a real dict.  So possibly,
we have the ability to stuff a non-dict into the slot they reserved for
us.

Seems risky though.  Instead, the approach in this commit is to create
an actual dict and put it in like they expect.  All the attribute-accessing
paths have been updated to look at both the fast Pyston HCAttrs method, and
the CAPI dict method.

5b2747b1