Commits · 28df51087294e95f041799cccafcec809bb15603 · Boxiang Sun / Pyston

04 Jun, 2015 1 commit
- Move the static computation out of the hot areas; thought this would help more · 28df5108
  Kevin Modzelewski authored Jun 03, 2015
  
  28df5108
03 Jun, 2015 16 commits

Move a bunch of stattimer stuff into header files · 4bde2afd

Kevin Modzelewski authored Jun 03, 2015

This roughly halves stattimer overhead, from +80% to +40%
(on a very small benchmark; more complex benchmarks probably
have lower overhead)

4bde2afd

more benchmarks · 4411108b
Kevin Modzelewski authored Jun 03, 2015

4411108b

Remove pyston_gcc from `make check` · 6a6d049c

Kevin Modzelewski authored Jun 03, 2015

I don't think we regularly run make check anymore, and it's a pain
to make sure the user has an up-to-date gcc (would need to come up
with a good set of instructions for anyone on 12.04).  So just remove
it from make check for now.

6a6d049c

Merge pull request #577 from kmod/tp_richcompare · b8cd6e68
Kevin Modzelewski authored Jun 03, 2015
```
tp_richcompare
```
b8cd6e68

Bump integration test timeout · cd6654bb

Kevin Modzelewski authored Jun 03, 2015

virtualenv_test currently takes ~7m for me.  we should probably move
most of the stuff it does to the "extra" tests, but for now at least
make `make check` not fail.

cd6654bb

Minor issue in libunwind binary search · f97bfec2
Kevin Modzelewski authored Jun 03, 2015

f97bfec2

Convert several classes to tp_richcompare · 9cb2b9f1

Kevin Modzelewski authored Jun 02, 2015

int, long, str, tuple, type

int and long are implemented using tp_compare in CPython,
which is the old-style comparison method.  I don't really understand
its semantics which rely on type coercion, and we don't have the
methods it needs, so just implement it as tp_richcompare for now.
I think this is still an overall compatibility improvement.

str_richcompare is very odd where we have to do some weird things
to convince the compiler to produce the best code it can.

9cb2b9f1

Switch to calling tp_richcompare · 0554a034

Kevin Modzelewski authored Jun 02, 2015

The notable places that are changed are PyEq, PyLt, and
compare/compareInternal.

The old codepaths are still in there (though thankfully
now with reduced duplication with cpython), since for anything
that defines a Python-level __lt__, it's better to call that
with our rewriting support, rather than calling it through
slot_tp_richcompare.

The control flow is kind of messy, since I don't think we
know what the right long-term organization is for these kinds of
things.  But basically, it's:
- if we can't rewrite, just call the C slot
- if we can rewrite, and we think calling tp_richcompare is profitable
  (ie it's not slot_tp_richcompare), we call that and emit a call to
  it in the patchpoint
- otherwise, we try calling the python attribute

Actual conversion of our attributes to tp_richcompare in the next commit.

0554a034

Merge pull request #579 from kmod/string_allocs · 24140771
Kevin Modzelewski authored Jun 03, 2015
```
Reduce string allocations
```
24140771
Minor compareInternal optimization · abd87d35
Kevin Modzelewski authored Jun 03, 2015

abd87d35

Get rid of boxString overloads · c44e4dab

Kevin Modzelewski authored Jun 03, 2015

and have the compiler pick the best way to convert to StringRef

I was running into some cases where we had StringRefs but would call
boxString which takes an std::string.  So hopefully this change
makes things cleaner and (slightly) faster.

c44e4dab

Switch some more runtime functions to taking StringRefs · a2fe70c2

Kevin Modzelewski authored Jun 03, 2015

The motivating one was classLookup(), since this happened
extremely frequently (once for every old-style instance
lookup), but I decided to go through and get some others.

a2fe70c2

I guess sre_parse.parse is slow · c510bb9e
Kevin Modzelewski authored Jun 03, 2015

c510bb9e
Merge pull request #578 from toshok/getattr-default-exc · 412730c4
Kevin Modzelewski authored Jun 03, 2015
```
for getattrFunc use getattrInternal instead of getattr
```
412730c4
Merge pull request #549 from tjhance/float_pow · 2be06383
Kevin Modzelewski authored Jun 03, 2015
```
Float pow
```
2be06383

for getattrFunc use getattrInternal instead of getattr · 1603542c

Chris Toshok authored Apr 21, 2015

getattr throws an exception if the attribute is not present.  getattrFunc already throws the same exception (if there isn't a default value passed in).

1603542c

02 Jun, 2015 5 commits

Merge pull request #574 from kmod/linkorder · 1eb99619
Kevin Modzelewski authored Jun 02, 2015
```
Add a section-ordering script
```
1eb99619
Add some more 'hot functions' · d8e5b7f6
Kevin Modzelewski authored Jun 02, 2015

d8e5b7f6

Add a section-ordering script · df5d6491

Kevin Modzelewski authored Jun 02, 2015

I think this lets us specify that certain functions should be
put together at the end of the text segment.

This is inspired by a similar feature of HHVM's build, though the goal
for us for now is just to improve performance consistency rather
than overall performance.  Hopefully soon/eventually we can do
profile-guided sorting like they do.

df5d6491

Merge pull request #575 from kmod/libunwind_build · c4c58d0d
Kevin Modzelewski authored Jun 01, 2015
```
Fix some issues with the way we build libunwind
```
c4c58d0d

Fix some issues with the way we build libunwind · c4063b2e

Kevin Modzelewski authored Jun 02, 2015

First, that we would only apply our patchset once. If we ever revert the
patches (I'm not sure under what conditions that happens), we previously would
never apply them again. Attempted to fix this by adding a special patch that
adds a new file that CMake looks for; if the file doesn't exist, cmake runs the
patches again.

Second, that we didn't rebuild libunwind if we apply new patches.
I'm not sure if there's a good general solution to this, but I was able
to figure out how to force libunwind to rebuild if we need to rerun the
patch command. It took some hacking since CMake doesn't track dependencies
on external projects, so we have to add some custom dependencies.

c4063b2e

01 Jun, 2015 10 commits
- Merge pull request #572 from toshok/cmake-redundant-flag · 93e83aad
  Kevin Modzelewski authored Jun 01, 2015
```
remove redundant -Wno-sign-cmopare
```
  93e83aad
- Merge pull request #573 from kmod/tp_hash · 6b0bb5ca
  Kevin Modzelewski authored Jun 01, 2015
```
Switch hash to using tp_hash
```
  6b0bb5ca
- remove redundant -Wno-sign-cmopare · c8197c8b
  Chris Toshok authored Jun 01, 2015
  
  c8197c8b
- Switch hash to using tp_hash · 9522374b
  Kevin Modzelewski authored May 29, 2015
```
We weren't even doing any rewriting for hash, so there's not much
downside.  This also cuts down on boxing quite a bit since
we can usually avoid boxing the hash value.
```
  9522374b
- Merge pull request #571 from kmod/perf_investigations · 3d7cadd3
  Kevin Modzelewski authored Jun 01, 2015
```
Perf investigations
```
  3d7cadd3
- Some microbenchmarks I'm looking at · 5e3f1985
  Kevin Modzelewski authored May 29, 2015
```
Coming from looking into regex performance; re_compile is reduced
from django-template startup, and dict_hashing_ubench is reduced
from that.
```
  5e3f1985
- Change log colors for different JIT levels · 44a1a9b5
  Kevin Modzelewski authored May 29, 2015
```
Also, quiet some debug output
```
  44a1a9b5
- add argument parsing to process_stackmap.py · 1ee6db5b
  Kevin Modzelewski authored May 29, 2015
  
  1ee6db5b
- Merge pull request #517 from toshok/fewer-string-allocations · bf75c321
  Kevin Modzelewski authored Jun 01, 2015
```
use llvm::StringRef instead of std::string typeNew
```
  bf75c321
- use llvm::StringRef instead of char*'s. · d5389981
  Chris Toshok authored May 11, 2015
```
turns out we allocate/free the same std::strings for every slot name, for every typeNew.
instead do it once, when we create the slotdefs array.  Also, use llvm::StringRefs instead of
std::strings since we already have them in setattrGeneric (the other caller of update_slot.)
```
  d5389981
29 May, 2015 8 commits
- Merge pull request #567 from undingen/bom · c25abcd9
  Kevin Modzelewski authored May 29, 2015
```
Add UTF8-BOM support, int.bit_length, function.func_doc, fix '(-1)**0'
```
  c25abcd9
- Add int.bit_length, function.func_doc, fix '(-1)**0' · 4b40c012
  Marius Wachtler authored May 29, 2015
  
  4b40c012
- Support UTF-8 BOMs inside source files · a58e93a3
  Marius Wachtler authored May 29, 2015
  
  a58e93a3
- Merge pull request #566 from kmod/sampling_prof2 · 153fd1eb
  Kevin Modzelewski authored May 29, 2015
```
Python-level sampling profiler
```
  153fd1eb
- Minor updates to the process script · cee63186
  Kevin Modzelewski authored May 29, 2015
  
  cee63186
- Add more safepoints if the sampling profiler is on · ea0a21e1
  Kevin Modzelewski authored May 29, 2015
  
  ea0a21e1
- Python-level sampling profiler · 7596e61a
  Kevin Modzelewski authored May 29, 2015
```
Uses setitimer() to set a recurring signal, and prints
a Python stacktrace at the next safepoint aka allowGLReadPreemption.
This is not great since allowGLReadPreemption can happen a decent
amount later than the signal.  (I'll play around with trying to get
the signal to be acted on sooner, but it might be better to wait
for full signal-handling support.)

Still, it seems to provide some decent high-level info.  For example,
half of the startup time of the django-template benchmark seems to be
due to regular expressions.
```
  7596e61a
- Merge pull request #565 from undingen/self_assign · d618cd63
  Kevin Modzelewski authored May 28, 2015
```
Handle list self assignment during slicing
```
  d618cd63