Commits · 412730c4cfd57f7da6387d0af7faf5e6cd2970c0 · Boxiang Sun / Pyston

03 Jun, 2015 3 commits
- Merge pull request #578 from toshok/getattr-default-exc · 412730c4
  Kevin Modzelewski authored Jun 03, 2015
```
for getattrFunc use getattrInternal instead of getattr
```
  412730c4
- Merge pull request #549 from tjhance/float_pow · 2be06383
  Kevin Modzelewski authored Jun 03, 2015
```
Float pow
```
  2be06383
- for getattrFunc use getattrInternal instead of getattr · 1603542c
  Chris Toshok authored Apr 21, 2015
```
getattr throws an exception if the attribute is not present.  getattrFunc already throws the same exception (if there isn't a default value passed in).
```
  1603542c
02 Jun, 2015 5 commits

Merge pull request #574 from kmod/linkorder · 1eb99619
Kevin Modzelewski authored Jun 02, 2015
```
Add a section-ordering script
```
1eb99619
Add some more 'hot functions' · d8e5b7f6
Kevin Modzelewski authored Jun 02, 2015

d8e5b7f6

Add a section-ordering script · df5d6491

Kevin Modzelewski authored Jun 02, 2015

I think this lets us specify that certain functions should be
put together at the end of the text segment.

This is inspired by a similar feature of HHVM's build, though the goal
for us for now is just to improve performance consistency rather
than overall performance.  Hopefully soon/eventually we can do
profile-guided sorting like they do.

df5d6491

Merge pull request #575 from kmod/libunwind_build · c4c58d0d
Kevin Modzelewski authored Jun 01, 2015
```
Fix some issues with the way we build libunwind
```
c4c58d0d

Fix some issues with the way we build libunwind · c4063b2e

Kevin Modzelewski authored Jun 02, 2015

First, that we would only apply our patchset once. If we ever revert the
patches (I'm not sure under what conditions that happens), we previously would
never apply them again. Attempted to fix this by adding a special patch that
adds a new file that CMake looks for; if the file doesn't exist, cmake runs the
patches again.

Second, that we didn't rebuild libunwind if we apply new patches.
I'm not sure if there's a good general solution to this, but I was able
to figure out how to force libunwind to rebuild if we need to rerun the
patch command. It took some hacking since CMake doesn't track dependencies
on external projects, so we have to add some custom dependencies.

c4063b2e

01 Jun, 2015 10 commits
- Merge pull request #572 from toshok/cmake-redundant-flag · 93e83aad
  Kevin Modzelewski authored Jun 01, 2015
```
remove redundant -Wno-sign-cmopare
```
  93e83aad
- Merge pull request #573 from kmod/tp_hash · 6b0bb5ca
  Kevin Modzelewski authored Jun 01, 2015
```
Switch hash to using tp_hash
```
  6b0bb5ca
- remove redundant -Wno-sign-cmopare · c8197c8b
  Chris Toshok authored Jun 01, 2015
  
  c8197c8b
- Switch hash to using tp_hash · 9522374b
  Kevin Modzelewski authored May 29, 2015
```
We weren't even doing any rewriting for hash, so there's not much
downside.  This also cuts down on boxing quite a bit since
we can usually avoid boxing the hash value.
```
  9522374b
- Merge pull request #571 from kmod/perf_investigations · 3d7cadd3
  Kevin Modzelewski authored Jun 01, 2015
```
Perf investigations
```
  3d7cadd3
- Some microbenchmarks I'm looking at · 5e3f1985
  Kevin Modzelewski authored May 29, 2015
```
Coming from looking into regex performance; re_compile is reduced
from django-template startup, and dict_hashing_ubench is reduced
from that.
```
  5e3f1985
- Change log colors for different JIT levels · 44a1a9b5
  Kevin Modzelewski authored May 29, 2015
```
Also, quiet some debug output
```
  44a1a9b5
- add argument parsing to process_stackmap.py · 1ee6db5b
  Kevin Modzelewski authored May 29, 2015
  
  1ee6db5b
- Merge pull request #517 from toshok/fewer-string-allocations · bf75c321
  Kevin Modzelewski authored Jun 01, 2015
```
use llvm::StringRef instead of std::string typeNew
```
  bf75c321
- use llvm::StringRef instead of char*'s. · d5389981
  Chris Toshok authored May 11, 2015
```
turns out we allocate/free the same std::strings for every slot name, for every typeNew.
instead do it once, when we create the slotdefs array.  Also, use llvm::StringRefs instead of
std::strings since we already have them in setattrGeneric (the other caller of update_slot.)
```
  d5389981
29 May, 2015 11 commits
- Merge pull request #567 from undingen/bom · c25abcd9
  Kevin Modzelewski authored May 29, 2015
```
Add UTF8-BOM support, int.bit_length, function.func_doc, fix '(-1)**0'
```
  c25abcd9
- Add int.bit_length, function.func_doc, fix '(-1)**0' · 4b40c012
  Marius Wachtler authored May 29, 2015
  
  4b40c012
- Support UTF-8 BOMs inside source files · a58e93a3
  Marius Wachtler authored May 29, 2015
  
  a58e93a3
- Merge pull request #566 from kmod/sampling_prof2 · 153fd1eb
  Kevin Modzelewski authored May 29, 2015
```
Python-level sampling profiler
```
  153fd1eb
- Minor updates to the process script · cee63186
  Kevin Modzelewski authored May 29, 2015
  
  cee63186
- Add more safepoints if the sampling profiler is on · ea0a21e1
  Kevin Modzelewski authored May 29, 2015
  
  ea0a21e1
- Python-level sampling profiler · 7596e61a
  Kevin Modzelewski authored May 29, 2015
```
Uses setitimer() to set a recurring signal, and prints
a Python stacktrace at the next safepoint aka allowGLReadPreemption.
This is not great since allowGLReadPreemption can happen a decent
amount later than the signal.  (I'll play around with trying to get
the signal to be acted on sooner, but it might be better to wait
for full signal-handling support.)

Still, it seems to provide some decent high-level info.  For example,
half of the startup time of the django-template benchmark seems to be
due to regular expressions.
```
  7596e61a
- Merge pull request #565 from undingen/self_assign · d618cd63
  Kevin Modzelewski authored May 28, 2015
```
Handle list self assignment during slicing
```
  d618cd63
- Merge pull request #563 from kmod/tests · 42c7b2a3
  Kevin Modzelewski authored May 28, 2015
```
Add some parsing tests
```
  42c7b2a3
- Handle list self assignment during slicing · b19bda43
  Marius Wachtler authored May 29, 2015
  
  b19bda43
- Add some parsing tests · 722cd847
  Kevin Modzelewski authored May 28, 2015
```
- byte order marker
- newline between decorator and its function
- strings with size >64k [including a fix]

also, a decimal.Decimal test
```
  722cd847
28 May, 2015 11 commits

Merge pull request #560 from undingen/lxml2 · 7e1ff6cb
Kevin Modzelewski authored May 28, 2015
```
Add PyErr_GetExcInfo, make __builtins__ more similar to cpython and id() output more useful
```
7e1ff6cb
Merge pull request #562 from kmod/stringref · aada26f8
Kevin Modzelewski authored May 28, 2015
```
Convert runtime functions to take llvm::StringRef
```
aada26f8
Add Py_GetVersion and a import test for decorator and oauth2client · 711022ba
Marius Wachtler authored May 28, 2015

711022ba
Set __doc__ inside exec statements · 06d0dca9
Marius Wachtler authored May 28, 2015

06d0dca9
Fix the old make build: add threadmodule.c · 51eb2a56
Marius Wachtler authored May 28, 2015

51eb2a56
Add PyErr_GetExcInfo, make __builtins__ more similar to cpython · ee25dc67
Marius Wachtler authored May 27, 2015

ee25dc67

Comment out some dead code we don't support · 7a45efac

Kevin Modzelewski authored May 28, 2015

This only happens to me on the gcc build, and it apparently doesn't
happen on travis-ci.  Not exactly sure why that would cause the code
to get linked or not, but anyway it's stuff we're not using right now
so just ifdef it out.

7a45efac

Try to make weakref4 more robust · f456378f
Kevin Modzelewski authored May 28, 2015

f456378f
Simple tool to help with multiple perf runs · 087316e2
Kevin Modzelewski authored May 28, 2015

087316e2

Convert many runtime functions to take llvm::StringRef · ef27d6cb

Kevin Modzelewski authored May 28, 2015

Should hopefully cut down on allocations to pass around
'const std::string&' objects (since we don't always store things
as std::strings anymore), or to calls to strlen if we pass around
const char*s.

Haven't looked yet at the calls that we embed in the llvm IR.

Here are the perf results:
pyston django_migrate.py : 2.3s baseline: 2.3 (-1.7%)
pyston django-template.py : 15.1s baseline: 15.4 (-1.6%)
pyston interp2.py : 5.3s baseline: 6.3 (-15.1%)
pyston raytrace.py : 6.1s baseline: 6.2 (-0.7%)
pyston nbody.py : 8.4s baseline: 8.1 (+4.1%)
pyston fannkuch.py : 7.5s baseline: 7.5 (+0.2%)
pyston chaos.py : 20.2s baseline: 20.0 (+0.7%)
pyston fasta.py : 5.4s baseline: 5.4 (+0.3%)
pyston pidigits.py : 5.7s baseline: 5.7 (+0.0%)
pyston richards.py : 2.5s baseline: 2.7 (-6.2%)
pyston deltablue.py : 1.8s baseline: 1.8 (-0.0%)
pyston (geomean-3424) : 5.7s baseline: 5.8 (-2.0%)

I looked into the regression in nbody.py, and it is in an unrelated piece of
code (list unpacking) that has the same assembly and gets called the same number
of times. Maybe there's some weird cache collision. It's an extremely small
benchmark (a single 13-line loop) so I'm happy to write it off as microbenchmark
sensitivity. We can also optimize this if we want to; we could speculate on the
type that we are unpacking and inline the parts of the unpacking code we need.

ef27d6cb

Merge pull request #559 from kmod/threading_local4 · 925c13d5
Kevin Modzelewski authored May 27, 2015
```
switch to CPython's thread._local implementation
```
925c13d5