Commits · 95fdc01d688ddc993a98ebc396f7e94f1feba148 · nexedi / neoppod

16 Jan, 2017 3 commits

Increase connector buffer size for receiving · 95fdc01d

Julien Muchembled authored Jan 16, 2017

When receiving 1 byte, benchmarking shows no visible difference with values
between 4096 and 65536 for the buffer size. With higher values, it becomes
significantly slower.

On the other side, a 64k buffer is faster with bigger packets.
Time to run testBasicStore with MySQL:

            4096        65536
    real    0m51.115s   0m21.592s
    user    0m41.857s   0m13.540s
    sys     0m8.700s    0m2.687s

95fdc01d

qa: fix test_max_allowed_packet (testStorageMySQL) · b4317e4e
Julien Muchembled authored Jan 16, 2017
```
It started to fail with commit fd007f5d.
```
b4317e4e
qa: remove useless monkey-patch for checkMultipleUndoInOneTransaction · 9fcff73e
Julien Muchembled authored Jan 16, 2017
```
It should have been removed with commit cf32e594
(see also c277ed20 and related commits).
```
9fcff73e

13 Jan, 2017 4 commits
- client: remove unused Storage.cleanup · 522a142d
  Julien Muchembled authored Jan 13, 2017
  
  522a142d
- Move mock.py to neo/tests to not conflict with https://pypi.python.org/pypi/mock · 358da254
  Julien Muchembled authored Jan 13, 2017
  
  358da254
- qa: fix testImporter when run several times with --loop · 3e5299cb
  Julien Muchembled authored Jan 12, 2017
  
  3e5299cb
- qa: fix tests with recent ZODB/transaction · 8d21aaee
  Julien Muchembled authored Jan 13, 2017
  
  8d21aaee
12 Jan, 2017 1 commit

qa: make closure of NEOCluster more reliable in treaded tests · e2183483

Julien Muchembled authored Jan 12, 2017

Instances of NEOCluster were not deleted as soon as the only referrers were
weak proxies (at least that's what a quick check with the 'gc' module showed
at the beginning of tearDown). In some cases, __del__ was called while the next
test was logging a message, which led to deadlocks.

Without those proxies, it may be reliable, but only on CPython. See
http://doc.pypy.org/en/latest/cpython_differences.html#differences-related-to-garbage-collection-strategies

Relying on __del__ to close a cluster was wrong. NEOCluster is now a context
manager that does it explicitly at exit, in addition to automatically stop it.
The NEOCluster.stop method combines the previous stop/__del__/reset methods.

A new 'with_cluster' decorator is also added to avoid excessive indentation
in tests. Unindentation of existing tests will be done later.

e2183483

11 Jan, 2017 3 commits
- qa: add threaded test helper to filter connection by packet type · 1d738521
  Julien Muchembled authored Jan 11, 2017
  
  1d738521
- qa: always start a new threaded test with a empty log file · 7dc8d4db
  Julien Muchembled authored Jan 11, 2017
```
This is important when using --loop, otherwise tearDown is slower and slower
at removing packets from the log.
```
  7dc8d4db
- qa: more reliable use of extra clients in threaded tests · aac495e4
  Julien Muchembled authored Jan 11, 2017
```
testExternalInvalidation is splitted to minimize reindentation.
```
  aac495e4
09 Jan, 2017 1 commit
- master: fix AttributeError at exit during recovery · 82027ac9
  Julien Muchembled authored Jan 09, 2017
  
  82027ac9
06 Jan, 2017 3 commits
- qa: fix a race condition when using connection filters in threaded tests · b66c3fd3
  Julien Muchembled authored Jan 06, 2017
  
  b66c3fd3
- qa: in threaded tests, detect another case where a bug could call tic() endlessly · 600cc94a
  Julien Muchembled authored Jan 06, 2017
  
  600cc94a
- qa: new --loop runner option · 744dbf5f
  Julien Muchembled authored Jan 06, 2017
  
  744dbf5f
04 Jan, 2017 2 commits

qa: do more checks on the partition table in testReplicationBlockedByUnfinished · 00ea89c3
Julien Muchembled authored Jan 04, 2017

00ea89c3

qa: rewrite testReplicationBlockedByUnfinished as a threaded test · d3cb8888

Julien Muchembled authored Jan 04, 2017

It is extended to check that the storage is only notified about the
transactions that existed at the time it asked for them. Otherwise,
Replicator.transactionFinished would be called more than once, and
`self.ttid_set.remove(ttid)` would raise KeyError.

The functional version also contained an annoying 'sleep(10)'.

d3cb8888

03 Jan, 2017 1 commit
- qa: rewrite test checking read-locks · 5b66a6a7
  Julien Muchembled authored Jan 03, 2017
  
  5b66a6a7
30 Dec, 2016 1 commit
- qa: fix unclean test shutdown after use of ConnectionFilter · 327bb1c0
  Julien Muchembled authored Dec 28, 2016
```
Leaks in filter_queue caused deadlocks in the following threaded tests that
filter connections.
```
  327bb1c0
28 Dec, 2016 6 commits
- qa: code cleanup in non-threaded -u tests · 59004b8c
  Julien Muchembled authored Dec 28, 2016
  
  59004b8c
- Drop ability to build packets with keyword arguments · bcf4afa0
  Julien Muchembled authored Dec 28, 2016
```
The removed tests only covered this.
```
  bcf4afa0
- qa: rewrite test_answerStoreObject_3 as a threaded test · 0e06d906
  Julien Muchembled authored Dec 27, 2016
```
The removed test_answerStoreObject_{1,2} only covered the
'raise NEOStorageError', which is already an assertion.
```
  0e06d906
- qa: rewrite tools/coverage-html in Python to make -d option usable · 4a52bd98
  Julien Muchembled authored Dec 28, 2016
  
  4a52bd98
- qa: new --cov-unit runner option · 7a75daa9
  Julien Muchembled authored Dec 28, 2016
  
  7a75daa9
- qa: extend runner arguments to filter tests · 0ae3482b
  Julien Muchembled authored Dec 27, 2016
  
  0ae3482b
27 Dec, 2016 1 commit
- Replace --prune-orphan storage option with a command that can be used in RUNNING state · f2796d9c
  Julien Muchembled authored Dec 27, 2016
  
  f2796d9c
26 Dec, 2016 5 commits
- storage: code refactoring for backends to connect to the database · fd007f5d
  Julien Muchembled authored Dec 26, 2016
  
  fd007f5d
- storage: remove short option -R to reset the db, clarify that --reset exits once done · ccbf7bce
  Julien Muchembled authored Dec 26, 2016
  
  ccbf7bce
- storage: new --prune-orphan option · dd1d6b30
  Julien Muchembled authored Dec 26, 2016
  
  dd1d6b30
- storage: fix abort before vote, to free the storage space used by the transaction · 58d0b602
  Julien Muchembled authored Dec 23, 2016
  
  58d0b602
- importer: at startup, do not connect twice to the destination database · d87df377
  Julien Muchembled authored Dec 26, 2016
  
  d87df377
23 Dec, 2016 1 commit
- storage: simplify Transaction API · a8f9fedb
  Julien Muchembled authored Dec 23, 2016
  
  a8f9fedb
22 Dec, 2016 1 commit
- Fix replication of unfinished imported transactions · 87eee431
  Julien Muchembled authored Dec 22, 2016
  
  87eee431
21 Dec, 2016 3 commits

storage: start replicating the partition which is furthest behind · 4d3f3723

Julien Muchembled authored Dec 21, 2016

This fixes the following case when the backup is far behing the upstream DB,
and there are transactions being committed at the same time:

1. replicate partition 0
2. replicate partition 0
3. replicate partition 1
4. replicate partition 0
5. replicate partition 1
6. replicate partition 2
7. replicate partition 0
...
and so on in a quadratic way.

When the upstream activity was too high, the backup could even be stuck looping
on the first partitions.

4d3f3723

master: fix possibly wrong knowledge of cells' backup_tid when resuming backup · 17af3b47

Julien Muchembled authored Dec 20, 2016

The issue happens when there were commits while the backup cluster was down.
In this case, the master thinks that these commits are already replicated,
reporting wrong backup_tid to neoctl. It solved by itself once:
- there are new commits triggering replication for all partitions;
- all storage nodes have really replicated.

This also resulted in an inconsistent database when leaving backup mode during
this period.

17af3b47

Minor comment/doc changes · c95c6c39
Julien Muchembled authored Dec 20, 2016

c95c6c39

20 Dec, 2016 1 commit
- Release version 1.7.0 · 37f58489
  Julien Muchembled authored Dec 19, 2016
  
  37f58489
06 Dec, 2016 2 commits

master,client: ignore notifications before complete initialization · 36b2d141

Julien Muchembled authored Dec 06, 2016

A backup master crashed with the following traceback after a reconnection:

    Traceback (most recent call last):
      File "neo/master/app.py", line 127, in run
        self._run()
      File "neo/master/app.py", line 147, in _run
        self.playPrimaryRole()
      File "neo/master/app.py", line 348, in playPrimaryRole
        self.backup_app.provideService())
      File "neo/master/backup_app.py", line 123, in provideService
        poll(1)
      File "neo/lib/event.py", line 126, in poll
        to_process.process()
      File "neo/lib/connection.py", line 500, in process
        self._handlers.handle(self, self._queue.pop(0))
      File "neo/lib/connection.py", line 110, in handle
        self._handle(connection, packet)
      File "neo/lib/connection.py", line 125, in _handle
        handler.packetReceived(connection, packet)
      File "neo/lib/handler.py", line 117, in packetReceived
        self.dispatch(*args)
      File "neo/lib/handler.py", line 66, in dispatch
        method(conn, *args, **kw)
      File "neo/master/handlers/backup.py", line 52, in invalidateObjects
        app.invalidatePartitions(tid, partition_set)
      File "neo/master/backup_app.py", line 257, in invalidatePartitions
        self.triggerBackup(node)
      File "neo/master/backup_app.py", line 281, in triggerBackup
        assert cell_list, offset
    AssertionError: 0

36b2d141

Update comment that was still showing UUIDs instead of node ids · 02292584
Julien Muchembled authored Dec 06, 2016

02292584

01 Dec, 2016 1 commit
- Remove dead code found by coverage · 23b9544d
  Julien Muchembled authored Dec 01, 2016
  
  23b9544d