Commits · c1b5233ee2156fb3bdc417c6fc9a3020053e9b77 · nexedi / neoppod

26 Apr, 2019 7 commits

Make the number of replicas modifiable when the cluster is running · c1b5233e

Julien Muchembled authored Mar 27, 2019

neoctl gets a new command to change the number of replicas.

The number of replicas becomes a new partition table attribute and
like the PT id, it is stored in the config table. On the other side,
the configuration value for the number of partitions is dropped,
since it can be computed from the partition table, which is
always stored in full.

The -p/-r master options now only apply at database creation.

Some implementation notes:

- The protocol is slightly optimized in that the master now sends
  automatically the whole partition tables to the admin & client
  nodes upon connection, like for storage nodes.
  This makes the protocol more consistent, and the master is the
  only remaining node requesting partition tables, during recovery.

- Some parts become tricky because app.pt can be None in more cases.
  For example, the extra condition in NodeManager.update
  (before app.pt.dropNode) was added for this is the reason.
  Or the 'loadPartitionTable' method (storage) that is not inlined
  because of unit tests.
  Overall, this commit simplifies more than it complicates.

- In the master handlers, we stop hijacking the 'connectionCompleted'
  method for tasks to be performed (often send the full partition
  table) on handler switches.

- The admin's 'bootstrapped' flag could have been removed earlier:
  race conditions can't happen since the AskNodeInformation packet
  was removed (commit d048a52d).

c1b5233e

New --new-nid storage option for fast cloning · c9871056

Julien Muchembled authored Mar 21, 2019

It is often faster to set up replicas by stopping a node (and any
underlying database server like MariaDB) and do a raw copy of the
database (e.g. with rsync). So far, it required to stop the whole
cluster and use tools like 'mysql' or sqlite3' to edit:
- the 'pt' table in databases,
- the 'config.nid' values of the new nodes.

With this new option, if you already have 1 replica, you can set up
new replicas with such fast raw copy, and without interruption of
service. Obviously, this implies less redundancy during the operation.

c9871056

qa: fix 2 tests with ZODB5 · 69ea1a7e
Julien Muchembled authored Apr 26, 2019

69ea1a7e

qa: new tools/stress options to evaluate MySQL engines · 491f4c89

Julien Muchembled authored Apr 23, 2019

--kill-mysqld should be combined with something like -f .3 -r .1
to give storage nodes enough time to recover.
And also -D 0 to focus testing on the storage backend rather than NEO.

491f4c89

qa: provide a way to let tests start 1 mysqld per storage node · c11410ef
Julien Muchembled authored Apr 23, 2019

c11410ef
mysql: make 'user' actually optional in the DB connection string · 74ec44e3
Julien Muchembled authored Apr 23, 2019

74ec44e3
mysql: specify column families for RocksDB · 87c1de3b
Julien Muchembled authored Apr 17, 2019

87c1de3b

16 Apr, 2019 5 commits
- qa: add testIncremental (testImporter) test · aa7b654f
  Julien Muchembled authored Apr 09, 2019
  
  aa7b654f
- importer: fix hidden "maximum recursion depth exceeded" at startup · d5834ee9
  Julien Muchembled authored Apr 09, 2019
  
  d5834ee9
- importer: fix closure of ZODB, and also do it when the import is finished · c37bcfa3
  Julien Muchembled authored Apr 09, 2019
  
  c37bcfa3
- sqlite: fix resumption of migration to NEO with Importer · 6608a868
  Julien Muchembled authored Apr 09, 2019
  
  6608a868
- qa: fix a random failure in threaded tests · 989e9920
  Julien Muchembled authored Apr 06, 2019
```
This also reverts commit 442bb43a.
```
  989e9920
05 Apr, 2019 3 commits
- importer: speed up startup when the import is already finished · 26b1246a
  Julien Muchembled authored Apr 05, 2019
  
  26b1246a
- importer: fix replication (as source) once import is finished · 9d14ea1b
  Julien Muchembled authored Apr 05, 2019
```
This fixes up commit be839e92.
```
  9d14ea1b
- storage: fix DatabaseManager.getLastTID with max_tid · c58d4862
  Julien Muchembled authored Apr 05, 2019
  
  c58d4862
01 Apr, 2019 1 commit
- qa: remove 2 useless unit tests · b10cc750
  Julien Muchembled authored Mar 29, 2019
  
  b10cc750
21 Mar, 2019 2 commits

storage: allow the master to change our node id · 15369269
Julien Muchembled authored Mar 21, 2019
```
This is not used currently.
```
15369269

Rename --uuid command-line options into --nid · e8473a23

Julien Muchembled authored Mar 21, 2019

This breaks compatibily but it was mentionned from the beginning
that these options are only there for testing purpose.

TODO: rename all remaining occurrences of UUID into NID in the code

e8473a23

16 Mar, 2019 1 commit

importer: fix possible data loss on writeback · e387ad59

Julien Muchembled authored Mar 12, 2019

If the source DB is lost during the import and then restored from a backup,
all new transactions have to written back again on resume. It is the most
common case for which the writeback hits the maximum number of transactions
per partition to process at each iteration; the previous code was buggy in
that it could skip transactions.

e387ad59

11 Mar, 2019 3 commits
- Release version 1.11 · 48d936cb
  Julien Muchembled authored Mar 11, 2019
  
  48d936cb
- Fix short descriptions of neoctl & neomigrate in their headers · af2e209b
  Julien Muchembled authored Mar 11, 2019
  
  af2e209b
- Update copyright year · 342168cd
  Julien Muchembled authored Mar 11, 2019
  
  342168cd
26 Feb, 2019 2 commits

qa: new tool to stress-test NEO · 38e98a12

Julien Muchembled authored Oct 18, 2018

Example output:

    stress: yes (toggle with F1)
    cluster state: RUNNING
    last oid: 0x44c0
    last tid: 0x3cdee272ef19355 (2019-02-26 15:35:11.002419)
    clients: 2308, 2311, 2302, 2173, 2226, 2215, 2306, 2255, 2314, 2356 (+48)
            8m53.988s (42.633861/s)
    pt id: 4107
        RRRDDRRR
     0: OU......
     1: ..UO....
     2: ....OU..
     3: ......UU
     4: OU......
     5: ..UO....
     6: ....OU..
     7: ......UU
     8: OU......
     9: ..UO....
    10: ....OU..
    11: ......UU
    12: OU......
    13: ..UO....
    14: ....OU..
    15: ......UU
    16: OU......
    17: ..UO....
    18: ....OU..
    19: ......UU
    20: OU......
    21: ..UO....
    22: ....OU..
    23: ......UU

38e98a12

master: fix typo in comment · ce25e429
Julien Muchembled authored Oct 18, 2018

ce25e429

25 Feb, 2019 1 commit
- Fix error handling when setting up a listening connector · ce608653
  Julien Muchembled authored Feb 25, 2019
```
getAddress (via __repr__) raised EBADF on closed connectors.
```
  ce608653
31 Dec, 2018 7 commits
- Fix incomplete/incorrect mapping of node ids in logs · 1a070186
  Julien Muchembled authored Oct 18, 2018
```
In functional tests (or anything reusing this framework),
the mapping could be incorrect at the beginning of logs.
```
  1a070186
- Fix log corruption on rotation in multi-threaded applications (e.g. client) · 16fdb24d
  Julien Muchembled authored Dec 31, 2018
```
Corrupted logs cause neolog to fail with the following error:

  AttributeError: 'Log' object has no attribute 'uuid_str'
```
  16fdb24d
- sqlite: optimize storage of metadata · 243c1a0f
  Julien Muchembled authored Dec 31, 2018
```
This makes commit 3c7a3160
(storage: speed up reads by indexing 'obj' primarily by 'oid')
effective for SQLite.

The fake changes in test data are because we don't force upgrade
for this optimization.
```
  243c1a0f
- neolog: do not die when a table is corrupted · 49e7d17f
  Julien Muchembled authored Dec 20, 2018
  
  49e7d17f
- neolog: add support for zstd-compressed logs · ad379295
  Julien Muchembled authored Dec 23, 2018
  
  ad379295
- neolog: do not hardcode default value of -L option in help message · 4a96c8b6
  Julien Muchembled authored Dec 07, 2018
  
  4a96c8b6
- fixup! New log format to show node id (and optionally cluster name) in node column · af53946c
  Julien Muchembled authored Dec 23, 2018
```
Commit aa4d621d broke log rotation
and neolog sometimes failed to read in new format.
```
  af53946c
05 Dec, 2018 1 commit
- New log format to show node id (and optionally cluster name) in node column · aa4d621d
  Julien Muchembled authored Nov 25, 2018
```
neolog has new options: -N for old behaviour, and -C to show the cluster name.
```
  aa4d621d
21 Nov, 2018 4 commits

fixup! client: discard late answers to lockless writes · 8ef1ddba
Julien Muchembled authored Nov 09, 2018
```
Since commit 50e7fe52,
some code can be simplified.
```
8ef1ddba

client: fix race condition between Storage.load() and invalidations · a2e278d5

Julien Muchembled authored Nov 19, 2018

This fixes a bug that could manifest as follows:

Traceback (most recent call last):
File "neo/client/app.py", line 432, in load
self._cache.store(oid, data, tid, next_tid)
File "neo/client/cache.py", line 223, in store
assert item.tid == tid, (item, tid)
AssertionError: (<CacheItem oid='\x00\x00\x00\x00\x00\x00\x00\x01' tid='\x03\xcb\xc6\xca\xfd\xc7\xda\xee' next_tid='\x03\xcb\xc6\xca\xfd\xd8\t\x88' data='...' counter=1 level=1 expire=10000 prev=<...> next=<...>>, '\x03\xcb\xc6\xca\xfd\xd8\t\x88')

The big changes in the threaded test framework are required because we need to
reproduce a race condition between client threads and this conflicts with the
serialization of epoll events (deadlock).

a2e278d5

client: fix race condition in refcounting dispatched answer packets · 743026d5

Julien Muchembled authored Nov 16, 2018

This was found when stress-testing a big cluster. 1 client node was stuck:

  (Pdb) pp app.dispatcher.__dict__
  {'lock_acquire': <built-in method acquire of thread.lock object at 0x7f788c6e4250>,
  'lock_release': <built-in method release of thread.lock object at 0x7f788c6e4250>,
  'message_table': {140155667614608: {},
                    140155668875280: {},
                    140155671145872: {},
                    140155672381008: {},
                    140155672381136: {},
                    140155672381456: {},
                    140155673002448: {},
                    140155673449680: {},
                    140155676093648: {170: <neo.lib.locking.SimpleQueue object at 0x7f788a109c58>},
                    140155677536464: {},
                    140155679224336: {},
                    140155679876496: {},
                    140155680702992: {},
                    140155681851920: {},
                    140155681852624: {},
                    140155682773584: {},
                    140155685988880: {},
                    140155693061328: {},
                    140155693062224: {},
                    140155693074960: {},
                    140155696334736: {278: <neo.lib.locking.SimpleQueue object at 0x7f788a109c58>},
                    140155696411408: {},
                    140155696414160: {},
                    140155696576208: {},
                    140155722373904: {}},
  'queue_dict': {140155673622936: 1, 140155689147480: 2}}

140155673622936 should not be queue_dict

743026d5

More RTMIN+2 (log) information for clients and connections · 7e456329
Julien Muchembled authored Nov 14, 2018

7e456329

15 Nov, 2018 3 commits
- storage: check for conflicts when notifying that the a partition is replicated · d66b4f24
  Julien Muchembled authored Nov 06, 2018
  
  d66b4f24
- storage: clarify several assertions · f25b8ee3
  Julien Muchembled authored Nov 07, 2018
  
  f25b8ee3
- qa: new expectedFailure testcase method · 4150ffb1
  Julien Muchembled authored Nov 07, 2018
```
The idea is to write:

  with self.expectedFailure(...): \

just before the statement that is expected to fail. Contrary to the existing
decorator, we want to:
- be sure that the test fails at the expected line;
- be able to remove an expectedFailure without touching the code around.
```
  4150ffb1