Commits · 03042a693c33cfdf8c9c1562417cde43c8f0fedf · nexedi / neoppod

28 Mar, 2016 2 commits
- Add support for recent ZODB · 03042a69
  Julien Muchembled authored Mar 28, 2016
  
  03042a69
- tests: sort hunks of ZODB3.patch to match output of git-show · 04681e54
  Julien Muchembled authored Mar 28, 2016
  
  04681e54
22 Mar, 2016 2 commits
- client: fix invalidation issues when reconnecting to the master · 694c27f4
  Julien Muchembled authored Mar 22, 2016
  
  694c27f4
- Recover from failures during tpc_finish when the transaction got successfully committed · dd74d662
  Julien Muchembled authored Mar 21, 2016
  
  dd74d662
21 Mar, 2016 3 commits

master: do never abort a prepared transaction · 7ee7ff4e

Julien Muchembled authored Mar 20, 2016

This fixes the following crash (for example when a client disconnects during
tpc_finish):

Traceback (most recent call last):
  ...
  File "neo/master/handlers/storage.py", line 68, in answerInformationLocked
    self.app.tm.lock(ttid, conn.getUUID())
  File "neo/master/transactions.py", line 338, in lock
    if self._ttid_dict[ttid].lock(uuid) and self._queue[0][1] == ttid:
IndexError: list index out of range

7ee7ff4e

storage: fix crash when a client disconnects just after it requested to finish a transaction · 7aecdada
Julien Muchembled authored Mar 21, 2016

7aecdada
doc: minor changes in importer.conf · b05b961b
Julien Muchembled authored Mar 20, 2016

b05b961b

09 Mar, 2016 2 commits
- Release version 1.6.2 · 1705d828
  Julien Muchembled authored Mar 09, 2016
  
  1705d828
- BUGS: possible "uuid" conflict issue after clients got disconnected from the master · 24780e8e
  Julien Muchembled authored Mar 09, 2016
  
  24780e8e
08 Mar, 2016 2 commits
- tests: check case of multiple conflict resolutions for the same (oid, txn) · eee74faf
  Julien Muchembled authored Mar 08, 2016
  
  eee74faf
- tests: new helper to synchronize threads · 15bcd495
  Julien Muchembled authored Mar 08, 2016
  
  15bcd495
04 Mar, 2016 3 commits

storage: move the commit at tpc_vote from the backends to the unique caller · 645920e8
Julien Muchembled authored Mar 04, 2016

645920e8

storage: defer commit when unlocking a transaction (-> better performance) · eaa07e25

Julien Muchembled authored Mar 04, 2016

Before this change, a storage node did 3 commits per transaction:
- once all data are stored
- when locking the transaction
- when unlocking the transaction

The last one is not important for ACID. In case of a crash, the transaction
is unlocked again (verification phase). By deferring it by 1 second, we
only have 2 commits per transaction during high activity because all pending
changes are merged with the commits caused by other transactions.

This change compensates the extra commit(s) per transaction that were
introduced in commit 7eb7cf1b
("Minimize the amount of work during tpc_finish").

eaa07e25

client: optimize cache by not keeping items with counter=0 in history queue · 254878a8
Julien Muchembled authored Mar 02, 2016

254878a8

02 Mar, 2016 1 commit

client: revert incorrect memory optimization · 763806e0

Julien Muchembled authored Mar 02, 2016

Since commit d2d77437 ("client: make the cache
tolerant to late invalidations when the entry is in the history queue"),
invalidated items became current again when they were moved to the history
queue, which was wrong for 2 reasons:
- only the last items of _oid_dict values may have next_tid=None,
- and for such items, they could be wrongly reused when caching the real
  current data.

763806e0

01 Mar, 2016 1 commit
- storage: switch to a maintained fork of MySQL-python · 5f0c93f5
  Julien Muchembled authored Mar 01, 2016
  
  5f0c93f5
26 Feb, 2016 4 commits
- README: minor update · e0bd2b5b
  Julien Muchembled authored Feb 26, 2016
  
  e0bd2b5b
- doc: rename CHANGES/README/UPGRADE for GitLab · 55eb90c1
  Julien Muchembled authored Feb 26, 2016
  
  55eb90c1
- tests: new NEO_DB_SOCKET environment variable to chose the MySQL server to use · cc72e972
  Julien Muchembled authored Feb 26, 2016
  
  cc72e972
- BUGS: deadlock avoidance can also happen with only 1 storage node · 9bd524ab
  Julien Muchembled authored Feb 26, 2016
  
  9bd524ab
05 Feb, 2016 1 commit

client: make the cache tolerant to late invalidations when the entry is in the history queue · d2d77437

Julien Muchembled authored Feb 05, 2016

This fixes the following scenario:
1. the master sends invalidations to clients,
   and unlocks to storages  (oid1, tid1)
2. the storage receives/processes the unlock
3. the client asks data (oid1, tid0)
4. the storage returns tid1 as next tid, whereas it's still None in the cache
   (before, it caused an assertion failure)
6. the client processes invalidations

d2d77437

25 Jan, 2016 2 commits
- Release version 1.6 · a7f50dfc
  Julien Muchembled authored Jan 25, 2016
  
  a7f50dfc
- Update copyright year · 5a8e9d04
  Julien Muchembled authored Jan 25, 2016
  
  5a8e9d04
21 Jan, 2016 2 commits
- Update neo/debug.py example · 321b0bf8
  Julien Muchembled authored Jan 21, 2016
  
  321b0bf8
- tests: document Patch class · e5c056b9
  Julien Muchembled authored Jan 21, 2016
  
  e5c056b9
12 Jan, 2016 1 commit
- client: remove obsolete comment in Storage.load · d43bd510
  Julien Muchembled authored Jan 12, 2016
```
See commit c277ed20
("client: really process all invalidations in poll thread").
```
  d43bd510
16 Dec, 2015 2 commits
- neoctl: don't print 'None' on successful check/truncate commands · 50a6cf41
  Julien Muchembled authored Dec 14, 2015
  
  50a6cf41
- interfaces: check signature of methods · 82d95846
  Julien Muchembled authored Dec 13, 2015
  
  82d95846
13 Dec, 2015 3 commits
- storage: define interface for backends and check they implement it · f419f974
  Julien Muchembled authored Dec 13, 2015
  
  f419f974
- importer: allow truncation after the last tid to import, during or after the import · c6b80f7b
  Julien Muchembled authored Dec 13, 2015
```
This is a partial implementation. To truncate at a smaller tid, you must wait
that data is imported up to this tid and stop using the Importer backend.
```
  c6b80f7b
- importer: do not implement deleteTransaction, now only used for replication · 24a9f1b8
  Julien Muchembled authored Dec 13, 2015
```
This backend does not support replication. Even if we implemented it, such node
could only be a source for other nodes so we should never delete transactions.
```
  24a9f1b8
12 Dec, 2015 1 commit
- neolog: fix crash on unknown packets · af8a8370
  Julien Muchembled authored Dec 12, 2015
  
  af8a8370
11 Dec, 2015 1 commit
- client: dump cache stats on SIGRTMIN+2 · 9e543d76
  Julien Muchembled authored Dec 11, 2015
  
  9e543d76
09 Dec, 2015 1 commit
- client: fix spurious connection timeouts · 06a64d80
  Julien Muchembled authored Dec 09, 2015
```
This fixes a regression caused by
commit eef52c27
```
  06a64d80
02 Dec, 2015 1 commit
- Release version 1.6 · f180b00e
  Julien Muchembled authored Dec 02, 2015
  
  f180b00e
01 Dec, 2015 3 commits

master: fix verification when nodes don't have any readable cell · cd669221
Julien Muchembled authored Nov 24, 2015

cd669221
Bump protocol version and upgrade storages automatically · ca2caf87
Julien Muchembled authored Nov 25, 2015

ca2caf87

Safer DB truncation, new 'truncate' ctl command · d3c8b76d

Julien Muchembled authored Dec 01, 2015

With the previous commit, the request to truncate the DB was not stored
persistently, which means that this operation was still vulnerable to the case
where the master is restarted after some nodes, but not all, have already
truncated. The master didn't have the information to fix this and the result
was a DB partially truncated.

-> On a Truncate packet, a storage node only stores the tid somewhere, to send
   it back to the master, which stays in RECOVERING state as long as any node
   has a different value than that of the node with the latest partition table.

We also want to make sure that there is no unfinished data, because a user may
truncate at a tid higher than a locked one.

-> Truncation is now effective at the end on the VERIFYING phase, just before
   returning the last ids to the master.

At last all nodes should be truncated, to avoid that an offline node comes back
with a different history. Currently, this would not be an issue since
replication is always restart from the beginning, but later we'd like they
remember where they stopped to replicate.

-> If a truncation is requested, the master waits for all nodes to be pending,
   even if it was previously started (the user can still force the cluster to
   start with neoctl). And any lost node during verification also causes the
   master to go back to recovery.

Obviously, the protocol has been changed to split the LastIDs packet and
introduce a new Recovery, since it does not make sense anymore to ask last ids
during recovery.

d3c8b76d

30 Nov, 2015 2 commits

Perform DB truncation during recovery, send PT to storages before verification · 3e3eab5b

Julien Muchembled authored Nov 25, 2015

Currently, the database may only be truncated when leaving backup mode, but
the issue will be the same when neoctl gets a new command to truncate at an
arbitrary tid: we want to be sure that all nodes are truncated before anything
else.

Therefore, we stop sending Truncate orders before stopping operation because
nodes could fail/exit before actually processing them. Truncation must also
happen before asking nodes their last ids.

With this commit, if a truncation is requested:
- this is always the first thing done when a storage node connects to the
  primary master during the RECOVERING phase,
- and the cluster does not start automatically if there are missing nodes,
  unless an admin forces it.

Other changes:
- Connections to storage nodes don't need to be aborted anymore when leaving
  backup mode.
- The master always initiates communication when a storage node identifies,
  which simplifies code and reduces the number of exchanged packets.

3e3eab5b

master: fix possible blockage during recovery after a storage disconnection · 2485f151

Julien Muchembled authored Nov 19, 2015

At some point, the master asks a storage node its partition table. If this node
is lost before getting an answer, another node (or the same one if it comes
back) must be asked.

Before this change, the master node had to be restarted.

2485f151