Commits · cb1c9934299f102255ec02957c80f907f116b15d · Kirill Smelkov / neo

01 Dec, 2020 1 commit

Connection: Adjust msg_id a bit so it behaves like stream_id in HTTP/2 · cb1c9934

Kirill Smelkov authored Dec 18, 2016

This is 2020 edition of my original patch from 2016 ( dd3bb8b4 ).

It was described in my NEO/go article ( https://navytux.spb.ru/~kirr/neo.html )
in the paragraph quoted below:

NEO/go shifts from thinking about protocol logic as RPC to thinking of it as
more general network protocol and settles to provide general
connection-oriented message exchange service[1] : whenever a message with new
`msg_id` is sent, a new connection is established multiplexed on top of a
single node-node TCP link. Then it is possible to send/receive arbitrary
messages over back and forth until so established connection is closed. This
works transparently to NEO/py who still thinks it operates in simple RPC mode
because of the way messages are put on the wire and because simple RPC is
subset of a general exchange. The `neonet` module also provides `DialLink` and
`ListenLink` primitives[2] that work similarly to standard Go `net.Dial` and
`net.Listen` but wrap so created link into the multiplexing layer. What is
actually done this way is very similar to HTTP/2 which also provides multiple
general streams multiplexing on top of a single TCP connection ([3], [4]).
However if connection ids (sent in place of `msg_id` on the wire) are assigned
arbitrary, there could be a case when two nodes could try to initiate two new
different connections to each other with the same connection id. To prevent
such kind of conflict a simple rule to allocate connection ids either even or
odd, depending on the role peer played while establishing the link, could be
used. HTTP/2 takes similar approach[5] where `"Streams initiated by a client
MUST use odd-numbered stream identifiers; those initiated by the server MUST
use even-numbered stream identifiers."` with NEO/go doing the same
corresponding to who was originally dialer and who was a listener. However it
requires small patch to be applied on NEO/py side to increment `msg_id` by 2
instead of 1.

[1] https://lab.nexedi.com/kirr/neo/blob/463ef9ad/go/neo/neonet/connection.go
[2] https://lab.nexedi.com/kirr/neo/blob/463ef9ad/go/neo/neonet/newlink.go
[3] https://tools.ietf.org/html/rfc7540#section-5
[4] https://http2.github.io/faq/#why-is-http2-multiplexed
[5] https://tools.ietf.org/html/rfc7540#section-5.1.1

It can be named as "terrible", "irritating", "stupid" or "crazy", but the fact is:

- it does no harm to NEO/py and is backward-compatible: a NEO/py node
without this patch can still successfully connect and interoperate to
another NEO/py node with this patch.

- it is required for NEO/go to be able to interoperate with NEO/py.
Both client and server parts of NEO/go use the same neonet module to exchange messages.

- NEO/go client is used by wendelin.core 2, which organizes access to on-ZODB
ZBigFile data via WCFS filesystem implemented in Go.

So on one side this patch is small, simple and does not do any harm to NEO/py.
On the other side it is required for NEO/go and wendelin.core 2.

To me this clearly indicates that there should be NO GOOD REASON to reject
inclusion of this patch into NEO/py.

--------

My original patch from 2016 came with corresponding adjustments to neo/tests/testConnection.py
( dd3bb8b4 )
but commit f6eb02b4 (Remove packet timeouts; 2017-05-04) removed testConnection.py
completely and, if I understand correctly, did not add any other test to
compensate that. This way I'm not trying to restore my tests to
Connection neither.

Anyway, with this patch there is no regression to all other existing NEO/py tests.

--------

My original patch description from 2016 follows:

- even for server initiated streams
- odd for client initiated streams

This way I will be able to use Pkt.msg_id as real stream_id in go's Conn
because with even / odd scheme there is no possibility for id conflicts
in between two peers.

/cc @romain, @tomo, @rafael, @arnau, @vpelletier, @klaus, @Tyagov

cb1c9934

19 Aug, 2020 4 commits

Merge "mysql: workaround for MDEV-20693" · f2ea4be2
Julien Muchembled authored May 22, 2020

f2ea4be2

qa: skip broken ZODB test · a1418c9d

Julien Muchembled authored Jun 12, 2020

======================================================================
FAIL: check_tid_ordering_w_commit (neo.tests.zodb.testBasic.BasicTests)
----------------------------------------------------------------------
Traceback (most recent call last):
  File "ZODB/tests/BasicStorage.py", line 397, in check_tid_ordering_w_commit
    self.assertEqual(results.pop('lastTransaction'), tids[1])
  File "neo/tests/__init__.py", line 301, in assertEqual
    return super(NeoTestBase, self).assertEqual(first, second, msg=msg)
failureException: '\x03\xd8\x85H\xbffp\xbb' != '\x03\xd8\x85H\xbfs\x0b\xdd'

(cherry picked from commit f4cb59d2)

a1418c9d

client: fix race with invalidations when starting a new transaction on ZODB 5 · 96a5c01f
Julien Muchembled authored Jun 05, 2020
```
This requires ZODB >= 5.6.0

(cherry picked from commit a7d101ec)
```
96a5c01f
Code clean-up, comment fixes · fa7fbad6
Julien Muchembled authored Feb 20, 2020
```
(cherry picked from commit 43029be2)
```
fa7fbad6

22 May, 2020 2 commits

master: fix crash in STARTING_BACKUP when connecting to an upstream secondary master · 011eba12

Julien Muchembled authored Apr 30, 2019

This fixes the following assertion:

  Traceback (most recent call last):
    File "neo/master/app.py", line 172, in run
      self._run()
    File "neo/master/app.py", line 182, in _run
      self.playPrimaryRole()
    File "neo/master/app.py", line 302, in playPrimaryRole
      self.backup_app.provideService())
    File "neo/master/backup_app.py", line 114, in provideService
      node, conn = bootstrap.getPrimaryConnection()
    File "neo/lib/bootstrap.py", line 74, in getPrimaryConnection
      poll(1)
    File "neo/lib/event.py", line 160, in poll
      to_process.process()
    File "neo/lib/connection.py", line 504, in process
      self._handlers.handle(self, self._queue.pop(0))
    File "neo/lib/connection.py", line 92, in handle
      self._handle(connection, packet)
    File "neo/lib/connection.py", line 107, in _handle
      pending[0][1].packetReceived(connection, packet)
    File "neo/lib/handler.py", line 125, in packetReceived
      self.dispatch(*args)
    File "neo/lib/handler.py", line 75, in dispatch
      method(conn, *args, **kw)
    File "neo/lib/handler.py", line 159, in notPrimaryMaster
      assert primary != self.app.server
  AttributeError: 'BackupApplication' object has no attribute 'server'

(cherry picked from commit dba07e72)

011eba12

mysql: workaround for MDEV-20693 · 70387981
Julien Muchembled authored Jan 15, 2020

70387981

07 Jan, 2020 1 commit
- Merge v1.12 · 2c823e2e
  Julien Muchembled authored Jan 07, 2020
  
  2c823e2e
28 Apr, 2019 1 commit
- Release version 1.12 · 6332112c
  Julien Muchembled authored Apr 28, 2019
  
  6332112c
27 Apr, 2019 12 commits

master: reject drop/tweak ctl commands that could lead to unwanted status · 55a6dd0f

Julien Muchembled authored Apr 11, 2019

The following 2 operations can be onerous and they should not be
directly usable without some kind of confirmation by the user:
- Dropping a node now requires to first stop it.
- Tweaking does not exclude anymore automatically DOWN nodes,
  because a node could go DOWN between the moment the user sends
  the command to tweak and the actual tweak by the master.

55a6dd0f

qa: extend test reproducing the migration of a big ZODB to NEO · ef4d58f6
Julien Muchembled authored Apr 07, 2019

ef4d58f6
neoctl: better display of full partition tables · ab082d7e
Julien Muchembled authored Apr 04, 2019

ab082d7e
Bump protocol version · c6453626
Julien Muchembled authored Apr 26, 2019

c6453626

tweak: add option to simulate · 2a27239d

Julien Muchembled authored Mar 31, 2019

Initially, I wanted to do the simulation inside neoctl but it has no knowledge
of the topology (the master don't send devpath values of storage nodes).
Therefore, the work is delegated to the master node, which implies a change
of the protocol.

2a27239d

tweak: do not crash when trying to remove all nodes · 3839d224
Julien Muchembled authored Apr 04, 2019

3839d224
tweak: do not touch cells of nodes that are intended to be dropped · 8a645d9f
Julien Muchembled authored Mar 29, 2019

8a645d9f

Better error reporting from the master to neoctl for denied requests · c2c9e99d

Julien Muchembled authored Apr 06, 2019

This stops abusing ProtocolError, which disconnects the admin node needlessly.

The many 'if ... raise RuntimeError' in neo/neoctl/neoctl.py
could be turned into assertions.

c2c9e99d

Make 'neoctl print pt' report the number of replicas · 21190ee7
Julien Muchembled authored Mar 31, 2019

21190ee7

Make the number of replicas modifiable when the cluster is running · ef5fc508

Julien Muchembled authored Mar 27, 2019

neoctl gets a new command to change the number of replicas.

The number of replicas becomes a new partition table attribute and
like the PT id, it is stored in the config table. On the other side,
the configuration value for the number of partitions is dropped,
since it can be computed from the partition table, which is
always stored in full.

The -p/-r master options now only apply at database creation.

Some implementation notes:

- The protocol is slightly optimized in that the master now sends
  automatically the whole partition tables to the admin & client
  nodes upon connection, like for storage nodes.
  This makes the protocol more consistent, and the master is the
  only remaining node requesting partition tables, during recovery.

- Some parts become tricky because app.pt can be None in more cases.
  For example, the extra condition in NodeManager.update
  (before app.pt.dropNode) was added for this is the reason.
  Or the 'loadPartitionTable' method (storage) that is not inlined
  because of unit tests.
  Overall, this commit simplifies more than it complicates.

- In the master handlers, we stop hijacking the 'connectionCompleted'
  method for tasks to be performed (often send the full partition
  table) on handler switches.

- The admin's 'bootstrapped' flag could have been removed earlier:
  race conditions can't happen since the AskNodeInformation packet
  was removed (commit d048a52d).

ef5fc508

New --new-nid storage option for fast cloning · 27e3f620

Julien Muchembled authored Mar 21, 2019

It is often faster to set up replicas by stopping a node (and any
underlying database server like MariaDB) and do a raw copy of the
database (e.g. with rsync). So far, it required to stop the whole
cluster and use tools like 'mysql' or sqlite3' to edit:
- the 'pt' table in databases,
- the 'config.nid' values of the new nodes.

With this new option, if you already have 1 replica, you can set up
new replicas with such fast raw copy, and without interruption of
service. Obviously, this implies less redundancy during the operation.

27e3f620

qa: fix 2 tests with ZODB5 · 64e02391
Julien Muchembled authored Apr 26, 2019

64e02391

26 Apr, 2019 4 commits
- qa: new tools/stress options to evaluate MySQL engines · 491f4c89
  Julien Muchembled authored Apr 23, 2019
```
--kill-mysqld should be combined with something like -f .3 -r .1
to give storage nodes enough time to recover.
And also -D 0 to focus testing on the storage backend rather than NEO.
```
  491f4c89
- qa: provide a way to let tests start 1 mysqld per storage node · c11410ef
  Julien Muchembled authored Apr 23, 2019
  
  c11410ef
- mysql: make 'user' actually optional in the DB connection string · 74ec44e3
  Julien Muchembled authored Apr 23, 2019
  
  74ec44e3
- mysql: specify column families for RocksDB · 87c1de3b
  Julien Muchembled authored Apr 17, 2019
  
  87c1de3b
16 Apr, 2019 5 commits
- qa: add testIncremental (testImporter) test · aa7b654f
  Julien Muchembled authored Apr 09, 2019
  
  aa7b654f
- importer: fix hidden "maximum recursion depth exceeded" at startup · d5834ee9
  Julien Muchembled authored Apr 09, 2019
  
  d5834ee9
- importer: fix closure of ZODB, and also do it when the import is finished · c37bcfa3
  Julien Muchembled authored Apr 09, 2019
  
  c37bcfa3
- sqlite: fix resumption of migration to NEO with Importer · 6608a868
  Julien Muchembled authored Apr 09, 2019
  
  6608a868
- qa: fix a random failure in threaded tests · 989e9920
  Julien Muchembled authored Apr 06, 2019
```
This also reverts commit 442bb43a.
```
  989e9920
05 Apr, 2019 3 commits
- importer: speed up startup when the import is already finished · 26b1246a
  Julien Muchembled authored Apr 05, 2019
  
  26b1246a
- importer: fix replication (as source) once import is finished · 9d14ea1b
  Julien Muchembled authored Apr 05, 2019
```
This fixes up commit be839e92.
```
  9d14ea1b
- storage: fix DatabaseManager.getLastTID with max_tid · c58d4862
  Julien Muchembled authored Apr 05, 2019
  
  c58d4862
01 Apr, 2019 1 commit
- qa: remove 2 useless unit tests · b10cc750
  Julien Muchembled authored Mar 29, 2019
  
  b10cc750
21 Mar, 2019 2 commits

storage: allow the master to change our node id · 15369269
Julien Muchembled authored Mar 21, 2019
```
This is not used currently.
```
15369269

Rename --uuid command-line options into --nid · e8473a23

Julien Muchembled authored Mar 21, 2019

This breaks compatibily but it was mentionned from the beginning
that these options are only there for testing purpose.

TODO: rename all remaining occurrences of UUID into NID in the code

e8473a23

16 Mar, 2019 1 commit

importer: fix possible data loss on writeback · e387ad59

Julien Muchembled authored Mar 12, 2019

If the source DB is lost during the import and then restored from a backup,
all new transactions have to written back again on resume. It is the most
common case for which the writeback hits the maximum number of transactions
per partition to process at each iteration; the previous code was buggy in
that it could skip transactions.

e387ad59

13 Mar, 2019 3 commits
- client: inline Application._loadFromCache · a33c624c
  Julien Muchembled authored Mar 13, 2019
  
  a33c624c
- client: replace global load lock by a per-oid one · 87eca1e0
  Julien Muchembled authored Mar 09, 2019
  
  87eca1e0
- client: unindent code · c7cdcf87
  Julien Muchembled authored Mar 08, 2019
  
  c7cdcf87