Commits · 220612aff1d073a33bd10e204ed899e74ce3804a · Ayush Tiwari / erp5

25 Feb, 2019 1 commit

mroonga: prefer deferring full text on the same node for better cache efficiency · 220612af

Julien Muchembled authored Feb 25, 2019

In this specific case, the number of grouped messages is bound,
and actually not greater than the number of grouped messages
being processed for normal indexing.

220612af

21 Feb, 2019 2 commits

CMFActivity: enable node preference by default in some safe cases · 77424415
Julien Muchembled authored Feb 18, 2019

77424415

CMFActivity: new activate() parameter to prefer executing on the same node · 301962ad

Julien Muchembled authored Jan 08, 2019

The goal is to make better use of the ZODB Storage cache. It is common to do
processing on a data set in several sequential transactions: in such case, by
continuing execution of these messages on the same node, data is loaded from
ZODB only once. Without this, and if there are many other messages to process,
processing always continue on a random node, causing much more load from ZODB.

To prevent nodes from having too much work to do, or too little compared to
other nodes, this new parameter is only a hint for CMFActivity. It remains
possible for a node to execute a message that was intended for another node.

Before this commit, a processing node selects the first message(s) according to
the following ordering:

  priority, date

and now:

  priority, node_preference, date

where node_preference is:

  -1 -> same node
   0 -> no preferred node
   1 -> another node

The implementation is tricky for 2 reasons:
- MariaDB can't order this way in a single simple query, so we have 1
  subquery for each case, potentially getting 3 times the wanted maximum of
  messages, then order/filter on the resulting union.
- MariaDB also can't filter efficiently messages for other nodes, so the 3rd
  subquery returns messages for any node, potentially duplicating results from
  the first 2 subqueries. This works because they'll be ordered last.
  Unfortunately, this requires extra indices.

In any case, message reservation must be very efficient, or MariaDB deadlocks
quickly happen, and locking an activity table during reservation reduces
parallelism too much.

In addition to better cache efficiency, this new feature can be used as a
workaround for a bug affecting serialiation_tag, causing IntegrityError when
reindexing many new objects. If you have 2 recursive reindexations for both a
document and one of its lines, and if you have so many messages than grouping
is split between these 2 messages, then you end up with 2 nodes indexing the
same line in parallel: for some tables, the pattern DELETE+INSERT conflicts
since InnoDB does not take any lock when deleting a non-existent row.

If you have many activities creating such documents, you can combine with
grouping and appropriate priority to make sure that such pair of messages won't
be executed on different nodes, except maybe at the end (when there's no
document to create anymore; then activity reexecution may be enough).
For example:

  from Products.CMFActivity.ActivityTool import getCurrentNode
  portal.setPlacelessDefaultReindexParameters(
    activate_kw={'node': 'same', 'priority': priority},
    group_id=getCurrentNode())

where `priority` is the same as the activity containing the above code, which
can also use grouping without increasing the probability of IntegrityError.

301962ad

13 Feb, 2019 3 commits
- CMFActivity: even more test clean-up · 3ca5bf97
  Julien Muchembled authored Jan 23, 2019
  
  3ca5bf97
- CMFActivity: some optimization and clean-up in the code reserving messages · cee3e728
  Julien Muchembled authored Jan 08, 2019
  
  cee3e728
- CMFActivity: limit insertion by size in bytes instead of number of rows · 17dc7e23
  Julien Muchembled authored Jan 22, 2019
```
This fixes the issue that a transaction with many big messages failed to
commit. By dynamically find the maximum allowed size of a query, it also
speeds up insertion by minimizing the number of queries.
```
  17dc7e23
05 Feb, 2019 8 commits

CMFActivity: remove useless 'priority' index · 4b7acaa7
Julien Muchembled authored Jan 21, 2019

4b7acaa7
CMFActivity: remove old skin if any · afaa9d19
Julien Muchembled authored Jan 21, 2019

afaa9d19

CMFActivity: drop DTML completely and use consecutive uids when possible · d64887cb

Julien Muchembled authored Jan 21, 2019

This moves the remaining DTML queries to Python, dropping the 'activity' skin.

Dealing with conflicts of uids is easier if the inserted uids are consecutive:
now, only 1 random value is generated, as base uid. This also preserves the
order of insertion, which is wanted for performance reasons:
- No more random write in the primary index.
- When modifying several lines of several documents, 1 document being processed
  at a time, we'd like that any grouped activity (usually indexation) follows
  the same order, so that a processing node prefer many lines from a few
  documents instead of mixing lines from too many documents at the same time.
  This is usually better for caches.

d64887cb

CMFActivity: speed up console activity watcher · bcaf44a4

Julien Muchembled authored Jan 20, 2019

Average age of activities is dropped because it would become
too complicated to implement and it's useless information.

bcaf44a4

CMFActivity: remove processing/processing_date columns and improve watchers · 499d8f8b

Julien Muchembled authored Jan 18, 2019

The original goal was to improve performance by removing the
`processing_node_processing` index and the queries that modified
these 2 useless columns.

499d8f8b

CMFActivity: do not refresh view too often when checking messages dependencies · a32c8a42

Julien Muchembled authored Jan 16, 2019

The root call to getExecutableMessageList (i.e. the one from distribute)
is fast enough and won't hold old revisions of the database for too long.
It is also completely read-only so it won't lock anything.

This caused useless communication with the server.

a32c8a42

CMFActivity: validate using single SQL request per message · 9fe1d68d
Julien Muchembled authored Jan 15, 2019

9fe1d68d

CMFActivity: validate a message with a single SQL request per queue · 7e387bcb

Julien Muchembled authored Jan 14, 2019

As shown in the following example, on a big catalog table,
MariaDB is able to use several indices at the same time
('...' are obfuscated unique values):

  > analyze select SQL_NO_CACHE uid, relative_url from catalog where reference='...' OR relative_url='...';
  +------+-------------+---------+-------------+------------------------+------------------------+---------+------+------+--------+----------+------------+--------------------------------------------------+
  | id   | select_type | table   | type        | possible_keys          | key                    | key_len | ref  | rows | r_rows | filtered | r_filtered | Extra                                            |
  +------+-------------+---------+-------------+------------------------+------------------------+---------+------+------+--------+----------+------------+--------------------------------------------------+
  |    1 | SIMPLE      | catalog | index_merge | Reference,relative_url | Reference,relative_url | 768,767 | NULL |    2 |   2.00 |   100.00 |     100.00 | Using union(Reference,relative_url); Using where |
  +------+-------------+---------+-------------+------------------------+------------------------+---------+------+------+--------+----------+------------+--------------------------------------------------+
  1 row in set (0.00 sec)

So mixing different dependency types with OR should be fine
(no need to split into more subqueries and join with UNION).

7e387bcb

18 Jan, 2019 3 commits
- CMFActivity: remove dead code and simplify · 7b634f22
  Julien Muchembled authored Jan 14, 2019
  
  7b634f22
- CMFActivity: more test clean-up · 6338f120
  Julien Muchembled authored Jan 09, 2019
  
  6338f120
- CMFActivity: move most SQL queries from DTML to Python · ad7ee9aa
  Julien Muchembled authored Jan 08, 2019
  
  ad7ee9aa
13 Jan, 2019 1 commit

Fix reindexing with custom grouping using 'group_id' · 707058d7

Julien Muchembled authored Jan 13, 2019

This fixes:

  Traceback (innermost last):
    Module Products.CMFActivity.ActivityTool, line 1373, in invokeGroup
      traverse(method_id)(expanded_object_list)
    Module Products.ERP5Catalog.CatalogTool, line 946, in catalogObjectList
      super(CatalogTool, self).catalogObjectList(tmp_object_list, **m.kw)
    Module Products.ZSQLCatalog.ZSQLCatalog, line 813, in catalogObjectList
      **kw
  TypeError: catalogObjectList() got an unexpected keyword argument 'group_id'

707058d7

08 Jan, 2019 3 commits
- CMFActivity: some cleanup in tests · b1fe6111
  Julien Muchembled authored Jan 08, 2019
  
  b1fe6111
- Always make a quantity divergence tester take the delivery ratio into account · 7f4d22b7
  Julien Muchembled authored Dec 31, 2018
```
/reviewed-on nexedi/erp5!819
```
  7f4d22b7
- testnode: use CPUs a bit less agressively · 8930fec9
  Jérome Perrin authored Nov 30, 2018
```
* Set MAKEFLAGS with -l to limit number of jobs by load and number of
processes
* set NPY_NUM_BUILD_JOBS & BUNDLE_JOBS to `node_quantity` instance
parameter

/reviewed-on !809
```
  8930fec9
07 Jan, 2019 2 commits

testnode: avoid to rebuild testnode dependencies (firefox) all the time · 674d6459
Sebastien Robin authored Jan 07, 2019

674d6459

CMFActivity: better date ordering by using micro-precision · b82f3ba1

Julien Muchembled authored Jan 04, 2019

Originally, uids somehow sorted messages by date of insertion, in particular
for those that were created within the same second. But since random uids,
such messages became validated or processed in random order.

Note however that by default, messages created in the same transaction all have
exactly the same date, so commit a42da4de
("CMFActivity: Do not use offset for scanning messages to validate.")
forces us to keep the ordering on uids (in addition to priority/date).

Existing instances will upgrade automatically, using the already existing code
to upgrade tables in a generic way. You should see the following logs:

    INFO CMFActivity 'message_queue' table upgraded
    ALTER TABLE message_queue
      MODIFY COLUMN date datetime(6) NOT NULL AFTER uid,
      MODIFY COLUMN processing_date datetime(6) DEFAULT NULL AFTER processing
    INFO CMFActivity 'message_job' table upgraded
    ALTER TABLE message_job
      MODIFY COLUMN date datetime(6) NOT NULL AFTER uid,
      MODIFY COLUMN processing_date datetime(6) DEFAULT NULL AFTER processing
    INFO CMFActivity 'message' table upgraded
    ALTER TABLE message
      MODIFY COLUMN date datetime(6) NOT NULL AFTER uid,
      MODIFY COLUMN processing_date datetime(6) DEFAULT NULL AFTER processing


/reviewed-on nexedi/erp5!820

b82f3ba1

02 Jan, 2019 1 commit
- erp5_web_renderjs_ui: Fix execDisplayERP5ActionCommand of erp5_router · e32deab9
  Georgios Dagkakis authored Jan 02, 2019
```
so that options are kept
```
  e32deab9
31 Dec, 2018 1 commit
- [erp5_web_renderjs_ui] Selecting nothing means all are selected · bddcb668
  Romain Courteaud authored Dec 21, 2018
```
Same behaviour than xhtml style
```
  bddcb668
21 Dec, 2018 2 commits
- Fix Base_reindexObjectSecurity when an allowed content is a solver · 27126964
  Julien Muchembled authored Dec 21, 2018
  
  27126964
- [erp5_web_renderjs_ui] Move include/exclude logic into form_list gadget · 9de61fa2
  Romain Courteaud authored Dec 18, 2018
```
Next step is to activate more contextual listbox actions.

Move listbox select button to the right, to simplify cancellation.
```
  9de61fa2
20 Dec, 2018 2 commits
- [erp5_web_ui_test] Do not use HTTPSConnection over http · 76aa608b
  Romain Courteaud authored Dec 20, 2018
  
  76aa608b
- [erp5_web_ui_test] Add timeout · 0d55282a
  Romain Courteaud authored Dec 20, 2018
  
  0d55282a
17 Dec, 2018 4 commits
- [erp5_web] Drop the VirtualHostMonster ID when configured directly on the section · 9d82f3f3
  Romain Courteaud authored Dec 17, 2018
```
WebSite redirection must be tested against different virtual host monster configuration
TODO: No idea how to fetch the ipv6 in live test
```
  9d82f3f3
- [erp5_web] No need to duplicate the REQUEST actual_url key · d76704f8
  Romain Courteaud authored Dec 17, 2018
  
  d76704f8
- testsuite: better test result name for SR egg test · aa516c7e
  Jérome Perrin authored Dec 06, 2018
```
https://lab.nexedi.com/nexedi/slapos/tree/master/software/proftpd/test
was just named "test", we want the test result to be named "proftpd"
instead.

/reviewed-on nexedi/erp5!815
```
  aa516c7e
- bt5: mark sphinxse and myisam fulltext bt5s deprecated in favor of mroonga · c03e5ea9
  Nicolas Wavrant authored Dec 17, 2018
  
  c03e5ea9
14 Dec, 2018 2 commits

[erp5_web_renderjs_ui] Listbox accept the sort parameter · 515b936b
Romain Courteaud authored Dec 13, 2018
```
See nexedi/erp5@04939506
```
515b936b

ERP5Type.Core.Folder: Fix recursive indexation of Trash Bins. · e733331f

Vincent Pelletier authored Dec 14, 2018

Since recent rework of isSubtreeIndexable, direct indexation of a trash bin
is fixed, but recursive indexation recursion condition broke: it stop just
above the trash bin instead of stopping just below it.

e733331f

13 Dec, 2018 1 commit

Revert "mark file uploading tests as expected failure." as these Functional... · dd528bae

Arnaud Fontaine authored Dec 12, 2018

Revert "mark file uploading tests as expected failure." as these Functional Tests are not supposed to fail.

This reverts commit ade16831.

enablePrivilege, and thus UniversalFileRead used to upload files, was disabled
in Firefox 17. Since SlapOS Firefox has been upgraded, "The operation is insecure"
error is raised when uploading files. However, this doesn't mean that these tests
should be expected to fail (and if they are, they should be removed instead).

dd528bae

12 Dec, 2018 1 commit

accounting: restrict restarting accounting period to Assignor · 366a760e

Jérome Perrin authored Dec 06, 2018

This workflow involved an Assignee who can open, close, re-open and an
Assignor who can close definitively.
This is usually configured so that accountants are Assignee and CFO is
Assignor.

We realized that re-opening a Period that was previously closed is
something we don't want the accountants to do without CFO's approval.

To support this configuration, we only allow Assignor to re-open.

Now Assignee can open and close temporarily and Assignor can re-open and
close definitively.

/reviewed-on nexedi/erp5!813

366a760e

11 Dec, 2018 3 commits

[erp5_gadget_interface_validator*] Delay loading of all renderjs iframe gadget · 30b82e54

Romain Courteaud authored Dec 11, 2018

This is really a hack, which may be integrated into renderJS directly

Reduce potential Zelenium test timeout by waiting for iframe to be loaded one by one

30b82e54

Base: Restore lost check of self.isIndexable . · 076ddfb2

Vincent Pelletier authored Dec 11, 2018

This check got lost when the first isSubtreeIndexable call was done on
parent document and not on self.

076ddfb2

Revert a change accidentally included in an unrelated commit. · ed452b33

Vincent Pelletier authored Dec 11, 2018

This partially reverts:

commit 76e3c115
Author: Vincent Pelletier <vincent@nexedi.com>
Date:   Mon Dec 10 16:40:48 2018 +0900

    Base: Fix isAncestryIndexable implementation.

as it accidentally carried over a totally unrelated (and unfinished)
change.

ed452b33