Commits · 74b5a68ef7fa65a959696b83f25b6fc867c9435e · Łukasz Nowak / erp5

22 May, 2019 1 commit
- CMFActivity.ActiivityTool: Move getCurrentNodeFamilyIdSet call in try..finally block. · 74b5a68e
  Vincent Pelletier authored 5 years ago
```
Even though it should not raise, a stale lock would be a serious issue.
```
  74b5a68e
21 May, 2019 2 commits

fixup! CMFActivity: Implement node families. · 7afd6521
Vincent Pelletier authored 5 years ago

7afd6521

CMFActivity: Implement node families. · bf35001b

The intent is to be able to tell that an independently-defined group of
activity nodes may execute given activity, and no other node.
This allows more flexible parallelism control than serialization_tag.

bf35001b

13 Mar, 2019 1 commit
- fixup! CMFActivity: some optimization and clean-up in the code reserving messages · a7fec1ef
  Julien Muchembled authored 6 years ago
```
The fixes a missing rename in
commit cee3e728.
```
  a7fec1ef
21 Feb, 2019 2 commits

CMFActivity: enable node preference by default in some safe cases · 77424415
Julien Muchembled authored 6 years ago

77424415

CMFActivity: new activate() parameter to prefer executing on the same node · 301962ad

Julien Muchembled authored 6 years ago

The goal is to make better use of the ZODB Storage cache. It is common to do
processing on a data set in several sequential transactions: in such case, by
continuing execution of these messages on the same node, data is loaded from
ZODB only once. Without this, and if there are many other messages to process,
processing always continue on a random node, causing much more load from ZODB.

To prevent nodes from having too much work to do, or too little compared to
other nodes, this new parameter is only a hint for CMFActivity. It remains
possible for a node to execute a message that was intended for another node.

Before this commit, a processing node selects the first message(s) according to
the following ordering:

  priority, date

and now:

  priority, node_preference, date

where node_preference is:

  -1 -> same node
   0 -> no preferred node
   1 -> another node

The implementation is tricky for 2 reasons:
- MariaDB can't order this way in a single simple query, so we have 1
  subquery for each case, potentially getting 3 times the wanted maximum of
  messages, then order/filter on the resulting union.
- MariaDB also can't filter efficiently messages for other nodes, so the 3rd
  subquery returns messages for any node, potentially duplicating results from
  the first 2 subqueries. This works because they'll be ordered last.
  Unfortunately, this requires extra indices.

In any case, message reservation must be very efficient, or MariaDB deadlocks
quickly happen, and locking an activity table during reservation reduces
parallelism too much.

In addition to better cache efficiency, this new feature can be used as a
workaround for a bug affecting serialiation_tag, causing IntegrityError when
reindexing many new objects. If you have 2 recursive reindexations for both a
document and one of its lines, and if you have so many messages than grouping
is split between these 2 messages, then you end up with 2 nodes indexing the
same line in parallel: for some tables, the pattern DELETE+INSERT conflicts
since InnoDB does not take any lock when deleting a non-existent row.

If you have many activities creating such documents, you can combine with
grouping and appropriate priority to make sure that such pair of messages won't
be executed on different nodes, except maybe at the end (when there's no
document to create anymore; then activity reexecution may be enough).
For example:

  from Products.CMFActivity.ActivityTool import getCurrentNode
  portal.setPlacelessDefaultReindexParameters(
    activate_kw={'node': 'same', 'priority': priority},
    group_id=getCurrentNode())

where `priority` is the same as the activity containing the above code, which
can also use grouping without increasing the probability of IntegrityError.

301962ad

05 Feb, 2019 4 commits

CMFActivity: remove old skin if any · afaa9d19
Julien Muchembled authored 6 years ago

afaa9d19

CMFActivity: remove processing/processing_date columns and improve watchers · 499d8f8b

Julien Muchembled authored 6 years ago

The original goal was to improve performance by removing the
`processing_node_processing` index and the queries that modified
these 2 useless columns.

499d8f8b

CMFActivity: validate using single SQL request per message · 9fe1d68d
Julien Muchembled authored 6 years ago

9fe1d68d

CMFActivity: validate a message with a single SQL request per queue · 7e387bcb

Julien Muchembled authored 6 years ago

As shown in the following example, on a big catalog table,
MariaDB is able to use several indices at the same time
('...' are obfuscated unique values):

  > analyze select SQL_NO_CACHE uid, relative_url from catalog where reference='...' OR relative_url='...';
  +------+-------------+---------+-------------+------------------------+------------------------+---------+------+------+--------+----------+------------+--------------------------------------------------+
  | id   | select_type | table   | type        | possible_keys          | key                    | key_len | ref  | rows | r_rows | filtered | r_filtered | Extra                                            |
  +------+-------------+---------+-------------+------------------------+------------------------+---------+------+------+--------+----------+------------+--------------------------------------------------+
  |    1 | SIMPLE      | catalog | index_merge | Reference,relative_url | Reference,relative_url | 768,767 | NULL |    2 |   2.00 |   100.00 |     100.00 | Using union(Reference,relative_url); Using where |
  +------+-------------+---------+-------------+------------------------+------------------------+---------+------+------+--------+----------+------------+--------------------------------------------------+
  1 row in set (0.00 sec)

So mixing different dependency types with OR should be fine
(no need to split into more subqueries and join with UNION).

7e387bcb

18 Jan, 2019 2 commits
- CMFActivity: remove dead code and simplify · 7b634f22
  Julien Muchembled authored 6 years ago
  
  7b634f22
- CMFActivity: move most SQL queries from DTML to Python · ad7ee9aa
  Julien Muchembled authored 6 years ago
  
  ad7ee9aa
03 Jul, 2018 2 commits
- ActivityTool: Inherit from BaseTool. · c3de53b8
  Vincent Pelletier authored 7 years ago
```
Reduces code duplication.
```
  c3de53b8
- ActivityTool: Do not mark as non-indexable. · c2ebc937
  Vincent Pelletier authored 7 years ago
```
portal_activities contains documents which need to be (and are) indexed.
For consistency, the tool itself should be indexable.
```
  c2ebc937
20 Jun, 2018 1 commit
- CMFActivity: Remove unused import. · bdf46eae
  Vincent Pelletier authored 6 years ago
  
  bdf46eae
16 May, 2018 1 commit
- CMFActivity: warnings for deprecated methods · 35cadb66
  Roque authored 6 years ago
  
  35cadb66
26 Apr, 2018 1 commit
- CMFActivity: move getCurrentNode and getServerAddress methods outside ActivityTool class · 6d74ba22
  Roque authored 6 years ago
```
- getCurrentNode method can be directly imported from CMFActivity
- corresponding unittests

/reviewed-on nexedi/erp5!647
```
  6d74ba22
06 Mar, 2018 2 commits

CMFActivity: Use roundrobin for dequeue the messages · 220e0c37
Hardik Juneja authored 7 years ago

220e0c37

CMFActivity: Use CMFActivity as a backend for joblib · d2c88bd6

Hardik Juneja authored 8 years ago

This commit:

- Adds a new Activity called "SQLJoblib"
- Adds a Backend to be used by joblib
- Uses OOBTree to store results instead of ConflictFreeLog
- Adds a getResultDict API to fetch resut Dict

It uses the original work from rafael@nexedi.com and loic.esteve@inria.fr

d2c88bd6

19 Feb, 2018 1 commit
- ActivityTool: Reuse existing import instead of CMFCore's underware. · 994ba134
  Vincent Pelletier authored 7 years ago
  
  994ba134
29 Jun, 2017 1 commit
- CMFActivity: call setSite() in invokeGroup(). · 816473bb
  Kazuhiko Shiozaki authored 7 years ago
```
otherwise Localizer will not work in grouped activity.
```
  816473bb
21 Apr, 2017 1 commit
- CMFActivity: Only log "queuing message" when activity was actually queued. · 1efa5d54
  Vincent Pelletier authored 7 years ago
```
This makes a difference for SQLDict as not all messages are accepted for
insertion.
```
  1efa5d54
23 Dec, 2016 1 commit
- use getIdOrUserName() instead of __str__(). · fe773036
  Vincent Pelletier authored 8 years ago
  
  fe773036
12 Jan, 2016 3 commits
- Add more security declarations. · f84e2f62
  Kazuhiko Shiozaki authored 9 years ago
  
  f84e2f62
- manage_afterAdd, manage_afterClone and manage_beforeDelete should be private. · fd1a7c2b
  Kazuhiko Shiozaki authored 11 years ago
  
  fd1a7c2b
- fix wrong permission in CMFActivity. · ea12774e
  Kazuhiko Shiozaki authored 11 years ago
  
  ea12774e
28 Oct, 2015 1 commit
- cleanup: __init__() should return None. · 7950d24e
  Kazuhiko Shiozaki authored 9 years ago
  
  7950d24e
19 May, 2015 2 commits
- CMFActivity: make script with proxy roles usable as a grouping method · ce5f08fd
  Julien Muchembled authored 9 years ago
  
  ce5f08fd
- CMFActivity: fix use of grouping methods from restricted Python · 89185deb
  Julien Muchembled authored 9 years ago
  
  89185deb
15 May, 2015 1 commit

fixup! CMFActivity: new invokeGroup API · 1aa70bbf

Julien Muchembled authored 9 years ago

For the new GroupedMessage class in commit
da234001, I renamed 'obj' to 'object'
at the last minute and I missed 2 occurrences.

1aa70bbf

13 May, 2015 2 commits
- CMFActivity: make the dummy group method run the grouped message with correct user · 8d764c23
  Julien Muchembled authored 9 years ago
  
  8d764c23
- CMFActivity: new invokeGroup API · da234001
  Julien Muchembled authored 9 years ago
```
The recent API change was not enough.
A grouping method may need more information: in particular, the dummy grouping
method must be fixed to change user.
```
  da234001
06 May, 2015 1 commit

CMFActivity: slightly delay non-executed grouped messages · c85a840f

Julien Muchembled authored 9 years ago

When grouped messages fail, ActivityTool must distinguish 3 groups,
in order to reexecute them separately, as follows:
- first, those that succeeded
- then, those that were skipped
- at last, failed ones

Grouping methods are updated to handle partial failures, and stop doing
anything when something goes wrong.

Without this, we would have the following pathological cases.

1. Let's suppose first that skipped messages are marked as succeeded.

The problem is that each skipped message that will fail causes the reexecution
of those that didn't fail.

Exemple: A:ok B:ok C:err D:err E:err F:err
  1: A:ok, B:ok, C:err, D:skipped, E:skipped, F:skipped
  2: A:ok, B:ok, D:err, E:skipped, F:skipped
  3: A:ok, B:ok, E:err, F:skipped
  4: A:ok, B:ok, F:err
  5: A:ok, B:ok -> commit

And worst, the first failed (C) may be processable again before 5, entering
a failing loop if it is executed again in the same group as A & B.

2. Another implementation is to mar...

c85a840f

30 Mar, 2015 2 commits
- CMFActivity: revert debugging change committed by mistake · dfb051cd
  Julien Muchembled authored 9 years ago
  
  dfb051cd
- CMFActivity: drop useless 'order_validation_text' column from 'message' table · eca363aa
  Julien Muchembled authored 9 years ago
```
For even more refactoring between SQLDict & SQLQueue,
which now uses SQL tables with the same schema.
```
  eca363aa
27 Mar, 2015 1 commit

CMFActivity: automatic migration of queues and removal of button to recreate tables · 3d644bde

Julien Muchembled authored 9 years ago

The action to recreate activity tables while preserving existing messages
was unsafe for 2 reasons:
- if any error happened, messages could be lost
- it relied on Message.reactivate

Which this patch, any instance created after commit d881edd1 (Aug 2010) will
upgrade successfully. For older instances, make sure you have no activity left.

For cases where 'ALTER TABLE' would not work, a better way to implement repair
functionality would be:
- one action to backup all messages in ZODB
- and another to restore them
And maybe a security so that during the backup-clear-restore sequence,
activities can't be created nor processed.

If any column is added in the future, it would still be possible to write code
that fills them by inspecting messages.

3d644bde

14 Oct, 2014 1 commit
- Revert "add erp5_sql_active_process business template." · 5fbe2885
  Kazuhiko Shiozaki authored 10 years ago
```
This reverts commit 6a8987fb.
```
  5fbe2885
08 Oct, 2014 1 commit
- add erp5_sql_active_process business template. · 6a8987fb
  Kazuhiko Shiozaki authored 10 years ago
  
  6a8987fb
04 Sep, 2014 1 commit
- Remove trailing whitespaces · 41f9cd88
  Gabriel Monnerat authored 10 years ago
  
  41f9cd88
29 Aug, 2014 1 commit
- CMFActivity: Provide automatic migration to named node. · 23d660e1
  Vincent Pelletier authored 10 years ago
```
Assumes nodes will not be named after other node's network address.
```
  23d660e1