Commits · a81baf75723a7552e5bc511fff33c2b7e816cd1a · Yusei Tahara / wendelin

29 Jun, 2015 4 commits

erp5_wendelin/test/test_01_IngestionFromFluentd: Verify resulting array in full explicitly · a81baf75

Kirill Smelkov authored Jun 29, 2015

Instead of verifying only min/max/len of the result, we can verify that the
result is explicitly the array we expect, especially that it is easier to do
and less lines with just appropriate arange() and array_equal().

/cc @Tyagov

a81baf75

erp5_wendelin/test/test_01_IngestionFromFluentd: Make sure we process all numbers in 'real_data' · 40acb891

Kirill Smelkov authored Jun 29, 2015

As explained in previous commit, real_data tail was ending without \n

    99988,99989\n99990,99991,99992,99993,99994,99995,99996,99997,99998,99999\n100000

and because DataStream_copyCSVToDataArray() processes data in full lines only,
the tail was lost.

Fix t by making sure the last line is always terminated properly with \n.

/cc @Tyagov

40acb891

erp5_wendelin/test/test_01_IngestionFromFluentd: Resulting zarray should be 100000 not 99999 · c3fa8672

Kirill Smelkov authored Jun 29, 2015

Consider this:

    In [1]: l = range(100000)

    In [2]: min(l)
    Out[2]: 0

    In [3]: max(l)
    Out[3]: 99999

    In [4]: len(l)
    Out[4]: 100000

so if we assert that zarray min=0 and max=99999 the length should be max+1
which is 100000.

NOTE the length is not 100001, as one would guess from test number sequence
created at the beginning of the test:

  def chunks(l, n):
    """Yield successive n-sized chunks from l."""
    for i in xrange(0, len(l), n):
      yield l[i:i+n]

  ...

    number_string_list = []
    for my_list in list(chunks(range(0, 100001), 10)):
      number_string_list.append(','.join([str(x) for x in my_list]))
    real_data = '\n'.join(number_string_list)

because processing code "eats" numbers till last \n and for 10001 last \n is located before 100000:

  99988,99989\n99990,99991,99992,99993,99994,99995,99996,99997,99998,99999\n100000

I will fix input data generation in the following patch.

/cc @Tyagov

c3fa8672

erp5_wendelin: Fix thinko in DataStream_copyCSVToDataArray() ZBigArray init · bf0adf22

Kirill Smelkov authored Jun 29, 2015

When we conditionally create new BigArray for appending data, we should create
it as empty, because in DataStream_copyCSVToDataArray() creation is done lazily
only when destination array is not yet initialized and we anyway append data to
the array in the following code block.

Creating BigArray with initial shape of appending part will result in
destination array being longer than neccessary by first-appended-chunk length
with this-way-introduced extra header reading as all zeros.

Fix it.

/cc @Tyagov

bf0adf22

26 Jun, 2015 1 commit
- Use better properties to match what is already used inside ERP5. · 88a2f8fe
  Ivan Tyagov authored Jun 26, 2015
```
source_section -> source
destination_section -> destination
```
  88a2f8fe
25 Jun, 2015 1 commit
- Not required dependency · 8a3cdaa0
  Ivan Tyagov authored Jun 25, 2015
  
  8a3cdaa0
24 Jun, 2015 3 commits
- Extend test. · 67fd7414
  Ivan Tyagov authored Jun 24, 2015
  
  67fd7414
- compensate possible offset mistmatch. Do not hide errors. · 8fcca25e
  Ivan Tyagov authored Jun 24, 2015
  
  8fcca25e
- Stop use \n character as ingestion delimiter, only use for .CSV format where... · 7313a789
  Ivan Tyagov authored Jun 24, 2015
```
Stop use \n character as ingestion delimiter, only use for .CSV format where it's part of the structure of a file.
```
  7313a789
22 Jun, 2015 4 commits
- Make test stand alone. · 9fa5088a
  Ivan Tyagov authored Jun 22, 2015
  
  9fa5088a
- No need of these anymore. · 34dc75cf
  Ivan Tyagov authored Jun 22, 2015
  
  34dc75cf
- Generate Dynamically Data Supply for test. · 70f1cb31
  Ivan Tyagov authored Jun 22, 2015
  
  70f1cb31
- Create generic OffsetIndex (derived from SortIndex). · 93fc04b6
  Ivan Tyagov authored Jun 18, 2015
  
  93fc04b6
10 Jun, 2015 1 commit
- No need to use this bt5 at all. · 53794eb2
  Ivan Tyagov authored Jun 10, 2015
  
  53794eb2
09 Jun, 2015 2 commits
- Dependency. · d4d6b049
  Ivan Tyagov authored Jun 09, 2015
  
  d4d6b049
- Dependency. · 1c436954
  Ivan Tyagov authored Jun 09, 2015
  
  1c436954
05 Jun, 2015 4 commits
- All modules should have unified columns as much as possible. · 919c7fe3
  Ivan Tyagov authored Jun 05, 2015
  
  919c7fe3
- Clean up iterate script implementation. No need of own array property type... · 56675e9e
  Ivan Tyagov authored Jun 05, 2015
```
Clean up iterate script implementation. No need of own array property type when we can use more generic object.
```
  56675e9e
- We can safely uncomment these lines now. · d1ab322d
  Ivan Tyagov authored Jun 05, 2015
  
  d1ab322d
- Add iteration script testing. · 8aaba7fc
  Ivan Tyagov authored Jun 05, 2015
```
Add new method that can copy CSV data to a Zbig Array.
```
  8aaba7fc
04 Jun, 2015 2 commits
- Enable testing as it's fixed in wendelin.core-dev egg · 0f31e777
  Ivan Tyagov authored Jun 04, 2015
  
  0f31e777
- Extend API and set default to None for an array. · 31376a6c
  Ivan Tyagov authored Jun 04, 2015
```
Add a new property of type array.
```
  31376a6c
02 Jun, 2015 6 commits
- Always pass destination array reference. · e9e24e46
  Ivan Tyagov authored Jun 02, 2015
```
Better end condition.
```
  e9e24e46
- try sample transformation · bc825ce5
  Ivan Tyagov authored Jun 02, 2015
  
  bc825ce5
- Clean up. · 01ba8bf0
  Ivan Tyagov authored Jun 02, 2015
  
  01ba8bf0
- Moved to slapos.git · 567bae97
  Ivan Tyagov authored Jun 02, 2015
  
  567bae97
- Forgotten argument. · d4610fca
  Ivan Tyagov authored Jun 02, 2015
  
  d4610fca
- Allow in API to pass reference of object (i.e. Data Array) where... · 96472d18
  Ivan Tyagov authored Jun 02, 2015
```
Allow in API to pass reference of object (i.e. Data Array) where transformation is expected to be stored.
```
  96472d18
29 May, 2015 3 commits
- More testing and API extension. · c850a14d
  Ivan Tyagov authored May 29, 2015
  
  c850a14d
- Add warning and forgotten property sheet. · e101ff9c
  Ivan Tyagov authored May 29, 2015
  
  e101ff9c
- Add a generic implementation of a script able to iterate effectively over a... · 3e76c012
  Ivan Tyagov authored May 29, 2015
```
Add a generic implementation of a script able to iterate effectively over a Data Stream and do transformation on data itself.
```
  3e76c012
28 May, 2015 2 commits
- Use our own form. · fe554209
  Ivan Tyagov authored May 28, 2015
  
  fe554209
- Add forgotten property sheet assignment. · 0b6e7af3
  Ivan Tyagov authored May 28, 2015
  
  0b6e7af3
27 May, 2015 3 commits

Pull in wendelin js-demo · 2c5f7d75
Kirill Smelkov authored May 27, 2015

2c5f7d75

First publication of Wendelin · 853f694c

Ivan Tyagov authored May 27, 2015

Wendelin is a Big Data platform based on ERP5.
More information can be found at http://www.wendelin.io/

Current repository contains following:

    * bt5/      - contains the generic ERP5 Business Templates needed to
                  setup Wendelin on top of ERP5
    * product/  - Wendelin file system product
    * slapos/   - SlapOs setup recipe
    * tests/    - test definitions for Wendelin platform

853f694c

Start of wendelin.git · 748d7cd9

Kirill Smelkov authored May 27, 2015

Wendelin is a Big Data platform based on ERP5.
More information can be found at http://www.wendelin.io/

748d7cd9

17 Apr, 2015 1 commit
- Explanatory note added. · 6c278cf6
  Ivan Tyagov authored Apr 17, 2015
  
  6c278cf6
30 Mar, 2015 2 commits
- First javascript version of a gadget based, single page approach of having an · 0fbd1f33
  Ivan Tyagov authored Mar 30, 2015
```
indexeddb storage locally at browser side.
```
  0fbd1f33
- first commit · 55bc1d6f
  Ivan Tyagov authored Mar 30, 2015
  
  55bc1d6f