Commits · 4b8146f1b3f5d46c6676f69c520dcef2b5496e20 · Kirill Smelkov / bcc

19 Aug, 2017 1 commit
- Update obsolete file name bitesize -> bitehist · 4b8146f1
  Tian Xin authored Aug 18, 2017
```
Update tutorial to change the file name.
```
  4b8146f1
17 Aug, 2017 3 commits

Fixes for debian and ubuntu docker build · eb819caf

Pavel Safronov authored Aug 17, 2017

* Fixed build for debian and ubuntu
* Bumped debian and ubuntu versions (fix some build-dependency issues)
* Make debian and ubuntu Dockerfiles use the same build script
* Build-dependencies now installing automatically via pbuilder

eb819caf

Merge pull request #1294 from iovisor/yhs_dev · 9de830ae
4ast authored Aug 17, 2017
```
avoid large map memory allocation in userspace
```
9de830ae

avoid large map memory allocation in userspace · 067219b2

Yonghong Song authored Aug 16, 2017

In bcc, internal BPF_F_TABLE defines a structure to
contain all the table information for later easy
extraction. A global structure will be defined
with this type. Note that this structure will be
allocated by LLVM during compilation.

In the table structure, one of field is:
   _leaf_type data[_max_entries]

If the _leaf_type and _max_entries are big,
significant memory will be consumed. A big
_leaf_type size example is for BPF_STACK_TRACE map
with 127*8=1016 bytes. If max_entries is bigger
as well, significant amount of memory will be
consumed by LLVM.

This patch replaces
  _leaf_type data[_max_entries]
to
  unsigned ing max_entries

The detail of a test example can be found in issue #1291.
For the example in #1291, without this patch, for a
BPF_STACK_TRACE map with 1M entries, the RSS is roughly
3GB (roughly 3KB per entry). With this patch, it is 5.8MB.
Signed-off-by: Yonghong Song <yhs@fb.com>

067219b2

16 Aug, 2017 5 commits

Merge pull request #1293 from iovisor/tools_fixes · 47305f47
Brendan Gregg authored Aug 16, 2017
```
Couple minor tools fixes
```
47305f47
python - set attach_xdp's default flag value to 0 · 5e84698e
Gabriel Ganne authored Aug 16, 2017
```
Signed-off-by: Gabriel Ganne <gabriel.ganne@enea.com>
```
5e84698e

have uniform uprobe event names for python and C++ · e48f7c96

Yonghong Song authored Jul 28, 2017

This is a followup change for previous commit 0ba15075
(permit multiple pids attaching to the same probe).
That commit changes the event name for python uprobe API
in order to permit multiple processes attaching to the
same uprobe point. The C++ uprobe event name remains
unchanged and thus a descrepancy.

This patch add changes to C++ side and also make python uretprobe
having the same naming convention.

Originally I experimented to put the common code to generate
event names in libbpf.c. But doing this seems more klunky
than simplicity of C++ and Python. So I stick to the current
design.

No need to add pid to kprobe event names as kprobe bpf invocation
will ignore pid anyway.
Signed-off-by: Yonghong Song <yhs@fb.com>

e48f7c96

Remove extra S_MAXSTAT array allocation in some tools · d51870b9
Brenden Blanco authored Aug 16, 2017
```
Fixes: #1280
Signed-off-by: Brenden Blanco <bblanco@gmail.com>
```
d51870b9
Omit include of ptrace.h for empty usdt contexts · 4cb1edb6
Brenden Blanco authored Aug 16, 2017
```
Fixes: #1280
Signed-off-by: Brenden Blanco <bblanco@gmail.com>
```
4cb1edb6

15 Aug, 2017 1 commit

Add clang check for -nopie option · a2afe8d3

Brenden Blanco authored Aug 11, 2017

The -no-pie option is not available in clang until 6.0. Per the
documentation, -nopie serves the same purpose as -no-pie (but not
-fno-pie). Hence, use the -nopie option when clang is in use as the CC.

Fixes: #1237
Signed-off-by: Brenden Blanco <bblanco@gmail.com>

a2afe8d3

11 Aug, 2017 10 commits
- Correct commit id for BPF_FUNC_get_socket_cookie · 8e26b491
  Paul Chaignon authored Aug 06, 2017
  
  8e26b491
- tools/tcptracer: print ns timestamps with -v · fff792df
  Iago López Galeiras authored Aug 04, 2017
  
  fff792df
- tools/tcptracer: add timestamp option · 6426cd4e
  Iago López Galeiras authored Aug 04, 2017
```
Similar to other tools from the tcp* family.
```
  6426cd4e
- Since LUA_GLOBALSINDEX is obsolete from Lua 5.2, use lua_getglobal function · 10a4af70
  Nan Xiao authored Aug 02, 2017
```
instead.
```
  10a4af70
- better state default value handling · 4870819a
  Brendan Gregg authored Aug 03, 2017
  
  4870819a
- add --state to offcputime · 843340d6
  Brendan Gregg authored Jul 31, 2017
  
  843340d6
- explain that 'args' can be used as 'ctx' in tracepoints · b9f42186
  Jean-Tiare Le Bigot authored Jul 30, 2017
  
  b9f42186
- make it easier to discover the reference documentation · c38cb7ac
  Jean-Tiare Le Bigot authored Jul 30, 2017
  
  c38cb7ac
- tcptop: Filter out negative values in receive probe · 81ad054a
  Benjamin Poirier authored Jul 28, 2017
```
When tcp_cleanup_rbuf() is called from tcp_recvmsg(), "copied" may be an
error code, especially -EAGAIN for non-blocking receives.
```
  81ad054a
- tcptop: Cleanup argument parsing · 8e86b9e9
  Benjamin Poirier authored Jul 27, 2017
  
  8e86b9e9
01 Aug, 2017 2 commits

Use unsigned conversion specifier for nlmsg_pid · 890c76ab

Toshiaki Makita authored Jul 31, 2017

nlmsg_pid is __u32, so let's use %u instead of %d.
Signed-off-by: Toshiaki Makita <makita.toshiaki@lab.ntt.co.jp>

890c76ab

Fix wrong netlink port id check · bb9b92ab

Toshiaki Makita authored Jul 31, 2017

As per man netlink, nlmsg_pid is not process id and in fact a value
different from process id can be used.

  bpf: Wrong pid -1615084642, expected 24407

This problem can be triggered by using pyroute2 with bcc.
Signed-off-by: Toshiaki Makita <makita.toshiaki@lab.ntt.co.jp>

bb9b92ab

31 Jul, 2017 2 commits
- Remove redundant semicolon · 861102b4
  Nan Xiao authored Jul 31, 2017
  
  861102b4
- 1. Use more safe snprintf instead of sprintf; · 0379779d
  Nan Xiao authored Jul 29, 2017
```
2. Modify procfilename buffer length in bcc_procutils_language function.
```
  0379779d
28 Jul, 2017 1 commit

Put libfl-dev into Build-Depends on Debian · a5ea40a8

Ivan Babrou authored Jul 28, 2017

On Debian Stretch `libfl-dev` is not installed automatically
if you install `flex`, this takjes care of the issue.

a5ea40a8

27 Jul, 2017 2 commits

Merge pull request #1263 from iovisor/yhs_dev · 2cc96a8c
Brenden Blanco authored Jul 27, 2017
```
permit multiple pids attaching to the same probe
```
2cc96a8c

permit multiple pids attaching to the same probe · 0ba15075

Yonghong Song authored Jul 21, 2017

Currently, if more than one pid-associated USDT attaching to
the same probe, usdt readarg code will be generated twice and
the compiler will complain.

This patch solves issue by preventing code duplication if
a previous context with the same mnt point and exec binary
has generated the code for the same probe. The event name is
also changed to have pid embedded so different pid-associated
uprobe event will have different names.

This patch introduces an internal uprobe event name
discrepency. It is a good idea to have event name
generation in libbpf so that both C++ API and Python API
will have consistent name conventions. This will be
addressed in a subsequent commit as it is largely
a different issue.
Signed-off-by: Yonghong Song <yhs@fb.com>

0ba15075

19 Jul, 2017 1 commit
- Merge pull request #1261 from iovisor/yhs_dev · b58a0d7e
  4ast authored Jul 18, 2017
```
generate proper usdt code to prevent llvm meddling with ctx->#fields
```
  b58a0d7e
18 Jul, 2017 4 commits

generate proper usdt code to prevent llvm meddling with ctx->#fields · 8206f547

Yonghong Song authored Jul 18, 2017

Qin reported a test case where llvm still messes up with ctx->#fields.
For code like below:
  switch(ctx->ip) {
    case 0x7fdf2ede9820ULL: *((int64_t *)dest) = *(volatile int64_t *)&ctx->r12; return 0;
    case 0x7fdf2edecd9cULL: *((int64_t *)dest) = *(volatile int64_t *)&ctx->bx; return 0;
  }
The compiler still generates:
    # r1 is the pointer to the ctx
    r1 += 24
    goto LBB0_4
  LBB0_3:
    r1 += 40
  LBB0_4:
    r3 = *(u64 *)(r1 + 0)
The verifier will reject the above code since the last load is not "ctx + field_offset"
format.

The responsible llvm optimization pass is CFGSimplifyPass. Its main implementation
in llvm/lib/Transforms/Utils/SimplifyCFG.cpp. The main routine to do the optimization
is SinkThenElseCodeToEnd. The routine canSinkInstructions is used to determine whether
an insn is a candidate for sinking.

Unfortunately, volatile load/store is not a condition to prevent the optimization.
But inline assembly is a condition which can prevent further optimization.

In this patch, instead of using volatile to annotate ctx->#field access, we do
normal ctx->#field access but put a compiler inline assembly memory barrier
   __asm__ __volatile__(\"\": : :\"memory\");
after the field access.

Tested with usdt unit test case, usdt_samples example, a couple of usdt unit tests
developed in the past.
Signed-off-by: Yonghong Song <yhs@fb.com>

8206f547

MySQL tracing without USDT (#1239) · 5f7035e4

Igor Mazur authored Jul 18, 2017

Support tracing MySQL queries even when MySQL is built
without USDT support, by using uprobes on internal functions
responsible for command (query) dispatching.

5f7035e4

Merge pull request #1259 from iovisor/yhs_dev · 87abe2a3
4ast authored Jul 18, 2017
```
Fix a clang memory leak
```
87abe2a3

Fix a clang memory leak · 6ed2229d

Yonghong Song authored Jul 17, 2017

In clang frontend actions, several compiler invocations are called
for rewriter and transforming source code to IR. During the invocation
to transform source code to IR, CodeGenOpts.DisableFree is used
to control whether the top target machine structure should be
freed or not for a particular clang invocation,
and its default value is TRUE.

See clang:lib/CodeGen/BackendUtil.cpp:
  ~EmitAssemblyHelper() {
    if (CodeGenOpts.DisableFree)
      BuryPointer(std::move(TM));
  }

So by default, the memory held by TM will not freed, even if
BPF module itself is freed. This is even more problematic
when continuous building/loading/unloading happens for long
live service.

This patch explicitly sets CodeGenOpts.DisableFree to FALSE
so memory can be properly freed. I did a simple experiment
to compile/load/unload an empty BPF program and the saving
is roughly 0.5MB.
Signed-off-by: Yonghong Song <yhs@fb.com>

6ed2229d

17 Jul, 2017 2 commits
- Merge pull request #1258 from shodoco/tcpbpf · f4727f43
  Brenden Blanco authored Jul 17, 2017
```
Update bpf.h and virtual_bpf.h to 4.13-rc1
```
  f4727f43
- Update bpf.h and virtual_bpf.h to 4.13-rc1 · e473a20e
  Huapeng Zhou authored Jul 17, 2017
  
  e473a20e
14 Jul, 2017 2 commits

tools: Add some documentation to lib/ucalls.py output (#1257) · 830c1f76
Geneviève Bastien authored Jul 14, 2017
```
Signed-off-by: Geneviève Bastien <gbastien@versatic.net>
```
830c1f76

Fix trace.py for library filenames containing colons (#1252) · 5a2b39e5

vkhromov authored Jul 14, 2017

`trace.py` parses a probe using the colon as a separator.  As a result, it
fails to create a uprobe for binary/library with a filename containing colons.

This diff fixes that issue with `trace.py`.  It requires a kernel with
https://lkml.org/lkml/2017/1/13/585 merged to work properly, otherwise
`trace.py` still fails for create uprobes.

5a2b39e5

11 Jul, 2017 3 commits

cc: Add open_perf_event to the C/C++ API (#1232) · 4180333c
Romain authored Jul 11, 2017

4180333c

memleak: expand allocator coverage (#1214) · 2c1799c9

Rinat Ibragimov authored Jul 11, 2017

* memleak: handle libc allocation functions other than malloc

* memleak: use tracepoints to track kernel allocations

* memleak: add combined-only mode

With large number of outstanding allocations, amount of data passed from
kernel becomes large, which slows everything down.

This patch calculates allocation statistics inside kernel, allowing user-
space part to pull combined statistics data only, thus significantly
reducing amount of passed data.

* memleak: increase hashtable capacities

There are a lot of allocations happen in kernel. Default values are not
enough to keep up.

* test: add a test for the memleak tool

2c1799c9

Add USDT sample (#1229) · b4691fba

bveldhoen authored Jul 11, 2017

This sample contains:
    - A library with an operation that uses usdt probes.
    - A console application that calls the operation.
    - Scripts to trace the latency of the operation.
    - Corresponding cmake files.

b4691fba

07 Jul, 2017 1 commit
- docs: update features list · dc777af4
  Quentin Monnet authored Jul 07, 2017
```
And fix a typo: direct_action -> direct-action
```
  dc777af4