[RFD] Efficient unit test and fuzz tools for kernel/libc porting

trinity.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

* [RFD] Efficient unit test and fuzz tools for kernel/libc porting
@ 2016-07-06  7:39 Zhangjian (Bamvor)
  2016-07-06  8:00 ` Dmitry Vyukov
                   ` (2 more replies)
  0 siblings, 3 replies; 8+ messages in thread
From: Zhangjian (Bamvor) @ 2016-07-06  7:39 UTC (permalink / raw)
  To: linux-kernel, linux-arch, libc-alpha, trinity, syzkaller,
	aponomarenko, Jess Hertz, Tim Newsham
  Cc: arnd, catalin.marinas, broonie, joseph, maxim.kuvyrkov, ynorov,
	pinskia, schwab, agraf, marcus.shawcroft, dingtianhong, guohanjun,
	cuibixuan, lijinyue, lizefan

HI,

When I working on the ILP32 ABI for ARMv8 in last two years, I has encountered lots of syscall issues such as wrong number of arguments, different data type in binary interface. I realized that the correctness of argument passing between the C library and core kernel code is a common problem when bringing up new architecture or ABI to kernel and libc. Existing fuzz testing tools such as trinity[1], syzkaller[2] and triforce[3] only generate random or boundary values for syscall parameters and then inject them into kernel, but those tools won't validate if the results of those syscalls are correct or not. Thus they can not act as an unit test for ILP32. In this year, considering the abi of ILP32 is changes during discussion, I am thinking if I could use some sort of automatically tools to c
 heck whether the wrapper is correct or not. After learn and compare some fuzz tools, I feel that there is no such fuzz tools could help me. So, I wrote a new fuzz tools base on the trinity and it 
found several wrapper issues in glibc. I will first explain the different with existing fuzz tools and paste my propsosal in the end.

Trinity is developed in a long time. It could randomize the parameter of syscall and run individual syscall standalone or parallel. When I do the long time parallel test(not for ILP32), it could report some bug, e.g. hang, panic. It is useful but it is indeed hard to debug because it usually fail after a long time running. I do not know what does it exactly do.

Compare with Trinity, syzkaller is quite different. Here is the comparision between syzkaller and our tools:
1.  Syzkaller could recursively randomize base date type in syscall which means it is possible generate more meaningfull syscall test. But it only test the syscall through syscall() function. It assume that the c library is correct and stable. But it is wrong if we are porting new abi(such as ILP32) or architecture to glibc and kernel. We need to take c library into account. This is what my tools could do.

2.  Syzkaller could generate the readable short testcases. Our tools could only test individual syscall and check the correctness of parameter and return value. I think it is enough for the unit test which tests syscall one by one.

3.  Syzkaller could do the coverage. Our tools could not. I think it is useful for me. I plan to add the coverage later.

In my ILP32 works, my tools reported several off_t endian issues in glibc. Our tools work like this:

Dump the function                                 Dump the function
prototype from                                    prototype from c
vmlinux from the                                  library from the
sys_call_table                                    given list(posix
array in kernel.                                  interfaces or user
        |                                          defined).
        |                                                 |
        |                                                 |
       \|/                                               \|/
        `                                                 `
Generate jprobe        Modity Trinity to          Generate struct
hook according to      support run syscall        fuzz generator
prototype which        syscall from c             from the prototype.
will recursively       libray instead             And add them of
print the syscall      syscall() function         to trinity. Trinity
value.                       |                    will recursively
        \                     |                    print the function
         \                    |                    parameter.
          \                   |                           /
           -----------------------------------------------
                              |
                             \|/
                              `
               Run the trinity each syscall once
               and compare the function parameter
               printed in kernel and userspace
               If inconsistent, print specific
               information, such endian issue,
               32<->64bit conversion issue and
               so on.

Tools of function dump and hook generator based on the abi-dumper[4]. Other functions base on trinity. Return value test is similar except generating the kretprobe hook instead jprobe hook.

There are some hacks and the original funtion of trinity may be broken. The main changes in trinity are as follows:
1.  Call syscall through c library via call_glibc_syscalls() instead of direct syscall via syscall().
2.  Add new file generate-struct.c including the missing data type mentioned in syscall. This file is generated by ./struct_extract.py with a little modification. It should be fully auto generated in future.
3.  Add more date types in fill_arg()(generate-args.c) and include/syscall.h.
4.  Modify the syscallentry struct in syscalls directory according to the newly added data types.
5.  Add or Change some output message for script.
6.  Add jprobe hooks in modules directory. Such hook will be inserted before trinity test and removed after test.

Step to generate generate-struct.c and jprobe hooks:
1.  Compile the kernel/libc/binary which include the functions you want to generate.
2.  Dump function and struct information through modified abi-dumper.pl, named them as symbolinfo and typeinfo. You may need add "--all" and/or "--dump-static" depends on your binaries.
3.  Generate the file through trinity/scripts/struct_extract.py.

We also submit a proposal to linuxcon europe last month. Hope we could get a chance to share our works face to face.

The patches could be found at [5] and [6]. It works but ugly. We hope we could get some early response, such as if community is interested in this tools, whether this tools could merge into existence fuzz tools.

Thanks in advance.

Best wishes

Bamvor

[1] https://github.com/kernelslacker/trinity
[2] https://github.com/google/syzkaller
[3] https://github.com/nccgroup/TriforceLinuxSyscallFuzzer
     https://github.com/nccgroup/TriforceAFL
[4] https://github.com/lvc/abi-dumper.git
[5] https://github.com/bjzhang/trinity/tree/syscall_unittest
[6] https://github.com/bjzhang/abi-dumper/tree/json_output

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [RFD] Efficient unit test and fuzz tools for kernel/libc porting
  2016-07-06  7:39 [RFD] Efficient unit test and fuzz tools for kernel/libc porting Zhangjian (Bamvor)
@ 2016-07-06  8:00 ` Dmitry Vyukov
  2016-07-06  8:24   ` Zhangjian (Bamvor)
  2016-07-06  8:00 ` Zhangjian (Bamvor)
  2016-07-20 15:47 ` Joseph Myers
  2 siblings, 1 reply; 8+ messages in thread
From: Dmitry Vyukov @ 2016-07-06  8:00 UTC (permalink / raw)
  To: syzkaller
  Cc: LKML, linux-arch, libc-alpha, trinity, aponomarenko, Jess Hertz,
	Tim Newsham, arnd, Catalin Marinas, Mark Brown, joseph,
	maxim.kuvyrkov, Yury Norov, pinskia, schwab, agraf,
	marcus.shawcroft, Ding Tianhong, guohanjun, cuibixuan, lijinyue,
	Zefan Li

On Wed, Jul 6, 2016 at 9:39 AM, Zhangjian (Bamvor)
<bamvor.zhangjian@huawei.com> wrote:
> HI,
>
> When I working on the ILP32 ABI for ARMv8 in last two years, I has
> encountered lots of syscall issues such as wrong number of arguments,
> different data type in binary interface. I realized that the correctness of
> argument passing between the C library and core kernel code is a common
> problem when bringing up new architecture or ABI to kernel and libc.
> Existing fuzz testing tools such as trinity[1], syzkaller[2] and triforce[3]
> only generate random or boundary values for syscall parameters and then
> inject them into kernel, but those tools won't validate if the results of
> those syscalls are correct or not. Thus they can not act as an unit test for
> ILP32. In this year, considering the abi of ILP32 is changes during
> discussion, I am thinking if I could use some sort of automatically tools to
> check whether the wrapper is correct or not. After learn and compare some
> fuzz tools, I feel that there is no such fuzz tools could help me. So, I
> wrote a new fuzz tools base on the trinity and it found several wrapper
> issues in glibc. I will first explain the different with existing fuzz tools
> and paste my propsosal in the end.
>
> Trinity is developed in a long time. It could randomize the parameter of
> syscall and run individual syscall standalone or parallel. When I do the
> long time parallel test(not for ILP32), it could report some bug, e.g. hang,
> panic. It is useful but it is indeed hard to debug because it usually fail
> after a long time running. I do not know what does it exactly do.
>
> Compare with Trinity, syzkaller is quite different. Here is the comparision
> between syzkaller and our tools:
> 1.  Syzkaller could recursively randomize base date type in syscall which
> means it is possible generate more meaningfull syscall test. But it only
> test the syscall through syscall() function. It assume that the c library is
> correct and stable. But it is wrong if we are porting new abi(such as ILP32)
> or architecture to glibc and kernel. We need to take c library into account.
> This is what my tools could do.
>
> 2.  Syzkaller could generate the readable short testcases. Our tools could
> only test individual syscall and check the correctness of parameter and
> return value. I think it is enough for the unit test which tests syscall one
> by one.
>
> 3.  Syzkaller could do the coverage. Our tools could not. I think it is
> useful for me. I plan to add the coverage later.
>
> In my ILP32 works, my tools reported several off_t endian issues in glibc.
> Our tools work like this:
>
> Dump the function                                 Dump the function
> prototype from                                    prototype from c
> vmlinux from the                                  library from the
> sys_call_table                                    given list(posix
> array in kernel.                                  interfaces or user
>        |                                          defined).
>        |                                                 |
>        |                                                 |
>       \|/                                               \|/
>        `                                                 `
> Generate jprobe        Modity Trinity to          Generate struct
> hook according to      support run syscall        fuzz generator
> prototype which        syscall from c             from the prototype.
> will recursively       libray instead             And add them of
> print the syscall      syscall() function         to trinity. Trinity
> value.                       |                    will recursively
>        \                     |                    print the function
>         \                    |                    parameter.
>          \                   |                           /
>           -----------------------------------------------
>                              |
>                             \|/
>                              `
>               Run the trinity each syscall once
>               and compare the function parameter
>               printed in kernel and userspace
>               If inconsistent, print specific
>               information, such endian issue,
>               32<->64bit conversion issue and
>               so on.
>
> Tools of function dump and hook generator based on the abi-dumper[4]. Other
> functions base on trinity. Return value test is similar except generating
> the kretprobe hook instead jprobe hook.
>
> There are some hacks and the original funtion of trinity may be broken. The
> main changes in trinity are as follows:
> 1.  Call syscall through c library via call_glibc_syscalls() instead of
> direct syscall via syscall().
> 2.  Add new file generate-struct.c including the missing data type mentioned
> in syscall. This file is generated by ./struct_extract.py with a little
> modification. It should be fully auto generated in future.
> 3.  Add more date types in fill_arg()(generate-args.c) and
> include/syscall.h.
> 4.  Modify the syscallentry struct in syscalls directory according to the
> newly added data types.
> 5.  Add or Change some output message for script.
> 6.  Add jprobe hooks in modules directory. Such hook will be inserted before
> trinity test and removed after test.
>
> Step to generate generate-struct.c and jprobe hooks:
> 1.  Compile the kernel/libc/binary which include the functions you want to
> generate.
> 2.  Dump function and struct information through modified abi-dumper.pl,
> named them as symbolinfo and typeinfo. You may need add "--all" and/or
> "--dump-static" depends on your binaries.
> 3.  Generate the file through trinity/scripts/struct_extract.py.
>
> We also submit a proposal to linuxcon europe last month. Hope we could get a
> chance to share our works face to face.
>
> The patches could be found at [5] and [6]. It works but ugly. We hope we
> could get some early response, such as if community is interested in this
> tools, whether this tools could merge into existence fuzz tools.


Hi Bamvor,

Nice work!

Coverage should be easy to do with CONFIG_KCOV, but do you need
fuzzing/coverage? It seems that testing a predefined set of special
values for each arg should be enough for your use case. Namely special
values that can detect endianess/truncation/sign extension/etc issues.

I think there is also a number of glibc functions that don't directly
map to syscalls. Most notably wrappers around various ioctl's (e.g.
ptsname). Do you test them?

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [RFD] Efficient unit test and fuzz tools for kernel/libc porting
  2016-07-06  7:39 [RFD] Efficient unit test and fuzz tools for kernel/libc porting Zhangjian (Bamvor)
  2016-07-06  8:00 ` Dmitry Vyukov
@ 2016-07-06  8:00 ` Zhangjian (Bamvor)
  2016-07-20 15:47 ` Joseph Myers
  2 siblings, 0 replies; 8+ messages in thread
From: Zhangjian (Bamvor) @ 2016-07-06  8:00 UTC (permalink / raw)
  To: LKML, Linux-Arch, libc-alpha, trinity, syzkaller, aponomarenko,
	Jess Hertz, Tim Newsham
  Cc: Arnd Bergmann, Catalin Marinas, Mark Brown, Joseph S. Myers,
	Maxim Kuvyrkov, Yury Norov, Andrew Pinski, Andreas Schwab,
	Alexander Graf, marcus.shawcroft, Ding Tianhong, Hanjun Guo,
	cuibixuan, lijinyue, Zefan Li

Fix the wrong email address for Arnd. Sorry for the noise.

On 2016/7/6 15:39, Zhangjian (Bamvor) wrote:
> HI,
>
> When I working on the ILP32 ABI for ARMv8 in last two years, I has encountered lots of syscall issues such as wrong number of arguments, different data type in binary interface. I realized that the correctness of argument passing between the C library and core kernel code is a common problem when bringing up new architecture or ABI to kernel and libc. Existing fuzz testing tools such as trinity[1], syzkaller[2] and triforce[3] only generate random or boundary values for syscall parameters and then inject them into kernel, but those tools won't validate if the results of those syscalls are correct or not. Thus they can not act as an unit test for ILP32. In this year, considering the abi of ILP32 is changes during discussion, I am thinking if I could use some sort of automatically tools to
  check whether the wrapper is correct or not. After learn and compare some fuzz tools, I feel that there is no such fuzz tools could help me. So, I wrote a new fuzz tools base on the trinity a
> nd it found several wrapper issues in glibc. I will first explain the different with existing fuzz tools and paste my propsosal in the end.
>
> Trinity is developed in a long time. It could randomize the parameter of syscall and run individual syscall standalone or parallel. When I do the long time parallel test(not for ILP32), it could report some bug, e.g. hang, panic. It is useful but it is indeed hard to debug because it usually fail after a long time running. I do not know what does it exactly do.
>
> Compare with Trinity, syzkaller is quite different. Here is the comparision between syzkaller and our tools:
> 1.  Syzkaller could recursively randomize base date type in syscall which means it is possible generate more meaningfull syscall test. But it only test the syscall through syscall() function. It assume that the c library is correct and stable. But it is wrong if we are porting new abi(such as ILP32) or architecture to glibc and kernel. We need to take c library into account. This is what my tools could do.
>
> 2.  Syzkaller could generate the readable short testcases. Our tools could only test individual syscall and check the correctness of parameter and return value. I think it is enough for the unit test which tests syscall one by one.
>
> 3.  Syzkaller could do the coverage. Our tools could not. I think it is useful for me. I plan to add the coverage later.
>
> In my ILP32 works, my tools reported several off_t endian issues in glibc. Our tools work like this:
>
> Dump the function                                 Dump the function
> prototype from                                    prototype from c
> vmlinux from the                                  library from the
> sys_call_table                                    given list(posix
> array in kernel.                                  interfaces or user
>         |                                          defined).
>         |                                                 |
>         |                                                 |
>        \|/                                               \|/
>         `                                                 `
> Generate jprobe        Modity Trinity to          Generate struct
> hook according to      support run syscall        fuzz generator
> prototype which        syscall from c             from the prototype.
> will recursively       libray instead             And add them of
> print the syscall      syscall() function         to trinity. Trinity
> value.                       |                    will recursively
>         \                     |                    print the function
>          \                    |                    parameter.
>           \                   |                           /
>            -----------------------------------------------
>                               |
>                              \|/
>                               `
>                Run the trinity each syscall once
>                and compare the function parameter
>                printed in kernel and userspace
>                If inconsistent, print specific
>                information, such endian issue,
>                32<->64bit conversion issue and
>                so on.
>
> Tools of function dump and hook generator based on the abi-dumper[4]. Other functions base on trinity. Return value test is similar except generating the kretprobe hook instead jprobe hook.
>
> There are some hacks and the original funtion of trinity may be broken. The main changes in trinity are as follows:
> 1.  Call syscall through c library via call_glibc_syscalls() instead of direct syscall via syscall().
> 2.  Add new file generate-struct.c including the missing data type mentioned in syscall. This file is generated by ./struct_extract.py with a little modification. It should be fully auto generated in future.
> 3.  Add more date types in fill_arg()(generate-args.c) and include/syscall.h.
> 4.  Modify the syscallentry struct in syscalls directory according to the newly added data types.
> 5.  Add or Change some output message for script.
> 6.  Add jprobe hooks in modules directory. Such hook will be inserted before trinity test and removed after test.
>
> Step to generate generate-struct.c and jprobe hooks:
> 1.  Compile the kernel/libc/binary which include the functions you want to generate.
> 2.  Dump function and struct information through modified abi-dumper.pl, named them as symbolinfo and typeinfo. You may need add "--all" and/or "--dump-static" depends on your binaries.
> 3.  Generate the file through trinity/scripts/struct_extract.py.
>
> We also submit a proposal to linuxcon europe last month. Hope we could get a chance to share our works face to face.
>
> The patches could be found at [5] and [6]. It works but ugly. We hope we could get some early response, such as if community is interested in this tools, whether this tools could merge into existence fuzz tools.
>
> Thanks in advance.
>
> Best wishes
>
> Bamvor
>
> [1] https://github.com/kernelslacker/trinity
> [2] https://github.com/google/syzkaller
> [3] https://github.com/nccgroup/TriforceLinuxSyscallFuzzer
>      https://github.com/nccgroup/TriforceAFL
> [4] https://github.com/lvc/abi-dumper.git
> [5] https://github.com/bjzhang/trinity/tree/syscall_unittest
> [6] https://github.com/bjzhang/abi-dumper/tree/json_output
>
> --
> To unsubscribe from this list: send the line "unsubscribe linux-arch" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html


^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [RFD] Efficient unit test and fuzz tools for kernel/libc porting
  2016-07-06  8:00 ` Dmitry Vyukov
@ 2016-07-06  8:24   ` Zhangjian (Bamvor)
  2016-07-06  9:09     ` Dmitry Vyukov
  0 siblings, 1 reply; 8+ messages in thread
From: Zhangjian (Bamvor) @ 2016-07-06  8:24 UTC (permalink / raw)
  To: Dmitry Vyukov, syzkaller
  Cc: LKML, linux-arch, libc-alpha, trinity, aponomarenko, Jess Hertz,
	Tim Newsham, Arnd Bergmann, Catalin Marinas, Mark Brown, joseph,
	maxim.kuvyrkov, Yury Norov, pinskia, schwab, agraf,
	marcus.shawcroft, Ding Tianhong, guohanjun, cuibixuan, lijinyue,
	Zefan Li

Hi, Dmitry

On 2016/7/6 16:00, Dmitry Vyukov wrote:
> On Wed, Jul 6, 2016 at 9:39 AM, Zhangjian (Bamvor)
> <bamvor.zhangjian@huawei.com> wrote:
>> HI,
>>
>> When I working on the ILP32 ABI for ARMv8 in last two years, I has
>> encountered lots of syscall issues such as wrong number of arguments,
>> different data type in binary interface. I realized that the correctness of
>> argument passing between the C library and core kernel code is a common
>> problem when bringing up new architecture or ABI to kernel and libc.
>> Existing fuzz testing tools such as trinity[1], syzkaller[2] and triforce[3]
>> only generate random or boundary values for syscall parameters and then
>> inject them into kernel, but those tools won't validate if the results of
>> those syscalls are correct or not. Thus they can not act as an unit test for
>> ILP32. In this year, considering the abi of ILP32 is changes during
>> discussion, I am thinking if I could use some sort of automatically tools to
>> check whether the wrapper is correct or not. After learn and compare some
>> fuzz tools, I feel that there is no such fuzz tools could help me. So, I
>> wrote a new fuzz tools base on the trinity and it found several wrapper
>> issues in glibc. I will first explain the different with existing fuzz tools
>> and paste my propsosal in the end.
>>
>> Trinity is developed in a long time. It could randomize the parameter of
>> syscall and run individual syscall standalone or parallel. When I do the
>> long time parallel test(not for ILP32), it could report some bug, e.g. hang,
>> panic. It is useful but it is indeed hard to debug because it usually fail
>> after a long time running. I do not know what does it exactly do.
>>
>> Compare with Trinity, syzkaller is quite different. Here is the comparision
>> between syzkaller and our tools:
>> 1.  Syzkaller could recursively randomize base date type in syscall which
>> means it is possible generate more meaningfull syscall test. But it only
>> test the syscall through syscall() function. It assume that the c library is
>> correct and stable. But it is wrong if we are porting new abi(such as ILP32)
>> or architecture to glibc and kernel. We need to take c library into account.
>> This is what my tools could do.
>>
>> 2.  Syzkaller could generate the readable short testcases. Our tools could
>> only test individual syscall and check the correctness of parameter and
>> return value. I think it is enough for the unit test which tests syscall one
>> by one.
>>
>> 3.  Syzkaller could do the coverage. Our tools could not. I think it is
>> useful for me. I plan to add the coverage later.
>>
>> In my ILP32 works, my tools reported several off_t endian issues in glibc.
>> Our tools work like this:
>>
>> Dump the function                                 Dump the function
>> prototype from                                    prototype from c
>> vmlinux from the                                  library from the
>> sys_call_table                                    given list(posix
>> array in kernel.                                  interfaces or user
>>         |                                          defined).
>>         |                                                 |
>>         |                                                 |
>>        \|/                                               \|/
>>         `                                                 `
>> Generate jprobe        Modity Trinity to          Generate struct
>> hook according to      support run syscall        fuzz generator
>> prototype which        syscall from c             from the prototype.
>> will recursively       libray instead             And add them of
>> print the syscall      syscall() function         to trinity. Trinity
>> value.                       |                    will recursively
>>         \                     |                    print the function
>>          \                    |                    parameter.
>>           \                   |                           /
>>            -----------------------------------------------
>>                               |
>>                              \|/
>>                               `
>>                Run the trinity each syscall once
>>                and compare the function parameter
>>                printed in kernel and userspace
>>                If inconsistent, print specific
>>                information, such endian issue,
>>                32<->64bit conversion issue and
>>                so on.
>>
>> Tools of function dump and hook generator based on the abi-dumper[4]. Other
>> functions base on trinity. Return value test is similar except generating
>> the kretprobe hook instead jprobe hook.
>>
>> There are some hacks and the original funtion of trinity may be broken. The
>> main changes in trinity are as follows:
>> 1.  Call syscall through c library via call_glibc_syscalls() instead of
>> direct syscall via syscall().
>> 2.  Add new file generate-struct.c including the missing data type mentioned
>> in syscall. This file is generated by ./struct_extract.py with a little
>> modification. It should be fully auto generated in future.
>> 3.  Add more date types in fill_arg()(generate-args.c) and
>> include/syscall.h.
>> 4.  Modify the syscallentry struct in syscalls directory according to the
>> newly added data types.
>> 5.  Add or Change some output message for script.
>> 6.  Add jprobe hooks in modules directory. Such hook will be inserted before
>> trinity test and removed after test.
>>
>> Step to generate generate-struct.c and jprobe hooks:
>> 1.  Compile the kernel/libc/binary which include the functions you want to
>> generate.
>> 2.  Dump function and struct information through modified abi-dumper.pl,
>> named them as symbolinfo and typeinfo. You may need add "--all" and/or
>> "--dump-static" depends on your binaries.
>> 3.  Generate the file through trinity/scripts/struct_extract.py.
>>
>> We also submit a proposal to linuxcon europe last month. Hope we could get a
>> chance to share our works face to face.
>>
>> The patches could be found at [5] and [6]. It works but ugly. We hope we
>> could get some early response, such as if community is interested in this
>> tools, whether this tools could merge into existence fuzz tools.
>
>
> Hi Bamvor,
>
> Nice work!
>
> Coverage should be easy to do with CONFIG_KCOV, but do you need
> fuzzing/coverage? It seems that testing a predefined set of special
> values for each arg should be enough for your use case. Namely special
> values that can detect endianess/truncation/sign extension/etc issues.
Yes. We are trying to cover endianess/truncation/sign extension at this
moment.
For coverage, there are some code path in syscall wrapper in both glibc
and kernel. E.g. overflow check in glibc. I am thinking if coverage
could help on this.
>
> I think there is also a number of glibc functions that don't directly
> map to syscalls. Most notably wrappers around various ioctl's (e.g.
> ptsname). Do you test them?
No. Currently, our tools only focus on the syscall function in glibc. In
these syscall level, we could compare the parameter and return value
directly. As you said, there are only several type of issues. It is easy
to handle by tools.

I do not know how to test these complex cases. E.g. the ptsname may call
ioctl, *stat* syscall. Compare the original parameter is meaningless. But
it seems a good type of testcase to show how the user use the syscalls.
Do you have some ideas?

Regards

Bamvor

> --
> To unsubscribe from this list: send the line "unsubscribe linux-arch" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>


^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [RFD] Efficient unit test and fuzz tools for kernel/libc porting
  2016-07-06  8:24   ` Zhangjian (Bamvor)
@ 2016-07-06  9:09     ` Dmitry Vyukov
  2016-07-06 10:38       ` Zhangjian (Bamvor)
  0 siblings, 1 reply; 8+ messages in thread
From: Dmitry Vyukov @ 2016-07-06  9:09 UTC (permalink / raw)
  To: Zhangjian (Bamvor)
  Cc: syzkaller, LKML, linux-arch, libc-alpha, trinity, aponomarenko,
	Jess Hertz, Tim Newsham, Arnd Bergmann, Catalin Marinas,
	Mark Brown, joseph, maxim.kuvyrkov, Yury Norov, pinskia, schwab,
	agraf, marcus.shawcroft, Ding Tianhong, guohanjun, cuibixuan,
	lijinyue, Zefan Li

On Wed, Jul 6, 2016 at 10:24 AM, Zhangjian (Bamvor)
<bamvor.zhangjian@huawei.com> wrote:
> Hi, Dmitry
>
>
>> Hi Bamvor,
>>
>> Nice work!
>>
>> Coverage should be easy to do with CONFIG_KCOV, but do you need
>> fuzzing/coverage? It seems that testing a predefined set of special
>> values for each arg should be enough for your use case. Namely special
>> values that can detect endianess/truncation/sign extension/etc issues.
>
> Yes. We are trying to cover endianess/truncation/sign extension at this
> moment.
> For coverage, there are some code path in syscall wrapper in both glibc
> and kernel. E.g. overflow check in glibc. I am thinking if coverage
> could help on this.

Ah, you mean user-space coverage. You may try AFL in binary
instrumentation mode for this.


>> I think there is also a number of glibc functions that don't directly
>> map to syscalls. Most notably wrappers around various ioctl's (e.g.
>> ptsname). Do you test them?
>
> No. Currently, our tools only focus on the syscall function in glibc. In
> these syscall level, we could compare the parameter and return value
> directly. As you said, there are only several type of issues. It is easy
> to handle by tools.
>
> I do not know how to test these complex cases. E.g. the ptsname may call
> ioctl, *stat* syscall. Compare the original parameter is meaningless. But
> it seems a good type of testcase to show how the user use the syscalls.
> Do you have some ideas?

I don't have any ideas for automated testing. One could write a model,
of course....

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [RFD] Efficient unit test and fuzz tools for kernel/libc porting
  2016-07-06  9:09     ` Dmitry Vyukov
@ 2016-07-06 10:38       ` Zhangjian (Bamvor)
  0 siblings, 0 replies; 8+ messages in thread
From: Zhangjian (Bamvor) @ 2016-07-06 10:38 UTC (permalink / raw)
  To: Dmitry Vyukov
  Cc: syzkaller, LKML, linux-arch, libc-alpha, trinity, aponomarenko,
	Jess Hertz, Tim Newsham, Arnd Bergmann, Catalin Marinas,
	Mark Brown, joseph, maxim.kuvyrkov, Yury Norov, pinskia, schwab,
	agraf, marcus.shawcroft, Ding Tianhong, guohanjun, cuibixuan,
	lijinyue, Zefan Li

Hi, Dmitry

On 2016/7/6 17:09, Dmitry Vyukov wrote:
> On Wed, Jul 6, 2016 at 10:24 AM, Zhangjian (Bamvor)
> <bamvor.zhangjian@huawei.com> wrote:
>> Hi, Dmitry
>>
>>
>>> Hi Bamvor,
>>>
>>> Nice work!
>>>
>>> Coverage should be easy to do with CONFIG_KCOV, but do you need
>>> fuzzing/coverage? It seems that testing a predefined set of special
>>> values for each arg should be enough for your use case. Namely special
>>> values that can detect endianess/truncation/sign extension/etc issues.
>>
>> Yes. We are trying to cover endianess/truncation/sign extension at this
>> moment.
>> For coverage, there are some code path in syscall wrapper in both glibc
>> and kernel. E.g. overflow check in glibc. I am thinking if coverage
>> could help on this.
>
> Ah, you mean user-space coverage. You may try AFL in binary
> instrumentation mode for this.
Good idea. AFL seems a wonderful tools. I saw some discussion about use AFL
to do kernel fuzz(triforce). If AFL support arm64, I could try it my
aarch64 ILP32 works.

Regards

Bamvor
>
>
>>> I think there is also a number of glibc functions that don't directly
>>> map to syscalls. Most notably wrappers around various ioctl's (e.g.
>>> ptsname). Do you test them?
>>
>> No. Currently, our tools only focus on the syscall function in glibc. In
>> these syscall level, we could compare the parameter and return value
>> directly. As you said, there are only several type of issues. It is easy
>> to handle by tools.
>>
>> I do not know how to test these complex cases. E.g. the ptsname may call
>> ioctl, *stat* syscall. Compare the original parameter is meaningless. But
>> it seems a good type of testcase to show how the user use the syscalls.
>> Do you have some ideas?
>
> I don't have any ideas for automated testing. One could write a model,
> of course....
> --
> To unsubscribe from this list: send the line "unsubscribe linux-arch" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [RFD] Efficient unit test and fuzz tools for kernel/libc porting
  2016-07-06  7:39 [RFD] Efficient unit test and fuzz tools for kernel/libc porting Zhangjian (Bamvor)
  2016-07-06  8:00 ` Dmitry Vyukov
  2016-07-06  8:00 ` Zhangjian (Bamvor)
@ 2016-07-20 15:47 ` Joseph Myers
  2016-07-21 12:39   ` Zhangjian (Bamvor)
  2 siblings, 1 reply; 8+ messages in thread
From: Joseph Myers @ 2016-07-20 15:47 UTC (permalink / raw)
  To: Zhangjian (Bamvor)
  Cc: linux-kernel, linux-arch, libc-alpha, trinity, syzkaller,
	aponomarenko, Jess Hertz, Tim Newsham, arnd, catalin.marinas,
	broonie, maxim.kuvyrkov, ynorov, pinskia, schwab, agraf,
	marcus.shawcroft, dingtianhong, guohanjun, cuibixuan, lijinyue,
	lizefan

On Wed, 6 Jul 2016, Zhangjian (Bamvor) wrote:

> correct or not. After learn and compare some fuzz tools, I feel that there is
> no such fuzz tools could help me. So, I wrote a new fuzz tools base on the
> trinity and it found several wrapper issues in glibc. I will first explain the
> different with existing fuzz tools and paste my propsosal in the end.

I'm not at all clear on whether any of the people working on AArch64 ILP32 
glibc have run the glibc testsuite and investigated the results in detail 
(the patch submissions have failed to include glibc testsuite results and 
have included bugs that would have been detected by the glibc testsuite).  
But, if you've found bugs in a new glibc port that were not detected by 
the existing testsuite, then tests for those bugs should be contributed to 
glibc (even if no existing port has those bugs, improving the test 
coverage is still a good idea).

-- 
Joseph S. Myers
joseph@codesourcery.com

^ permalink raw reply	[flat|nested] 8+ messages in thread

* Re: [RFD] Efficient unit test and fuzz tools for kernel/libc porting
  2016-07-20 15:47 ` Joseph Myers
@ 2016-07-21 12:39   ` Zhangjian (Bamvor)
  0 siblings, 0 replies; 8+ messages in thread
From: Zhangjian (Bamvor) @ 2016-07-21 12:39 UTC (permalink / raw)
  To: Joseph Myers
  Cc: LKML, Linux-Arch, libc-alpha, trinity, syzkaller, aponomarenko,
	Jess Hertz, Tim Newsham, Arnd Bergmann, Catalin Marinas,
	Mark Brown, Maxim Kuvyrkov, Yury Norov, Andrew Pinski,
	Andreas Schwab, Alexander Graf, marcus.shawcroft, Ding Tianhong,
	Hanjun Guo, cuibixuan, lijinyue, Zefan Li

Hi, Joseph

On 2016/7/20 23:47, Joseph Myers wrote:
> On Wed, 6 Jul 2016, Zhangjian (Bamvor) wrote:
>
>> correct or not. After learn and compare some fuzz tools, I feel that there is
>> no such fuzz tools could help me. So, I wrote a new fuzz tools base on the
>> trinity and it found several wrapper issues in glibc. I will first explain the
>> different with existing fuzz tools and paste my propsosal in the end.
>
> I'm not at all clear on whether any of the people working on AArch64 ILP32
> glibc have run the glibc testsuite and investigated the results in detail
> (the patch submissions have failed to include glibc testsuite results and
>  have included bugs that would have been detected by the glibc testsuite).
I run test glibc testsuite in previous glibc version with v6 kernel patch
backport to kernel-4.1, without regression. I usually run glibc testsuite
after ltp test result looks good. So, maybe it hard to find a issue by
glibc testsuite in this case.
> But, if you've found bugs in a new glibc port that were not detected by
> the existing testsuite, then tests for those bugs should be contributed to
> glibc (even if no existing port has those bugs, improving the test
> coverage is still a good idea).
It is good idea. I will review the fixed issues(such as wrong context in
signal, wrong parameter in off_t/stat relative syscalls) and check if it is
suitable to add it to glibc testsuite. (Actually, I do not know which
test suite (ltp or glibc) I should improve for a specific issue).
I hope our tools could help on improving the coverage of syscall relative
code at least.

Thanks.

Bamvor


^ permalink raw reply	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2016-07-21 12:39 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2016-07-06  7:39 [RFD] Efficient unit test and fuzz tools for kernel/libc porting Zhangjian (Bamvor)
2016-07-06  8:00 ` Dmitry Vyukov
2016-07-06  8:24   ` Zhangjian (Bamvor)
2016-07-06  9:09     ` Dmitry Vyukov
2016-07-06 10:38       ` Zhangjian (Bamvor)
2016-07-06  8:00 ` Zhangjian (Bamvor)
2016-07-20 15:47 ` Joseph Myers
2016-07-21 12:39   ` Zhangjian (Bamvor)

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).