Bufferoverrun Analysis - small fixes and improvements #1736

sjxer723 · 2023-02-17T03:35:11Z

Hi, this pr contains some improvements and fixes to buffer overrun analysis. The two fixes were originally inspired due to the two false positives/negatives we found during using it.

The buffer overrun checker ignores checking the case when the index may be +oo while the array size is less than +oo. It may cause a false negative. For example,

int a[1];
for(int i=0; a[i]; i++) {}

There will be a buffer overrun issue within the condition judgment of the for statement since i will finally be equal to 1 and the statement will check whether a[1] is nonzero.

The buffer overrun checker makes a rough estimation when adding two upper bounds, i.e., when i1 <= max {1, i1}, i2 <= i2, BO only let i1 + i2 <= 1 + i2, while the true upper bound should be max{1 + i2, i1 + i2}. Hence it will cause a false positive when checking the codes below:

#define MAX_LEN (16)

void foo(const unsigned char *additional, size_t len, size_t nonce_len, size_t entropy_len)
{
        unsigned char seed[MAX_LEN];
        size_t        seedlen = 0;

        if(entropy_len > MAX_LEN) 
            return;
        if(nonce_len > MAX_LEN - entropy_len) 
            return;
        if(len > MAX_LEN - entropy_len - nonce_len) 
            return;

        seedlen +=  entropy_len;
        if(nonce_len != 0)
                seedlen += nonce_len;
        if(additional != NULL && len != 0){
                memcpy(seed + seedlen, additional, len);
                seedlen += len;
        }
}

int main()
{
        unsigned char arr[20] = {0};
        foo(arr, 8, 0, 5);
        return 0;
}

Before executing the fourth if statement, BO will regard the value of seedlen as max{16, entropy_len}. However, when executing the fourth if statement, by adding the two variables seedlen and nonce_len, BO makes a rough estimation as above description, and will regard the upper bound of seedlen as 16 + nonce_len. Hence, it will cause a false positive when executing the call to the memcpy function.

For the two issues above, I have made the following improvements.

Within the checking function of array access, I let BO report a L3-level error when the upper bound of index is +oo and the upper bound of the array size is limited.
I have implemented a more precise plus of bounds. It handles the case when one operator is c1 + max{x1, d1} and another is c2 + d2.

… and the upper bound of size is smaller than +oo Summary: The buffer overrun checker misses to handle the case when the upper bound of offset is +oo, and the upper bound of array size is less than +oo, which will causes false negative in some test cases. For instance, for the following program, ``` int a[1]; for(int i=0; a[i]; i++) {} ``` The variable `i` will eventually be equal 1 and causes an overrun error within the loop statement. However, this error was missed by buffer overrun checker.

…/lower bounds

facebook-github-bot · 2023-02-28T09:12:53Z

@skcho has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

skcho · 2023-02-28T09:50:15Z

infer/src/bufferoverrun/bufferOverrunProofObligations.ml

+ else if
+ Bound.is_infty (ItvPure.ub real_idx) && Bound.is_not_infty (ItvPure.ub size) 
+ then {report_issue_type= Issue IssueType.buffer_overrun_l3; propagate= false}
+ (* su < iu = +oo, probably an error *)


However, this error was missed by buffer overrun checker.

a.c:4: error: Buffer Overrun L4 Offset: [0, +oo] Size: 2. 2. void foo() { 3. int a[2]; 4. for(int i=0; a[i]; i++) {} ^ 5. }

For the example case, it reports L4 issue as above. The reason you did not see the issue is that it suppresses the issue types that are likely to be false positive, e.g. offset or size interval includes an infinity bound like above.

infer/infer/src/base/IssueType.ml

Lines 380 to 382 in 5aeb169

let buffer_overrun_l4 =

register ~enabled:false ~id:"BUFFER_OVERRUN_L4" Error BufferOverrunChecker

~user_documentation:"See [BUFFER_OVERRUN_L1](#buffer_overrun_l1)"

Note that ~enabled:false is given when defining the L4 issue type.

To enable all issue types, you can pass --no-filtering option, or --enable-issue-type to enable each of them.

skcho · 2023-02-28T10:04:36Z

infer/src/bufferoverrun/bufferOverrunUtils.ml

+ let mem = (
+ match Tenv.lookup tenv typname with
+ | Some {fields} -> 
+ let (mem, _) = decl_local_struct_fields_loc model_env loc fields (mem, 1) in mem
+ | None -> mem 
+ ) 
+ in 


I think it is better to pass current dimension and inst_num, to distinguish the allocation sites.

let mem, inst_num = match Tenv.lookup tenv typname with | Some {fields} -> decl_local_struct_fields_loc model_env loc fields ~inst_num ~dimension mem | None -> (mem, inst_num) in

skcho · 2023-02-28T10:19:03Z

infer/src/bufferoverrun/bounds.ml

+ | MinMax (c1, Plus, Max, d1, x1), Linear (c2, x2)
+ | Linear (c2, x2), MinMax (c1, Plus, Max, d1, x1) ->
+ mk_MinMaxB (Max, (Linear (Z.(c1 + d1 + c2), x2)), (Linear (Z.(c1 + c2), SymLinear.plus (SymLinear.singleton_one x1) x2)))
+ | MinMax (c1, Minus, Min, d1, x1), Linear (c2, x2)
+ | Linear (c2, x2), MinMax (c1, Minus, Min, d1, x1) ->
+ mk_MinMaxB (Max, (Linear (Z.(c1 - d1 + c2), x2)), (Linear (Z.(c1 + c2), SymLinear.plus (SymLinear.singleton_minus_one x1) x2)))


While MinMaxB was introduced to express more complex min/max expressions, it is NOT always a better choice than others. By its complex nature, many operations may easily lose precision on MinMaxB bound, for example, during widening it is more likely to have infinity bounds according to my experience. Therefore, if we want to introduce MinMaxB here for the plus operation, we should evaluate overall precision impacts. Let me check if it is fine for our use cases.

facebook-github-bot added the CLA Signed label Feb 17, 2023

sjxer723 added 2 commits February 24, 2023 10:48

[inferbo] Make a more precise upper/lower bound when adding two upper…

fd0a4e6

…/lower bounds

sjxer723 force-pushed the main branch from 675e04e to 43f4581 Compare February 24, 2023 02:49

[inferbo] Initialize locations for arrays within struct typed variables

347d391

sjxer723 force-pushed the main branch from 43f4581 to 347d391 Compare February 24, 2023 03:36

skcho reviewed Feb 28, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bufferoverrun Analysis - small fixes and improvements #1736

Bufferoverrun Analysis - small fixes and improvements #1736

sjxer723 commented Feb 17, 2023

facebook-github-bot commented Feb 28, 2023

skcho Feb 28, 2023 •

edited

skcho Feb 28, 2023

skcho Feb 28, 2023

	let buffer_overrun_l4 =
	register ~enabled:false ~id:"BUFFER_OVERRUN_L4" Error BufferOverrunChecker
	~user_documentation:"See [BUFFER_OVERRUN_L1](#buffer_overrun_l1)"

Bufferoverrun Analysis - small fixes and improvements #1736

Are you sure you want to change the base?

Bufferoverrun Analysis - small fixes and improvements #1736

Conversation

sjxer723 commented Feb 17, 2023

facebook-github-bot commented Feb 28, 2023

skcho Feb 28, 2023 • edited

Choose a reason for hiding this comment

skcho Feb 28, 2023

Choose a reason for hiding this comment

skcho Feb 28, 2023

Choose a reason for hiding this comment

skcho Feb 28, 2023 •

edited