Implement `Memory1` (`RULE-8-7-1`) by jeongsoolee09 · Pull Request #967 · github/codeql-coding-standards

jeongsoolee09 · 2025-11-18T23:09:50Z

Description

Implement Memory1 (RULE-8-7-1).

Change request type

Release or process automation (GitHub workflows, internal scripts)
Internal documentation
External documentation
Query files (.ql, .qll, .qls or unit tests)
External scripts (analysis report or other code shipped as part of a release)

Rules with added or modified queries

No rules added
Queries have been added for the following rules:
- RULE-8-7-1
Queries have been modified for the following rules:
- rule number here

Release change checklist

A change note (development_handbook.md#change-notes) is required for any pull request which modifies:

The structure or layout of the release artifacts.
The evaluation performance (memory, execution time) of an existing query.
The results of an existing query in any circumstance.

If you are only adding new rule queries, a change note is not required.

Author: Is a change note required?

Yes
No

🚨🚨🚨
Reviewer: Confirm that format of shared queries (not the .qll file, the
.ql file that imports it) is valid by running them within VS Code.

Confirmed

Reviewer: Confirm that either a change note is not required or the change note is required and has been added.

Confirmed

Query development review checklist

For PRs that add new queries or modify existing queries, the following checklist should be completed by both the author and reviewer:

Author

Have all the relevant rule package description files been checked in?
Have you verified that the metadata properties of each new query is set appropriately?
Do all the unit tests contain both "COMPLIANT" and "NON_COMPLIANT" cases?
Are the alert messages properly formatted and consistent with the style guide?
Have you run the queries on OpenPilot and verified that the performance and results are acceptable?
As a rule of thumb, predicates specific to the query should take no more than 1 minute, and for simple queries be under 10 seconds. If this is not the case, this should be highlighted and agreed in the code review process.
Does the query have an appropriate level of in-query comments/documentation?
Have you considered/identified possible edge cases?
Does the query not reinvent features in the standard library?
Can the query be simplified further (not golfed!)

Reviewer

Have all the relevant rule package description files been checked in?
Have you verified that the metadata properties of each new query is set appropriately?
Do all the unit tests contain both "COMPLIANT" and "NON_COMPLIANT" cases?
Are the alert messages properly formatted and consistent with the style guide?
Have you run the queries on OpenPilot and verified that the performance and results are acceptable?
As a rule of thumb, predicates specific to the query should take no more than 1 minute, and for simple queries be under 10 seconds. If this is not the case, this should be highlighted and agreed in the code review process.
Does the query have an appropriate level of in-query comments/documentation?
Have you considered/identified possible edge cases?
Does the query not reinvent features in the standard library?
Can the query be simplified further (not golfed!)

…or it

…terminator, remove file pointer cases 1. Add headers, Adding missing headers: For obvious reasons. 2. Remove cases without null terminator: Both clang and g++ do not permit strings to be allocated that are declared to be shorter than the actual initializing expression. Since this is a C++ rule, we rule them out. 3. File pointer manipulation functions (e.g. fgets): Not required by the rule.

…w-nodes-MISRA-C++-2023-Memory-Experimental' into jeongsoolee09/MISRA-C++-2023-Memory-Experimental

These contain false positives due to the limitation of the status quo of the query.

MichaelRFairhurst

So close to ready!

MichaelRFairhurst · 2026-03-25T02:13:57Z

+  CallocFunctionCall() { this.isCallocCall() }
+
+  override int getMinNumBytes() {
+    result = lowerBound(this.getArgument(0)) * lowerBound(this.getArgument(1))


Perhaps we should file a bug to come back to this.

In theory, it would be great to have two versions of the query: one where we know with certainty that the resulting pointer is out of bounds if flow analysis is correct -- we assume the maximum allocation size and the smallest pointer offsets. Then another where we suspect a possible invalid pointer, where we assume the minimum allocation size and the largest pointer offsets. These could share most behavior and would have different precisions.

In the meantime, lets ship!

MichaelRFairhurst · 2026-03-25T02:20:24Z

+
+newtype TArrayAllocation =
+  TStackAllocation(ArrayDeclaration arrayDecl) or
+  TDynamicAllocation(NarrowedHeapAllocationFunctionCall narrowedAlloc)


Let's file a bug to come back to the third kind of "allocation," which is just taking the address of a non-array variable or lvalue.

int x = 0; int *p = &x; // p is essentially a buffer of size 1

Partly I say let's come back because we would need to be careful to distinguish:

int x = 0; int arr[5] = {0}; int *p1 = &x; // generally, taking an address to anything should be a buffer of size 1 int *p2 = &arr[0]; // except this // Note that any lvalue expression can create a "buffer" of size 1, not just variables: int &f() { return x; } int *p3 = &f(); // also a "buffer" of size 1 int *p4 = &*p3; // also a "buffer" of size 1

Implemented to some degree in 44ef266.

MichaelRFairhurst · 2026-03-25T02:23:13Z

+   */
+  int getOffset() {
+    if this.asPointerArithmetic() instanceof PointerSubExpr
+    then result = -this.getOffsetExpr().getValue().toInt()


Another thing to file is that this currently only works on constant values, but in the future we could extend this to use range analysis.

Good point. Introducing range analysis should be careful, otherwise it might generate a lot of noise. This is out of scope of this PR and should be reserved for later.

MichaelRFairhurst · 2026-03-25T02:26:27Z

+    sink.getNode() = end.getBasePointerNode()
+  |
+    srcOffset = start.getOffset() and
+    sinkOffset = end.getOffset() and


This overwrites the previous offset, but they should add up.

For example:

int arr[5]; int *p = arr; int p1 = p + 3; // offset: 3, length: 5 int p2 = p1 + 2; // offset: 5, length: 5

Currently, this will produce sinkOffset = 2 for the last line

Implemented in 394b7ad and documented in 2e4ace6.

Nice!

Only thought now, do still need srcOffset, sinkOffset to be in the table / to be predicate parameters?

In the base case, the srcOffset and sinkOffset come from start and end, not from srcSinkLengthMap.

In the recursive case, the srcOffset from the previous iteration is unused (srcSinkLengthMap(_, start, /*here -> */ _, ...). The sinkOffset from the previous iteration is only bound to be the new srcOffset, which we just determined wasn't used in the next iteration.

srcOffset and sinkOffset are then also not used by the select

jeongsoolee09 · 2026-04-21T20:10:30Z

Two things to note about the multidimensional arrays:

If a row access and the row element access are apart from each other with a function boundary in between, the query loses indirection information. We probably need to add a level column to simulate push-pop behavior: taking an indirection edge during initialization pushes and accesses pops them.
Currently, the relationship between FatPointer and DataFlow::Node is one-to-many, if we don't-care the level column. The multidimensional array accesses have duplicate alerts and this is probably what is causing it.

MichaelRFairhurst

literally just minor tweaks!! This looks 🔥 great!

MichaelRFairhurst · 2026-04-22T17:47:59Z

+/**
+ * This module provides classes and predicates for analyzing the size of buffers
+ * or objects from their base or a byte-offset, and identifying the potential for
+ * expressions accessing those buffers to overflow.


In this case, can we have c/common/src/codingatandards/c/OutOfBounds.qll import cpp.OutOfBounds ? A simple wrapper import file would be a reasonable easy refactor.

(or find each c query that imports OutOfBounds and update those)?

I think it's probably reasonable

MichaelRFairhurst · 2026-04-22T17:50:41Z

+ * @precision medium
+ * @problem.severity error
+ * @tags external/misra/id/rule-8-7-1
+ *       scope/system


looks like we're missing correctness

MichaelRFairhurst · 2026-04-22T17:51:44Z

+  /**
+   * Gets the declared length of this array.
+   */
+  int getLength() { result = length }


This and the int length; on line 26 can be deleted now, right?

MichaelRFairhurst · 2026-04-22T21:51:03Z

+    sink.getNode() = end.getBasePointerNode()
+  |
+    srcOffset = start.getOffset() and
+    sinkOffset = end.getOffset() and


Nice!

Only thought now, do still need srcOffset, sinkOffset to be in the table / to be predicate parameters?

In the base case, the srcOffset and sinkOffset come from start and end, not from srcSinkLengthMap.

In the recursive case, the srcOffset from the previous iteration is unused (srcSinkLengthMap(_, start, /*here -> */ _, ...). The sinkOffset from the previous iteration is only bound to be the new srcOffset, which we just determined wasn't used in the next iteration.

srcOffset and sinkOffset are then also not used by the select

MichaelRFairhurst · 2026-04-22T22:00:30Z

+      array +
+      4; // NON_COMPLIANT: pointer points more than one beyond the last element
+  int *invalid2 =
+      array - 1; // NON_COMPLIANT: pointer is outside boundary [FALSE_NEGATIVE]


No longer a false negative! 🎉

MichaelRFairhurst · 2026-04-22T22:18:30Z

+    strcat(buf1, " ");     // NON_COMPLIANT - not null terminated
+    strcat(buf2, " ");     // COMPLIANT
+    strcat(buf3, " ");     // COMPLIANT
+    strcat(buf4, "12345"); // NON_COMPLIANT


Hmm, this is actually a FN

MichaelRFairhurst · 2026-04-22T22:20:56Z

+          "description": "Pointers obtained as result of performing arithmetic should point to an initialized object, or an element right next to the last element of an array.",
+          "kind": "path-problem",
+          "name": "Pointer arithmetic shall not form an invalid pointer",
+          "precision": "medium",


I'd be very tempted to say "high" precision.

Nicely done :)

MichaelRFairhurst · 2026-04-22T22:22:41Z

+          "severity": "error",
+          "short_name": "PointerArithmeticFormsAnInvalidPointer",
+          "tags": [
+            "scope/system"


tags should probably include correctness and security, for this and below

MichaelRFairhurst · 2026-04-22T22:23:16Z

+          "tags": [
+            "scope/system"
+          ]
+        },


Consider adding an implementation scope that we only handle constant offsets for increased precision

MichaelRFairhurst · 2026-04-22T22:31:27Z

+  )
+select end, src, sink,
+  "This pointer accesses element at index " + totalOffset +
+    " while the underlying object has length " + length + "."


One small thought on verbiage.

While "Object" is the right term here, generally speaking, I'm not sure devs think of objects as something that has a "length"... I'd say objects have a "size," and that size is in bytes, making "length" perhaps doubly confusing.

If you want to go with "length," I'd probably suggest "array" or "buffer." Though it could be confusing to say that &x is either.

Maybe something like, "Pointer formed that points to element X of an object contains Y elements" ?

jeongsoolee09 added 2 commits November 18, 2025 17:43

Number Memory packages

b2231c9

Add rule description files

9b5d8b2

jeongsoolee09 self-assigned this Nov 18, 2025

jeongsoolee09 added 27 commits January 12, 2026 17:29

Add Memory1 package files

a5d4127

Expose malloc, calloc and realloc

1a2cde8

Minor comments

c21e862

Checkpoint

c0b1e55

Split out source and sinks into their cases

9d3bab0

Checkpoint

a8a6db7

Merge branch 'main' into jeongsoolee09/MISRA-C++-2023-Memory

4464702

Checkpoint: Add DynamicAllocation case

e2c5870

First working draft

fe2a3c4

Refine into path-problem

5ea652b

Change TaintTracking to DataFlow

7b860d9

Finalize first working draft for stack / heap arrays

08b8bf7

Document code copy and clean up imports

062c62f

Add multidimensional arrays alloc'ed on stack

e264dfd

Add test.cpp and expected test results

4d2bc8b

Add exclusion for Memory1.qll

21500b8

Adjust precision of existing rule and add a supplementary rule

e9f39a2

Add supplementary query files

a62e2e1

Fix @kind from problem to path-problem

ca62995

Copy OutOfBounds.qll to cpp/common/src/codingstandards/cpp/

8abf097

Remove unused import codingstandards.cpp.Variable in OutOfBounds.qll

f5454de

Add PointerArgumentToCstringFunctionIsInvalid.ql and create testref f…

d82ed6e

…or it

Copy test.c from ARR38-C and add strncpy

356bbf2

Remove testref and add qlref and expected

9ced913

Address case of strncat

1c5dc84

Remove unused predicate and update .expected

2f80208

jeongsoolee09 added 7 commits March 10, 2026 10:05

Update pointer_only.cpp according to test.cpp

5d2f62f

Fix redeclaration issue

53c0ef7

Merge remote-tracking branch 'origin/michaelrfairhurst/update-dataflo…

19f8e94

…w-nodes-MISRA-C++-2023-Memory-Experimental' into jeongsoolee09/MISRA-C++-2023-Memory-Experimental

Add multidimensional_only.cpp

2d6c9eb

Add docstrings

9493945

Support the multidimensional array cases

212b266

Finish experimentation with srcSinkLengthMap and merge

e3c80f5

jeongsoolee09 requested a review from MichaelRFairhurst March 23, 2026 21:46

jeongsoolee09 added 6 commits March 23, 2026 17:48

Merge branch 'main' into jeongsoolee09/MISRA-C++-2023-Memory

369c99b

Update RuleMetadata.qll

9272117

Remove unnecessary files and fix formatting

4bcc59f

Fix formatting and mention that OutOfBounds is a copy

14bdcbd

Add test cases

d254ec2

These contain false positives due to the limitation of the status quo of the query.

Format message and finish final draft

a692ba7

MichaelRFairhurst requested changes Mar 25, 2026

View reviewed changes

jeongsoolee09 added 12 commits April 9, 2026 14:26

Merge branch 'main' into jeongsoolee09/MISRA-C++-2023-Memory

49f49e6

Merge branch 'main' into jeongsoolee09/MISRA-C++-2023-Memory

51c0778

Fix minor formatting in the test file

328842e

Add note to IndirectUninitializedNode

08c151b

Add an accumulating logic

394b7ad

Add some documentation

2e4ace6

Add more docs

592c717

Fix formatting error

9f9d369

Add support for address of arbitrary lvalue exprs

44ef266

Fix formatting

8a45c9c

Update expected results of both queries

5b9b5d8

Merge branch 'main' into jeongsoolee09/MISRA-C++-2023-Memory

7b3569f

jeongsoolee09 requested a review from MichaelRFairhurst April 21, 2026 20:10

MichaelRFairhurst requested changes Apr 22, 2026

View reviewed changes

Conversation

jeongsoolee09 commented Nov 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Change request type

Rules with added or modified queries

Release change checklist

Query development review checklist

Author

Reviewer

Uh oh!

MichaelRFairhurst left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jeongsoolee09 commented Apr 21, 2026

Uh oh!

MichaelRFairhurst left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jeongsoolee09 commented Nov 18, 2025 •

edited

Loading