You might improve the efficiency of the generated code using one of the following optimizations:
Unroll for-Loops
Inline Code
Eliminate Redundant Copies of Function Inputs