No operating system calls, such as memory allocation or file i/o, should occur in time-critical inner loops. This may require design changes.
Minimize the number of pointers in your data structures. Each pointer reference requires a potentially-nonlocal memory reference, resulting in a cache miss, and can take up to 8 bytes of storage on some machines.
Go to the first, previous, next, last section, table of contents.