There are some functions in ANSI-C (and our own API's as well) which are known for continuous security problems.
We could use a static code analysis tool (e.g. flawfinder) to find at least the insecure ANSI-C calls.
Wireshark's code (especially the dissectors) is checked using a custom Perl script (tools/checkAPIs.pl) that checks whether various APIs are called. Unsafe APIs are 'prohibited' which raises errors on the buildbot if anyone checks in code using those APIs.
sprintf -> g_snprintf
The sprintf call is known to be insecure as no buffer length is given to this call, which often results in hard to find bugs when this function writes behind the given buffer.
As there are currently many places where sprintf is still used, a list of files currently doesn't make sense.
There is still a list of such calls in the EMEMification page since most of these calls should be replaces with g_snprintf() to an emem buffer anyway.
strcpy -> strlcpy / strcat -> strlcat
The stcpy/strcat functions can easily write behind the given buffer, see: http://www.gratisoft.us/todd/papers/strlcpy.html
Introducing API routines to deal with TLVs (type, length, value) could eliminate a lot of hand-crafted dissector code with potential problems (like endless loops).
There will be some things to think about, as TLV can have a variety of differences: byte ordering, 8,16,32 or even esoteric bit lengths, ... - UlfLamping