putty-source

mirror of https://git.tartarus.org/simon/putty.git synced 2025-01-09 01:18:00 +00:00

Author	SHA1	Message	Date
Simon Tatham	c4c4d2c5cb	dup_mb_to_wc, dup_wc_to_mb: remove the 'flags' parameter. This parameter was undocumented, and Windows-specific: its semantics date from before PuTTY was cross-platform, and are "Pass this flags parameter straight through to the Win32 API's conversion functions". So in Windows platform code you can pass flags like MB_USEGLYPHCHARS, but in cross-platform code, you dare not pass anything nonzero at all because the Unix frontend won't recognise it (or, likely, even compile). I've kept the flag for now in the underlying mb_to_wc / wc_to_mb functions. Partly that's because there's one place in the Windows code where the parameter _is_ used; mostly, it's because I'm about to replace those functions anyway, so there's no point in editing all the call sites twice.	2024-09-24 09:42:58 +01:00
Simon Tatham	964890f1a1	Stringify all the CONF_foo identifiers, for debugging. When dumping out the contents of a Conf, it's useful not to have to guess what the integer indices mean. By putting these identifiers in a separate array in its own library module, I should avoid them getting linked in to production binaries to take up space, as long as conf_id() is only called from inside debug() statements. And to enforce _that_, it isn't even declared in a header file unless you #define DEBUG.	2024-09-23 16:49:29 +01:00
Simon Tatham	392be3e494	New utility function: decode_utf8_to_wide_string. We already had encode_wide_string_as_utf8, which treats the wide string as UTF-16 or UTF-32 as appropriate to the size of wchar_t. I'm about to need the inverse function, and was surprised that it didn't already exist (even though enough component parts did to make it easy).	2023-05-29 15:08:49 +01:00
Simon Tatham	36db93748e	New utility function: dupwcs. Just like dupstr, but for wchar_t strings.	2023-05-29 15:08:49 +01:00
Simon Tatham	9e01de7c2b	decode_utf8: add an enumeration of failure reasons. Now you can optionally get back an enum value indicating whether the character was successfully decoded, or whether U+FFFD was substituted due to some kind of problem, and if the latter, what problem. For a start, this allows distinguishing 'real' U+FFFD (encoded legitimately in the input) from one invented by the decoder. Also, it allows the recipient of the decode to treat failures differently, either by passing on a useful error report to the user (as utf8_unknown_char now does) or by doing something special. In particular, there are two distinct error codes for a truncated UTF-8 encoding, depending on whether it was truncated by the end of the input or by encountering a non-continuation byte. The former code means that the string is not legal UTF-8 _as it is_, but doesn't rule out it being a (bytewise) prefix of a legal UTF-8 string - so if a client is receiving UTF-8 data a byte at a time, they can treat that error code specially and not make it a fatal error.	2023-02-17 17:16:54 +00:00
Simon Tatham	f4519b6533	Add UTF-8 support to the new Windows ConsoleIO system. This allows you to set a flag in conio_setup() which causes the returned ConsoleIO object to interpret all its output as UTF-8, by translating it to UTF-16 and using WriteConsoleW to write it in Unicode. Similarly, input is read using ReadConsoleW and decoded from UTF-16 to UTF-8. This flag is set to false in most places, to avoid making sudden breaking changes. But when we're about to present a prompts_t to the user, it's set from the new 'utf8' flag in that prompt, which in turn is set by the userauth layer in any case where the prompts are going to the server. The idea is that this should be the start of a fix for the long-standing character-set handling bug that strings transmitted during SSH userauth (usernames, passwords, k-i prompts and responses) are all supposed to be in UTF-8, but we've always encoded them in whatever our input system happens to be using, and not done any tidying up on them. We get occasional complaints about this from users whose passwords contain characters that are encoded differently between UTF-8 and their local encoding, but I've never got round to fixing it because it's a large piece of engineering. Indeed, this isn't nearly the end of it. The next step is to add UTF-8 support to all the _other_ ways of presenting a prompts_t, as best we can. Like the previous change to console handling, it seems very likely that this will break someone's workflow. So there's a fallback command-line option '-legacy-charset-handling' to revert to PuTTY's previous behaviour.	2022-11-26 10:49:03 +00:00
Simon Tatham	dbd0bde415	New utility function burnwcs(). Just like burnstr(), it memsets a NUL-terminated string to all zeroes before freeing it. The only difference is that it does it to a string of wchar_t.	2022-11-26 10:32:36 +00:00
Simon Tatham	d3e186e81b	Function to check a UTF-8 string for unknown characters. So we can reject things we don't know how to NFC yet.	2022-11-11 08:49:05 +00:00
Simon Tatham	b35d23f699	Implement Unicode normalisation. A new module in 'utils' computes NFC and NFD, via a new set of data tables generated by read_ucd.py. The new module comes with a new test program, which can read the NormalizationTest.txt that appears in the Unicode Character Database. All the tests pass, as of Unicode 15.	2022-11-11 08:48:18 +00:00
Simon Tatham	69e217d23a	Make decode_utf8() read from a BinarySource. This enables it to handle data that isn't presented as a NUL-terminated string. In particular, the NUL byte can appear _within_ the string and be correctly translated to the NUL wide character. So I've been able to remove the awkwardness in the test rig of having to include the terminating NUL in every test to ensure NUL has been tested, and instead, insert a single explicit test for it. Similarly to the previous commit, the simplification at the (one) call site gives me a strong feeling of 'this is what the API should have been all along'!	2022-11-09 19:21:02 +00:00
Simon Tatham	834b58e39b	Make encode_utf8() output to a BinarySink. Previously it output to an ordinary char buffer, and returned the number of bytes it had written. But three out of the four call sites immediately chucked the resulting bytes into a BinarySink anyway. The fourth, in windows/unicode.c, really is writing into successive locations of a fixed-size buffer - but we can make that into a BinarySink too, using the buffer_sink added in the previous commit. So now encode_utf8() is renamed put_utf8_char, and the call sites all look simpler than they started out.	2022-11-09 19:02:32 +00:00
Simon Tatham	a8981212f5	Add a missing prototype. Too much C++, I expect - 'void foo()' in C++ means what I wanted it to mean!	2022-09-03 11:59:12 +01:00
Simon Tatham	d155009ded	Utility function to do terminal word wrapping. I'm planning to use this to replace some of the manually wrapped lines in console messages.	2022-07-07 18:05:32 +01:00
Simon Tatham	f579b3c01e	Certificate trust scope: change to a boolean-expression system. This replaces the previous placeholder scheme of having a list of hostname wildcards with implicit logical-OR semantics (if any wildcard matched then the certificate would be trusted to sign for that host). That scheme didn't allow for exceptions within a domain ('everything in example.com except extra-high-security-machine.example.com'), and also had no way to specify port numbers. In the new system, you can still write a hostname wildcard by itself in the simple case, but now those are just atomic subexpressions in a boolean-logic domain-specific language I've made up. So if you want multiple wildcards, you can separate them with || in a single longer expression, and also you can use && and ! to impose exceptions on top of that. Full details of the expression language are in the comment at the top of utils/cert-expr.c. It'll need documenting properly before release, of course. For the sake of backwards compatibility for early adopters who've already set up configuration in the old system, I've put in some code that will read the old MatchHosts configuration and automatically translate it into the equivalent boolean expression (by simply stringing together the list of wildcards with || between them).	2022-06-25 14:32:23 +01:00
Simon Tatham	08d58fe13e	Routines for %-encoding and %-decoding. These make a good storage format for mostly-textual data in configuration, if it can't afford to reserve any character as a delimiter. Assuming very few characters need to be escaped, the space cost is lower than base64, and also you can read it by eye.	2022-06-25 14:30:39 +01:00
Simon Tatham	76205b89e2	A few more ptrlen functions. ptrlen_contains and ptrlen_contains_only are useful for checking that a string is composed entirely of certain characters, or avoids them. ptrlen_end makes a pointer to the byte just past the end of the specified string. And it can be used with make_ptrlen_startend, which makes a ptrlen out of two pointers instead of a pointer and a length.	2022-06-25 14:30:39 +01:00
Simon Tatham	5a28658a6d	Remove uni_tbl from struct unicode_data. Instead of maintaining a single sparse table mapping Unicode to the currently selected code page, we now maintain a collection of such tables mapping Unicode to any code page we've so far found a need to work with, and we add code pages to that list as necessary, and never throw them away (since there are a limited number of them). This means that the wc_to_mb family of functions are effectively stateless: they no longer depend on a 'struct unicode_data' corresponding to the current terminal settings. So I've removed that parameter from all of them. This fills in the missing piece of yesterday's commit `a216d86106`: now wc_to_mb too should be able to handle internally-implemented character sets, by hastily making their reverse mapping table if it doesn't already have it. (That was only a _latent_ bug, because the only use of wc_to_mb in the cross-platform or Windows code _did_ want to convert to the currently selected code page, so the old strategy worked in that case. But there was no protection against an unworkable use of it being added later.)	2022-06-01 09:28:25 +01:00
Simon Tatham	d06ae2f5c3	New utility function base64_valid(). For when you want to tell the difference between a base64-encoded string and some other kind of string that might replace it.	2022-05-01 11:27:37 +01:00
Simon Tatham	043c24844a	Improve the base64 utility functions. The low-level functions to handle a single atom of base64 at a time have been in 'utils' / misc.h for ages, but the higher-level family of base64_encode functions that handle a whole data block were hidden away in sshpubk.c, and there was no higher-level decode function at all. Now moved both into 'utils' modules and declared them in misc.h rather than ssh.h. Also, improved the APIs: they all take ptrlen in place of separate data and length arguments, their naming is more consistent and more explicit (the previous base64_encode which didn't name its destination is now base64_encode_fp), and the encode functions now accept cpl == 0 as a special case meaning that the output base64 data is wanted in the form of an unbroken single-line string with no trailing \n.	2022-04-25 14:10:16 +01:00
Simon Tatham	e7d51505c7	Utility function strbuf_dup. If you already have a string (of potentially-binary data) in the form of a ptrlen reference to somewhere else, and you want to keep a copy somewhere, it's useful to copy it into a strbuf. But it takes a couple of lines of faff to do that, and it's nicer to wrap that up into a tiny helper function. This commit adds that helper function strbuf_dup, and its non-movable sibling strbuf_dup_nm for secret data. Also, gone through the existing code and found a bunch of cases where this makes things less verbose.	2022-04-24 08:38:27 +01:00
Simon Tatham	31db2e67bb	Make smemeq return unsigned, not bool. bool is dangerous in a time-safe context, because C compilers might insert a control flow divergence to implement the implicit normalisation of nonzero integers to 1 when you assign to a bool. Everywhere else time-safe, I avoid using it; but smemeq has been an exception until now, because the response to smemeq returning failure was to do an obvious protocol-level divergence _anyway_ (like disconnecting due to MAC mismatch). But I'm about to want to use smemeq in a context where I use the result _subtly_ and don't want to give away what it is, so now it's time to get rid of that bool and have smemeq return unsigned.	2022-04-15 17:46:06 +01:00
Simon Tatham	b360ea6ac1	Add a manual single-char UTF-8 decoder. This parallels encode_utf8 which we already had. Decoding is more fraught with perils than encoding, so I've also included a small test program.	2022-03-12 18:51:21 +00:00
Simon Tatham	21f602be40	Add utility function dup_wc_to_mb. This parallels dup_mb_to_wc, which already existed. I haven't needed the same thing this way round yet, but I'm about to.	2022-03-12 18:51:21 +00:00
Simon Tatham	5935c68288	Update source file names in comments and docs. Correcting a source file name in the docs just now reminded me that I've seen a lot of outdated source file names elsewhere in the code, due to all the reorganisation since we moved to cmake. Here's a giant pass of trying to make them all accurate again.	2022-01-22 15:51:31 +00:00
Simon Tatham	be8d3974ff	Generalise strbuf_catf() into put_fmt(). marshal.h now provides a macro put_fmt() which allows you to write arbitrary printf-formatted data to an arbitrary BinarySink. We already had this facility for strbufs in particular, in the form of strbuf_catf(). That was able to take advantage of knowing the inner structure of a strbuf to minimise memory allocation (it would snprintf directly into the strbuf's existing buffer if possible). For a general black-box BinarySink we can't do that, so instead we dupvprintf into a temporary buffer. For consistency, I've removed strbuf_catf, and converted all uses of it into the new put_fmt - and I've also added an extra vtable method in the BinarySink API, so that put_fmt can still use strbuf_catf's more efficient memory management when talking to a strbuf, and fall back to the simpler strategy when that's not available.	2021-11-19 11:32:47 +00:00
Simon Tatham	efee4e0eae	Add some more bufchain_try_* functions. We already had bufchain_try_fetch_consume; now we also have bufchain_try_fetch (for when you want to wait until that much data is available but then not commit to removing it), and bufchain_try_consume (so you can conveniently ignore a certain amount of incoming data).	2021-11-19 10:35:38 +00:00
Simon Tatham	c35d8b8328	win_set_[icon_]title: send a codepage along with the string. While fixing the previous commit I noticed that window titles don't actually _work_ properly if you change the terminal character set, because the text accumulated in the OSC string buffer is sent to the TermWin as raw bytes, with no indication of what character set it should interpret them as. You might get lucky if you happened to choose the right charset (in particular, UTF-8 is a common default), but if you change the charset half way through a run, then there's certainly no way the frontend will know to interpret two window titles sent before and after the change in two different charsets. So, now win_set_title() and win_set_icon_title() both include a codepage parameter along with the byte string, and it's up to them to translate the provided window title from that encoding to whatever the local window system expects to receive. On Windows, that's wide-string Unicode, so we can just use the existing dup_mb_to_wc utility function. But in GTK, it's UTF-8, so I had to write an extra utility function to encode a wide string as UTF-8.	2021-10-16 14:00:46 +01:00
Simon Tatham	cc3e4992d5	Break up x11fwd.c. This is a module that I'd noticed in the past was too monolithic. There's a big pile of stub functions in uxpgnt.c that only have to be there because the implementation of true X11 _forwarding_ (i.e. actually managing a channel within an SSH connection), which Pageant doesn't need, was in the same module as more general X11-related utility functions which Pageant does need. So I've broken up this awkward monolith. Now x11fwd.c contains only the code that really does all go together for dealing with SSH X forwarding: the management of an X forwarding channel (including the vtables to make it behave as Channel at the SSH end and a Plug at the end that connects to the local X server), and the management of authorisation for those channels, including maintaining a tree234 of possible auth values and verifying the one we received. Most of the functions removed from this file have moved into the utils subdir, and also into the utils library (i.e. further down the link order), because they were basically just string and data processing. One exception is x11_setup_display, which parses a display string and returns a struct telling you everything about how to connect to it. That talks to the networking code (it does name lookups and makes a SockAddr), so it has to live in the network library rather than utils, and therefore it's not in the utils subdirectory either. The other exception is x11_get_screen_number, which it turned out nothing called at all! Apparently the job it used to do is now done as part of x11_setup_display. So I've just removed it completely.	2021-04-18 08:18:27 +01:00
Simon Tatham	609502b04b	Add utility function 'memxor'.	2021-02-20 16:49:52 +00:00
Simon Tatham	c18e5dc8fb	cmdgen: add a --dump option. Also spelled '-O text', this takes a public or private key as input, and produces on standard output a dump of all the actual numbers involved in the key: the exponent and modulus for RSA, the p,q,g,y parameters for DSA, the affine x and y coordinates of the public elliptic curve point for ECC keys, and all the extra bits and pieces in the private keys too. Partly I expect this to be useful to me for debugging: I've had to paste key files a few too many times through base64 decoders and hex dump tools, then manually decode SSH marshalling and paste the result into the Python REPL to get an integer object. Now I should be able to get _straight_ to text I can paste into Python. But also, it's a way that other applications can use the key generator: if you need to generate, say, an RSA key in some format I don't support (I've recently heard of an XML-based one, for example), then you can run 'puttygen -t rsa --dump' and have it print the elements of a freshly generated keypair on standard output, and then all you have to do is understand the output format.	2020-02-22 18:42:13 +00:00
Simon Tatham	e0e133b4b0	Expose the rest of LoadedFile in headers. This will allow it to be used more conveniently for things other than key files. For the moment, the implementation still lives in sshpubk.c. Moving it out into utils.c or misc.c would be nicer, but it has awkward dependencies on marshal.c and the per-platform f_open function. Perhaps another time.	2020-02-09 22:02:23 +00:00
Simon Tatham	cbfba7a0e9	Greatly improve printf format-string checking. I've added the gcc-style attribute("printf") to a lot of printf-shaped functions in this code base that didn't have it. To make that easier, I moved the wrapping macro into defs.h, and also enabled it if we detect the __clang__ macro as well as __GNUC__ (hence, it will be used when building for Windows using clang-cl). The result is that a great many format strings in the code are now checked by the compiler, where they were previously not. This causes build failures, which I'll fix in the next commit.	2020-01-26 16:35:04 +00:00
Simon Tatham	7590d0625b	Introduce and use strbuf_chomp. Those chomp operations in wincons.c and uxcons.c looked ugly, and I'm not totally convinced they couldn't underrun the buffer by 1 byte in weird circumstances. strbuf_chomp is neater.	2020-01-22 22:30:26 +00:00
Simon Tatham	5891142aee	New functions to shrink a strbuf. These are better than my previous approach of just assigning to sb->len, because firstly they check by assertion that the new length is within range, and secondly they preserve the invariant that the byte stored in the buffer just after the length runs out is \0. Switched to using the new functions everywhere a grep could turn up opportunities.	2020-01-21 20:24:04 +00:00
Simon Tatham	ef843e9638	New macro PTRLEN_DECL_LITERAL. This is like PTRLEN_LITERAL, but you can use it in a _declaration_ of a compile-time constant ptrlen, instead of a literal in expression context. 'const ptrlen foo = PTRLEN_DECL_LITERAL("bar");'	2020-01-09 19:57:35 +00:00
Simon Tatham	1547c9c1ec	Make dupcat() into a variadic macro. Up until now, it's been a variadic _function_, whose argument list consists of 'const char *' ASCIZ strings to concatenate, terminated by one containing a null pointer. Now, that function is dupcat_fn(), and it's wrapped by a C99 variadic _macro_ called dupcat(), which automatically suffixes the null-pointer terminating argument. This has three benefits. Firstly, it's just less effort at every call site. Secondly, it protects against the risk of accidentally leaving off the NULL, causing arbitrary words of stack memory to be dereferenced as char pointers. And thirdly, it protects against the more subtle risk of writing a bare 'NULL' as the terminating argument, instead of casting it explicitly to a pointer. That last one is necessary because C permits the macro NULL to expand to an integer constant such as 0, so NULL by itself may not have pointer type, and worse, it may not be marshalled in a variadic argument list in the same way as a pointer. (For example, on a 64-bit machine it might only occupy 32 bits. And yet, on another 64-bit platform, it might work just fine, so that you don't notice the mistake!) I was inspired to do this by happening to notice one of those bare NULL terminators, and thinking I'd better check if there were any more. Turned out there were quite a few. Now there are none.	2019-10-14 19:42:37 +01:00
Simon Tatham	5d718ef64b	Whitespace rationalisation of entire code base. The number of people has been steadily increasing who read our source code with an editor that thinks tab stops are 4 spaces apart, as opposed to the traditional tty-derived 8 that the PuTTY code expects. So I've been wondering for ages about just fixing it, and switching to a spaces-only policy throughout the code. And I recently found out about 'git blame -w', which should make this change not too disruptive for the purposes of source-control archaeology; so perhaps now is the time. While I'm at it, I've also taken the opportunity to remove all the trailing spaces from source lines (on the basis that git dislikes them, and is the only thing that seems to have a strong opinion one way or the other). Apologies to anyone downstream of this code who has complicated patch sets to rebase past this change. I don't intend it to be needed again.	2019-09-08 20:29:21 +01:00
Jacob Nevins	81be535f67	Tweak __attribute__((format)) for MinGW. This silences a bunch of spurious format warnings on an Ubuntu 14.04 mingw-w64 cross-compilation.	2019-04-21 13:02:40 +01:00
Simon Tatham	ce780c9b33	Add casts to silence VS warnings in GET_32BIT et al. Visual Studio is quite aggressive about displaying warnings everywhere that you implicitly narrow from one integer type to another, and I've not generally felt it improves readability to add enough explicit casts to silence the warnings. But the ones in the inline functions in misc.h are literally two orders of magnitude more annoying than the rest, because that file gets included in nearly every translation unit, so the warnings come up over 100 times each. So I think these are worth fixing.	2019-04-06 10:25:27 +01:00
Simon Tatham	4d69032d2c	New utility function to read a whole disk file. I'm going to want this in a moment for Uppity, and it seems like the sort of thing I should put straight into utils.c now, rather than having to move it over later when I inevitably find another use for it. Rather than insisting on allocating a string buffer the way fgetline does, it reads a whole file and transfers the result into an arbitrary BinarySink, which works out the same if you use a strbuf at the call site, but can do other things too if that turns out useful.	2019-03-28 18:12:48 +00:00
Simon Tatham	7ae5c35419	New utility function: ptrlen_get_word(). This is similar to strtok, only it operates on a ptrlen. Therefore it can be properly stateless, or rather, it stores its state by overwriting the input ptrlen to point to a tail of its previous value. Also in this commit I add a clarifying comment about when ptrlen_{starts,ends}with will write through its 'tail' pointer.	2019-03-24 14:13:37 +00:00
Simon Tatham	3936616feb	Add line-length limit feature in StripCtrlChars. Now it can optionally check that output lines don't go beyond a certain length (measured in terminal columns, via wcwidth, rather than bytes or characters). In this mode, lines are prefixed with a distinctive character (namely '|'), and if a line is too long, then it is broken and the continuation line gets a different prefix ('>'). When StripCtrlChars is targeting a terminal, it asks the terminal to call wcwidth on its behalf, so it can be sure to use the same idea as the real terminal about which characters are wide (i.e. depending on the configuration of ambiguous characters). This mode isn't yet used anywhere.	2019-03-16 12:25:23 +00:00
Simon Tatham	d049f0ab6c	Make stripctrl_string take an existing StripCtrlChars. Now instead of making a StripCtrlChars just for that function call, it uses an existing one, pointing it at the output strbuf via stripctrl_retarget. This adds flexibility (now you can use the same convenient string-sanitising function with a StripCtrl configured in any way you like) and also saves pointless setting-up and tearing-down of identical sccs all the time. The existing call sites in PSCP and PSFTP now use a static StripCtrlChars instance that was made at program startup.	2019-03-09 16:21:49 +00:00
Simon Tatham	cfef137ea2	StripCtrlChars: retarget and reset functions. stripctrl_retarget() points the StripCtrlChars at a new BinarySink, to avoid having to pointlessly throw it away and make a new one all the time. Since that probably means the same scc is going to be reused for processing a fresh data stream, we also don't want any character-set conversion state hanging over from the previous stream, so we also reset the state in the process. Just in case it's needed, stripctrl_reset() is also provided to do that operation on its own.	2019-03-09 16:21:49 +00:00
Simon Tatham	757c91e2de	Extra ptrlen function, ptrlen_endswith(). Identical to ptrlen_startswith(), only the other way round.	2019-03-09 16:21:49 +00:00
Simon Tatham	d2ddb2fdf4	Remove obsolete sanitise_term_data(). The previous commit removed its last use, so now we can garbage-collect it, including its long-standing FIXME comment which is now fulfilled by the new StripCtrlChars system.	2019-03-06 20:31:26 +00:00
Simon Tatham	e74790003c	StripCtrlChars: option to provide a target Terminal. If you use the new stripctrl_new_term() to construct a StripCtrlChars instead of the existing stripctrl_new(), then the resulting object will align itself with the character-set configuration of the Terminal object you point it at. (In fact, it'll reuse the same actual translation code, courtesy of the last few refactoring commits.) So it will interpret things as control characters precisely if that Terminal would also have done so. The previous locale-based sanitisation is appropriate if you're sending the sanitised output to an OS terminal device managed outside this process - the LC_CTYPE setting has the best chance of knowing how that terminal device will interpret a byte stream. But I want to start using the same sanitisation system for data intended for PuTTY's own internal terminal emulator, in which case there's no reason why LC_CTYPE should be expected to match that terminal's configuration, and no reason to need it to either since we can check the internal terminal configuration directly. One small bodge: stripctrl_new_term() is actually a macro, which passes in the function pointer term_translate() to the underlying real constructor. That's just so that console-only tools can link in stripctrl.c without acquiring a dependency on terminal.c (similarly to how we pass random_read in to the mp_random functions).	2019-03-06 20:31:26 +00:00
Simon Tatham	511eea450a	Factor out encode_utf8 from luni_send into utils.c. I knew there had to already be a UTF-8 encoder _somewhere_ in this code base, but it took me a while to find it! Now it's reusable in other contexts.	2019-03-06 19:05:36 +00:00
Simon Tatham	a7abc7c867	Extra-secure versions of sgrowarray and strbuf. These versions, distinguished by the _nm suffix on their names, avoid using realloc to grow the array, in case it moves the block and leaves a copy of the data in the freed memory at the old address. (The suffix 'nm' stands for 'no moving'.) Instead, the array is grown by making a new allocation, manually copying the data over, and carefully clearing the old block before freeing it. (An alternative would be to give this code base its own custom heap in which the ordinary realloc takes care about this kind of thing, but I don't really feel like going to that much effort!)	2019-03-02 06:54:17 +00:00
Simon Tatham	e0a76971cc	New array-growing macros: sgrowarray and sgrowarrayn. The idea of these is that they centralise the common idiom along the lines of if (logical_array_len >= physical_array_size) { physical_array_size = logical_array_len * 5 / 4 + 256; array = sresize(array, physical_array_size, ElementType); } which happens at a zillion call sites throughout this code base, with different random choices of the geometric factor and additive constant, sometimes forgetting them completely, and generally doing a lot of repeated work. The new macro sgrowarray(array,size,n) has the semantics: here are the array pointer and its physical size for you to modify, now please ensure that the nth element exists, so I can write into it. And sgrowarrayn(array,size,n,m) is the same except that it ensures that the array has size at least n+m (so sgrowarray is just the special case where m=1). Now that this is a single centralised implementation that will be used everywhere, I've also gone to more effort in the implementation, with careful overflow checks that would have been painful to put at all the previous call sites. This commit also switches over every use of sresize(), apart from a few where I really didn't think it would gain anything. A consequence of that is that a lot of array-size variables have to have their types changed to size_t, because the macros require that (they address-take the size to pass to the underlying function).	2019-02-28 20:15:38 +00:00
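
The array-growing idiom described in commit e0a76971cc can be illustrated roughly as follows. This is only a sketch under stated assumptions: the names `grow_array_sketch`, `GROW_ARRAY_N_SKETCH` and `GROW_ARRAY_SKETCH` are hypothetical, and the growth policy simply reuses the "5/4 plus 256" figures quoted in the commit message. It is not PuTTY's actual sgrowarray implementation.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>

/*
 * Hypothetical sketch: ensure an array can hold at least oldlen + extralen
 * elements, growing geometrically and checking for size_t overflow.
 * Names and policy are assumptions, not PuTTY's real code.
 */
static void *grow_array_sketch(void *ptr, size_t *allocated, size_t eltsize,
                               size_t oldlen, size_t extralen)
{
    assert(eltsize > 0);
    size_t max_elts = SIZE_MAX / eltsize;   /* most elements we could address */

    /* The requested logical length must itself be representable. */
    assert(oldlen <= max_elts && extralen <= max_elts - oldlen);
    size_t needed = oldlen + extralen;
    if (needed <= *allocated)
        return ptr;                         /* already big enough */

    /* Grow by a quarter plus a constant, clamped so nothing overflows. */
    size_t increment = needed / 4 + 256;
    size_t newsize = (increment <= max_elts - needed) ? needed + increment
                                                      : max_elts;

    void *newptr = realloc(ptr, newsize * eltsize);
    if (!newptr)
        abort();                            /* sketch: no graceful recovery */
    *allocated = newsize;
    return newptr;
}

/* Make sure elements n .. n+m-1 exist; 'size' must be a size_t lvalue. */
#define GROW_ARRAY_N_SKETCH(array, size, n, m) \
    ((array) = grow_array_sketch((array), &(size), sizeof(*(array)), (n), (m)))

/* Special case m = 1: make sure element n exists. */
#define GROW_ARRAY_SKETCH(array, size, n) GROW_ARRAY_N_SKETCH(array, size, n, 1)
```

A call site would then write `GROW_ARRAY_SKETCH(arr, arrsize, len); arr[len++] = value;`. Because the macro takes the address of the size variable, that variable has to be a `size_t`, which matches the type changes the commit message mentions.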

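The variadic-macro approach described in commit 1547c9c1ec (dupcat) might look something like the sketch below. The names `concat_fn_sketch` and `CONCAT_SKETCH` are hypothetical stand-ins, not the real dupcat_fn() and dupcat(); the point is only to show a C99 variadic macro appending a correctly typed null-pointer terminator so that callers cannot forget it or pass a bare `NULL`.

```c
#include <assert.h>
#include <stdarg.h>
#include <stdlib.h>
#include <string.h>

/*
 * Hypothetical sketch of a NULL-terminated variadic concatenation function.
 * The caller owns the returned string and must free() it.
 */
static char *concat_fn_sketch(const char *first, ...)
{
    va_list ap;
    const char *s;
    size_t len = 1;                         /* room for the trailing NUL */

    assert(first != NULL);                  /* at least one string expected */

    /* First pass: add up the lengths of all the arguments. */
    va_start(ap, first);
    for (s = first; s != NULL; s = va_arg(ap, const char *))
        len += strlen(s);
    va_end(ap);

    char *out = malloc(len);
    if (!out)
        abort();                            /* sketch: no graceful recovery */

    /* Second pass: copy each argument into place. */
    char *p = out;
    va_start(ap, first);
    for (s = first; s != NULL; s = va_arg(ap, const char *)) {
        size_t n = strlen(s);
        memcpy(p, s, n);
        p += n;
    }
    va_end(ap);
    *p = '\0';
    return out;
}

/* The macro supplies the terminator, explicitly cast to a pointer type. */
#define CONCAT_SKETCH(...) concat_fn_sketch(__VA_ARGS__, (const char *)NULL)
```

With this wrapper, `CONCAT_SKETCH("foo", "bar")` expands to a call that always ends in a pointer-typed null argument, which is the property the commit message is concerned with.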

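The reasoning in commit 31db2e67bb (smemeq returning unsigned rather than bool) can be illustrated with a branch-free comparison along these lines. The function name `ct_memeq_sketch` is hypothetical and this is not PuTTY's actual smemeq; it is a minimal sketch of the general technique, and a real implementation would need further care to stop a compiler from reintroducing data-dependent branches.

```c
#include <assert.h>
#include <stddef.h>

/*
 * Hypothetical sketch: compare two equal-length buffers without branching
 * on their contents. Returns 1 if they match and 0 otherwise, as an
 * unsigned value rather than a bool.
 */
static unsigned ct_memeq_sketch(const void *av, const void *bv, size_t len)
{
    /* The pointers themselves are not secret, so checking them is fine. */
    assert(av != NULL && bv != NULL);

    const unsigned char *a = av, *b = bv;
    unsigned diff = 0;

    /* OR together every differing bit; the loop never exits early. */
    for (size_t i = 0; i < len; i++)
        diff |= (unsigned)(a[i] ^ b[i]);

    /*
     * diff is in the range 0..255. Subtracting 1 wraps to all-ones only
     * when diff == 0, so bit 8 of (diff - 1) is set exactly when the
     * buffers were equal.
     */
    return ((diff - 1) >> 8) & 1;
}
```

The result is computed arithmetically from the accumulated difference, so there is no point at which a nonzero integer has to be normalised to 1 by assignment to a `bool`.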
125 Commits