cosmopolitan/libc
Ivan Komarov f7ff515961
*scanf() fixes to make TeX work (#1109)
* Fix reading the same symbol twice when using `{f,}scanf()`

PR #924 appears to use `unget()` subtly incorrectly when parsing
floating point numbers. The rest of the code only uses `unget()`
immediately followed by `goto Done;` to return back the symbol that
can't possibly belong to the directive we're processing.

With floating-point, however, the ungot characters could very well
be valid for the *next* directive, so we will essentially read them
twice. It can't be seen in `sscanf()` tests because `unget()` is a
no-op there, but the test I added for `fscanf()` fails like this:

        ...
        EXPECT_EQ(0xDEAD, i1)
                need 57005 (or 0xdead) =
                 got 908973 (or 0x000ddead)
        ...
        EXPECT_EQ(0xBEEF, i2)
                need 48879 (or 0xbeef) =
                 got 769775 (or 0x000bbeef)

This means we read 0xDDEAD instead of 0xDEAD and 0xBBEEF instead of
0xBEEF. I checked that both musl and glibc read 0xDEAD/0xBEEF, as
expected.

Fix the failing test by removing the unneeded `unget()` calls.

* Don't read invalid floating-point numbers in `*scanf()`

Currently, we just ignore any errors from `strtod()`. They can
happen either because no valid float can be parsed at all, or
because the state machine recognizes only a prefix of a valid
floating-point number.

Fix this by making sure `strtod()` parses everything we recognized,
provided it's non-empty. This requires to pop the last character
off the FP buffer, which is supposed to be parsed by the next
`*scanf()` directive.

* Make `%c` parsing in `*scanf()` respect the C standard

Currently, `%c`-style directives always succeed even if there
are actually fewer characters in the input than requested.

Before the fix, the added test fails like this:

        ...
        EXPECT_EQ(2, sscanf("ab", "%c %c %c", &c2, &c3, &c4))
                need 2 (or 0x02 or '\2' or ENOENT) =
                 got 3 (or 0x03 or '\3' or ESRCH)
        ...
        EXPECT_EQ(0, sscanf("abcd", "%5c", s2))
                need 0 (or 0x0 or '\0') =
                 got 1 (or 0x01 or '\1' or EPERM)

musl and glibc pass this test.
2024-02-23 07:15:30 -08:00
..
calls Introduce getcpu() system call from glibc 2024-02-21 18:17:20 -08:00
crt Release Cosmopolitan v3.3 2024-02-20 13:27:59 -08:00
dlopen Make cosmo.h work a little better 2024-02-21 16:40:09 -08:00
elf more modeline errata (#1019) 2023-12-16 23:07:10 -05:00
fmt Make cosmo.h work a little better 2024-02-21 16:40:09 -08:00
integral Release Cosmopolitan v3.3 2024-02-20 13:27:59 -08:00
intrin Make --ftrace more crash proof with foreign code 2024-02-22 11:06:31 -08:00
irq more modeline errata (#1019) 2023-12-16 23:07:10 -05:00
isystem Resurrect <windows.h> as <windowsesque.h> 2024-02-21 16:41:11 -08:00
log Fix regression in logger functions 2024-02-22 14:08:20 -08:00
mem Make cosmo.h work a little better 2024-02-21 16:40:09 -08:00
nexgen32e Make cosmo.h work a little better 2024-02-21 16:40:09 -08:00
nt Introduce getcpu() system call from glibc 2024-02-21 18:17:20 -08:00
proc Release Cosmopolitan v3.3 2024-02-20 13:27:59 -08:00
runtime Make --ftrace more crash proof with foreign code 2024-02-22 11:06:31 -08:00
sock Release Cosmopolitan v3.3 2024-02-20 13:27:59 -08:00
stdio *scanf() fixes to make TeX work (#1109) 2024-02-23 07:15:30 -08:00
str Release Cosmopolitan v3.3 2024-02-20 13:27:59 -08:00
sysv Release Cosmopolitan v3.3 2024-02-20 13:27:59 -08:00
testlib Make improvements 2024-02-12 10:23:00 -08:00
thread Release Cosmopolitan v3.3 2024-02-20 13:27:59 -08:00
time Release Cosmopolitan v3.3 2024-02-20 13:27:59 -08:00
tinymath Release Cosmopolitan v3.3 2024-02-20 13:27:59 -08:00
vga more modeline errata (#1019) 2023-12-16 23:07:10 -05:00
x Make cosmo.h work a little better 2024-02-21 16:40:09 -08:00
ar.h Reduce header complexity 2023-11-28 14:39:42 -08:00
assert.h Add dontthrow attribute to most libc functions 2024-01-09 01:26:03 -08:00
atomic.h Optimize memory layout 2022-09-12 04:26:52 -07:00
BUILD.mk Fine tune OpenMP some more 2024-01-30 06:30:24 -08:00
complex.h libc headers: make <complex.h> work, add struct ipv6_mreq (#1100) 2024-02-05 17:22:56 -05:00
cosmo.h Make improvements 2024-02-12 10:23:00 -08:00
cxxabi.h libc: Fix __cxa_thread_atexit prototype (#1088) 2024-01-22 10:23:28 -08:00
dce.h Make cosmo.h work a little better 2024-02-21 16:40:09 -08:00
dos.internal.h Introduce #include <cosmo.h> to toolchain users 2023-06-09 18:03:05 -07:00
empty.s Do some basic build tuning 2023-05-10 04:20:46 -07:00
errno.h Reduce header complexity 2023-11-28 14:39:42 -08:00
imag.internal.h Remove evil constants from cosmopolitan.h 2022-12-17 00:42:45 -08:00
inttypes.h Fix issues for latest GCC toolchain 2023-10-11 14:54:42 -07:00
iso646.internal.h Add more missing C / C++ headers 2022-09-04 04:53:52 -07:00
limits.h Make improvements 2023-10-08 08:59:53 -07:00
literal.h Get GNU MPFR and MPC tests to pass 2023-08-21 15:05:10 -07:00
mach.internal.h Reduce header complexity 2023-11-28 14:39:42 -08:00
macho.internal.h Fiddle around with Mach-O 2023-05-20 04:13:49 -07:00
macros.internal.h Make improvements 2024-02-12 10:23:00 -08:00
math.h Release Cosmopolitan v3.3 2024-02-20 13:27:59 -08:00
paths.h Embed cocmd.com interpreter for system() / open() 2022-10-02 15:29:57 -07:00
README.md Reformat libc README.md (#437) 2022-06-21 07:50:42 -07:00
serialize.h Reduce header complexity 2023-11-28 14:39:42 -08:00
stdalign.internal.h Add more missing C / C++ headers 2022-09-04 04:53:52 -07:00
stdbool.h Make improvements 2024-02-12 10:23:00 -08:00
stdckdint.h Reduce header complexity 2023-11-28 14:39:42 -08:00
stdlib.h Add dontthrow attribute to most libc functions 2024-01-09 01:26:03 -08:00
temp.h Add dontthrow attribute to most libc functions 2024-01-09 01:26:03 -08:00
testlib-test.txt Fix some zipos directory related bugs 2023-09-19 02:30:42 -07:00
type2str.h Improve new C23 checked arithmetic feature 2023-06-16 15:32:18 -07:00
zip.internal.h Add dontthrow attribute to most libc functions 2024-01-09 01:26:03 -08:00

Cosmopolitan Standard Library

This directory defines static archives defining functions, like printf(), mmap(), win32, etc. Please note that the Cosmopolitan build configuration doesn't link any C/C++ library dependencies by default, so you still have the flexibility to choose the one provided by your system. If you'd prefer Cosmopolitan, just add $(LIBC) and $(CRT) to your linker arguments.

Your library is compromised of many bite-sized static archives. We use the checkdeps tool to guarantee that the contents of the archives are organized in a logical way that's easy to use with or without our makefile infrastructure, since there's no cyclic dependencies.

The Cosmopolitan Library exports only the most stable canonical system calls for all supported operating systems, regardless of which platform is used for compilation. We polyfill many of the APIs, e.g. read(), write() so they work consistently everywhere while other apis, e.g. CreateWindowEx(), might only work on one platform, in which case they become no-op functions on others.

Cosmopolitan polyfill wrappers will usually use the dollar sign naming convention, so they may be bypassed when necessary. This same convention is used when multiple implementations of string library and other performance-critical function are provided to allow Cosmopolitan to go fast on both old and newer computers.

We take an approach to configuration that relies heavily on the compiler's dead code elimination pass (libc/dce.h). Most of the code is written so that, for example, folks not wanting support for OpenBSD can flip a bit in SUPPORT_VECTOR and that code will be omitted from the build. The same is true for builds that are tuned using -march=native which effectively asks the library to not include runtime support hooks for x86 processors older than what you use.

Please note that, unlike Cygwin or MinGW, Cosmopolitan does not achieve broad support by bolting on a POSIX emulation layer. We do nothing more than (in most cases) stateless API translations that get you 90% of the way there in a fast lightweight manner. We therefore can't address some of the subtle differences, such as the nuances of absolute paths on Windows. Our approach could be compared to something more along the lines of, "the Russians just used a pencil to write in space", versus spending millions researching a pen like NASA.