crypto/init.c: do not try to load itself when linked statically #6725

LionNatsu · 2018-07-16T23:49:06Z

The leak-one-reference trick for keeping the crypto module loaded is
pointless when the program has already statically linked OpenSSL.
It can be also dangerous, because DSO_dsobyaddr will try to load
the main program itself.

In Linux, for example, trying to load an executable file will generate an
error and be ignored, but bad things will happen when the main program is
compiled and linked with PIE hardening. Because the PIE-hardened program acts
more like a shared object, it can be accidentally dlopen() and loaded into
memory again, mess up the mapping and flush the global state.

The leak-one-reference trick for keeping the crypto module loaded is pointless when the program has already statically linked OpenSSL. It can be also dangerous, because DSO_dsobyaddr will try to load the main program itself. In Linux, for example, trying to load an executable file will generate an error and be ignored, but bad things will happen when the main program is compiled and linked with PIE hardening. Because the PIE-hardened program acts more like a shared object, it can be accidentally dlopen() and loaded into memory again, mess up the mapping and flush the global state. CLA: trivial

The statically linked program should not try to keep loaded by dlopen trick. Fixes: nodejs#21845 Refs: openssl/openssl#6725

LionNatsu · 2018-08-31T17:45:20Z

Ping?

levitte · 2018-09-03T12:01:44Z

You seem to have fixed it for nodejs, can we ask you to create a PR for OpenSSL proper as well?

levitte · 2018-09-03T12:02:25Z

I assume, btw, that this is an issue both in master and in the 1.1.0 branch, correct?

LionNatsu · 2018-09-03T14:25:36Z

You seem to have fixed it for nodejs, can we ask you to create a PR for OpenSSL proper as well?

Emmmm yes, sir. But I don't understand, this is a PR, isn't it? Please correct me if there is anything wrong.

I assume, btw, that this is an issue both in master and in the 1.1.0 branch, correct?

Correct. So far there is nothing that checks OPENSSL_NO_STATIC_ENGINE in 1.1.0 stable branch.

openssl/crypto/init.c

Lines 118 to 128 in b50c9f3

    
           static CRYPTO_ONCE load_crypto_nodelete = CRYPTO_ONCE_STATIC_INIT; 
        
           DEFINE_RUN_ONCE_STATIC(ossl_init_load_crypto_nodelete) 
        
           { 
        
           #ifdef OPENSSL_INIT_DEBUG 
        
               fprintf(stderr, "OPENSSL_INIT: ossl_init_load_crypto_nodelete()\n"); 
        
           #endif 
        
           #if !defined(OPENSSL_NO_DSO) && !defined(OPENSSL_USE_NODELETE) 
        
           # ifdef DSO_WIN32 
        
               { 
        
                   HMODULE handle = NULL; 
        
                   BOOL ret;

levitte · 2018-09-03T16:39:49Z

D'oh, you're right, this is indeed a PR. I have no idea how I missed that earlier today. My apologies...

levitte · 2018-09-03T18:21:01Z

Uhmmmm, I gotta ask, what is it that would cause dladdr to call dlopen? After all, DSO_pathbyaddr ends up calling dladdr, at least on dlfcn systems.

LionNatsu · 2018-09-03T19:39:14Z

what is it that would cause dladdr to call dlopen? After all, DSO_pathbyaddr ends up calling dladdr, at least on dlfcn systems.

Sorry, could you explain more about your question?

I'd like to make a short summary here, for your information:
A statically linked OpenSSL libcrypto:

OPENSSL_init_crypto called RUN_ONCE(&load_crypto_nodelete, ossl_init_load_crypto_nodelete),
ossl_init_load_crypto_nodelete called DSO_free(DSO_dsobyaddr(&base_inited, DSO_FLAG_NO_UNLOAD_ON_FREE));.
where static int base_inited = 0; in "crypto/init.c". This address is linked into the body of the main program, when no-shared.
DSO_dsobyaddr called DSO_pathbyaddr(addr, ..., ...) to get the path of OpenSSL crypto library itself, using dladdr(). But when no-shared, it will get the path of the main program.
DSO_dsobyaddr later, called DSO_load(NULL, filename, NULL, flags), using dlopen() to load the main program.

levitte · 2018-09-04T12:39:29Z

Sorry, I mixed up DSO_dsobyaddr and DSO_pathbyaddr...

It seems a bit uncertain how this gets addressed... after all, we create a static library even when building the shared library (except on Windows 'cause reasons), and that might end up in the application as well and still cause your issue. On the other hand, it may also be linked into another shared library.

But awright, this is a specific use case and we can treat it as such.

levitte · 2018-09-04T12:50:32Z

Note that I'm approving with a bit of trepidation...

paulidale · 2018-09-04T20:16:39Z

I'm unconvinced but not to the point of rejecting this. Not having static engines doesn't mean OpenSSL is statically linked, it just means that OpenSSL doesn't have any built in engines. It can be either statically or dynamically linked in such a case.

I'm having difficulties trying to work out how a library could detect if it is statically linked or dynamically loaded. The best that comes to mind is a comparison of returns from DSO_dsobyaddr(&main) and DSO_dsobyaddr(&OPENSSL_init_crypto) but this won't be universal.

LionNatsu · 2018-09-05T09:27:31Z

How about this?

Let the build script

compiles and links .so without OPENSSL_STATIC_LIBCRYPTO, then,
compiles and links .a with OPENSSL_STATIC_LIBCRYPTO

trying to work out how a library could detect if it is statically linked or dynamically loaded.

Sure, detecting at runtime will be better.

levitte · 2018-09-05T12:38:01Z

How about this?

Let the build script

compiles and links .so without OPENSSL_STATIC_LIBCRYPTO, then,

compiles and links .a with OPENSSL_STATIC_LIBCRYPTO

Note that this won't cover when a static library is linked into another shared library. I assume that's not a common thing to do, but it's still perfectly possible, given a build with -fPIC or similar...

Anyway, this will require creating separate object files for shared library and for static library, which our build system currently doesn't support (it's in the planning for the next major release, but nothing really stops us from introducing it earlier... however, considering where we are in the release cycle for the moment, it's probably much too late to get this in for 1.1.1).

trying to work out how a library could detect if it is statically linked or dynamically loaded.

Sure, detecting at runtime will be better.

Can this be done? How?

levitte · 2018-09-05T12:42:20Z

The best that comes to mind is a comparison of returns from DSO_dsobyaddr(&main) and DSO_dsobyaddr(&OPENSSL_init_crypto) but this won't be universal.

I would suggest using DSO_pathbyaddr instead, since DSO_dsobyaddr might do a load, which isn't what we want in this case.

LionNatsu · 2018-09-05T16:35:03Z

Sure, detecting at runtime will be better.
Can this be done? How?

No idea.

Note that this won't cover when a static library is linked into another shared library.

That's quite interesting. In fact, I'm wondering why libcrypto.so tries to prevent itself from unloading. I assume that is for improving performance.

Normally, a program "linked a shared library" loads the library by ld.so automatically and will never "dlclose" it. The underlying reference count should be always greater than one, so leaking another reference to prevent from unloading by "dlclose" is a no-op in fact.

[main] == DT_NEEDED => [libcrypto.so]
[main] == DT_NEEDED => [libfoo.so] == DT_NEEDED => [libcrypto.so]

A program "linked a static library" will never call the unload handler until the program itself is going to exit, so adding reference count is also a no-op, or in fact not permitted by system or letting PIE program to be reloaded.

[main & libcrypto.a]
[main & (libfoo.a & libcrypto.a)]

This is a chain, all libraries actually (recursively) "linked" with the main program always stay with the living process, regardless of "static" or "shared".

Programs or libraries which NOT link libcrypto.so but "dlopen" it during runtime, may "dlclose" it and "dlopen" it again soon. If we leak a reference, libcrypto.so will stay there. I think this might be (one of) the case(s) that it should leak a reference. Am I correct?

[any] -- dlopen() --> [libcrypto.so]  # init
      -- dlclose() -> [libcrypto.so]  # unload
      -- dlopen() --> [libcrypto.so]  # init
      -- dlclose() -> [libcrypto.so]  # unload

Now, consider a library was statically linked into another library, say libpassword-manager.so. Now the same pattern here: a program "dlopen" libpassword-manager.so and "dlclose" it soon. Shall we force libpassword-manager.so to leak one reference? What if the parts outside of libcrypto.a want to be unloaded (e.g. to synchronize files, to clean buffers, to disconnect sockets) as the "dlclose" of the main program implies?

[any] -- dlopen() --> [libpm.so & libcrypto.a]  # init libpm, init libcrypto
      -- dlclose() -> [libpm.so & libcrypto.a]  # unload libpm, unload libcrypto
      -- dlopen() --> [libpm.so & libcrypto.a]  # init libpm, init libcrypto
      -- dlclose() -> [libpm.so & libcrypto.a]  # unload libpm, unload libcrypto

So my conclusion is .a shouldn't leak any reference of itself, otherwise other parts than OpenSSL could be lingering in the memory and unload handlers will not be called as intended.

LionNatsu · 2018-09-05T18:00:08Z

Let the build script
compiles and links .so without
OPENSSL_STATIC_LIBCRYPTO, then,
compiles and links .a with
OPENSSL_STATIC_LIBCRYPTO

Note that this won't cover when a static library is linked into another shared library.

If you agree with the comment above, that a static library shouldn't do the "nodelete" no matter it was linked into a program or another library, preventing the static library from doing this does simply cover all the situations in my opinion.

levitte · 2018-09-05T23:00:13Z

The no-delete thing was added to cover for another issue, our use of atexit, which executes at process takedown, not when libcrypto is unloaded. It's quite possible that there's a better mechanism that will actually run on libcrypto unload, but if there is such a thing, we need it to be portable.

paulidale · 2018-09-05T23:16:39Z

On many Unix systems, there is a concept of a dtor (destructor) list which corresponds to the ctor (constructor) one but runs on library/program unload. I'd be surprised in Windows didn't have something similar.

levitte · 2018-09-05T23:29:03Z

On many Unix systems, there is a concept of a dtor (destructor) list which corresponds to the ctor (constructor) one but runs on library/program unload.

Sure... and they don't necessarily look the same. This is completely undefined by POSIX. So there is that portability bit...

I'd be surprised in Windows didn't have something similar.

One word: DllMain

LionNatsu · 2018-09-06T04:42:40Z

It's quite possible that there's a better mechanism that will actually run on libcrypto unload, but if there is such a thing, we need it to be portable.

"The most portable" mechanism is DT_INIT and DT_FINI in standard ELF format. They are corresponding to _init(void) and _fini(void) in UNIX. They are exclusive thus there can only be one handler for init, one handler for fini.

Most of UNIX-like system also implemented DT_INIT_ARRAY and DT_FINI_ARRAY. GNU C library places a function in the array so it can provide atexit(void) and on_exit(arg). Some compilers have __attribute__((constructor)) and __attribute__((destructor)) and these function attributes add functions to that array.

An incomplete list of compilers support __attribute__(({con,de}structor)):

GNU C compiler: https://gcc.gnu.org/onlinedocs/gcc/Common-Function-Attributes.html
Clang compiler: llvm-mirror/clang@3068ae0f (snice 2008)
Arm Compiler: http://www.keil.com/support/man/docs/armcc/armcc_chr1359124975087.htm
Solaris C compiler: https://docs.oracle.com/cd/E24457_01/html/E21990/gjzke.html
Intel C++ Compiler: https://software.intel.com/en-us/c-compilers claims the compilers have source compatibility with the GNU Compiler Collection (GCC) on Linux*, macOS*, and Android*.

IBM XL C/C++ for AIX does not support __attribute__(({con,de}structor)) but _init(void)/_fini(void).

As I described above, atexit(void) in GNU C library is placed at DT_FINI_ARRAY so it will also be called when the shared library is unloaded. But the original POSIX standard said it will only be call "at normal process termination". Such uncertainty could be hard to deal with.

If we don't use "no-delete" things, I think the most general method will be __attribute__(({con,de}structor)). I can also found some other uses in the source code of OpenSSL https://github.com/openssl/openssl/search?q=__attribute__+constructor

levitte · 2018-09-06T06:27:12Z

I know that ELF is quite popular these days, and that certainly simplifies things... but OpenSSL doesn't count on being built on ELF only.

I'll throw in a curve ball: ftp://ftp.mrynet.com/os/HP-MPE/docs.hp.com/en/11782/LinkerOnliinehelp0709/libraryroutines.htm

paulidale · 2018-09-06T07:12:56Z

Initialisers and finalisers are present here. Albeit using a different method. Still, the point is made. We can’t rely on them being present. Pauli

…

On 6 Sep 2018, at 4:27 pm, Richard Levitte ***@***.***> wrote: I know that ELF is quite popular these days, and that certainly simplifies things... but OpenSSL doesn't count on being built on ELF only. I'll throw in a curve ball: ftp://ftp.mrynet.com/os/HP-MPE/docs.hp.com/en/11782/LinkerOnliinehelp0709/libraryroutines.htm — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#6725 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/ARFiwQ1j17hu8nboBQIbRivhCBL6SUkcks5uYMBQgaJpZM4VR-DK>.

levitte · 2018-09-06T07:44:35Z

Albeit using a different method.

Exactly, hence portability problems that will need to be resolved should we decide to adopt this route. Personally, I would have much preferred to use the initializers and finalizers, but the choice that was made does reflect that we couldn't be sure we covered all the bases.

LionNatsu · 2018-09-06T10:06:55Z

Emm... Using one method to try to workaround all the platforms (except Windows) is neat and does keep the code small.

However, will preprocessor branches work better for the portability issue? Branches for __attribute__((destructor)), for DllMain(), for #progma fini, and for other situations…?

levitte · 2018-09-06T20:17:33Z

Yeah, I can see us doing that. It will require some careful coding and should therefore not be rushed. I don't think it's likely to go into 1.1.1, but probably in a subsequent update, unless this is seen as a showstopper. @mattcaswell, what are your thoughts on the matter?

mattcaswell · 2018-09-07T08:42:21Z

Yeah, I can see us doing that. It will require some careful coding and should therefore not be rushed. I don't think it's likely to go into 1.1.1, but probably in a subsequent update, unless this is seen as a showstopper. @mattcaswell, what are your thoughts on the matter?

Absolutely not to be rushed and not a showstopper for 1.1.1 IMO.

BridgeAR · 2019-06-04T10:49:40Z

Could this be looked at again?

lal12 · 2019-06-04T18:21:22Z

Could this be looked at again?

The OPENSSL_NO_PINSHARED was added, which does achieves that fix, but the build system does not seem to use that. I don't know if the user is expected to specify the define themselves or if it just was forgotten. However I guess it renders this PR outdated.

mattcaswell · 2019-06-05T09:49:46Z

The OPENSSL_NO_PINSHARED was added, which does achieves that fix, but the build system does not seem to use that. I don't know if the user is expected to specify the define themselves or if it just was forgotten. However I guess it renders this PR outdated.

Specifying the "no-pinshared" option to Configure will cause that define to be specified.

lal12 · 2019-06-05T10:04:13Z

Ok, I overlooked that. I looked for a more "automatic" way, like when building only static. But it is probably hard to detect for many cases, by "automatic" means.

openssl-machine · 2020-06-02T14:18:30Z

This issue is in a state where it requires action by @openssl/committers but the last update was 363 days ago

t8m · 2021-05-12T09:13:53Z

Closing as this is already handled by the no-pinshared option.

LionNatsu changed the title ~~crypto: do not try to load itself when linked statically~~ crypto/init.c: do not try to load itself when linked statically Jul 16, 2018

LionNatsu force-pushed the patch-1 branch from 0a6b808 to 134d225 Compare July 16, 2018 23:55

LionNatsu mentioned this pull request Jul 17, 2018

node crashes with "free(): invalid pointer" when exiting nodejs/node#21845

Closed

LionNatsu added a commit to LionNatsu/node that referenced this pull request Jul 17, 2018

deps: backport crypto/init.c fix for openssl110

9868d40

The statically linked program should not try to keep loaded by dlopen trick. Fixes: nodejs#21845 Refs: openssl/openssl#6725

LionNatsu mentioned this pull request Jul 17, 2018

deps: backport crypto/init.c fix for openssl110 nodejs/node#21848

Closed

2 tasks

mattcaswell added this to the Assessed milestone Jul 20, 2018

levitte approved these changes Sep 4, 2018

View reviewed changes

levitte requested a review from mattcaswell September 4, 2018 12:50

richardlau mentioned this pull request Jun 3, 2019

/usr/bin/node: '/usr/bin/node' is not an ELF executable for ARM nodejs/node#27925

Closed

mgabeler-lee-6rs mentioned this pull request Mar 5, 2020

exec -a with unqualified path causes memory corruption in older nodejs versions nodenv/nodenv#151

Closed

iamamoose modified the milestones: Assessed, Post 1.1.1 Jun 29, 2020

t8m closed this May 12, 2021

LionNatsu deleted the patch-1 branch July 17, 2021 12:31

Uh oh!

crypto/init.c: do not try to load itself when linked statically #6725

crypto/init.c: do not try to load itself when linked statically #6725

Uh oh!

Conversation

LionNatsu commented Jul 16, 2018

Uh oh!

LionNatsu commented Aug 31, 2018

Uh oh!

levitte commented Sep 3, 2018

Uh oh!

levitte commented Sep 3, 2018

Uh oh!

LionNatsu commented Sep 3, 2018

Uh oh!

levitte commented Sep 3, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

levitte commented Sep 3, 2018

Uh oh!

LionNatsu commented Sep 3, 2018

Uh oh!

levitte commented Sep 4, 2018

Uh oh!

levitte commented Sep 4, 2018

Uh oh!

paulidale commented Sep 4, 2018

Uh oh!

LionNatsu commented Sep 5, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

levitte commented Sep 5, 2018

Uh oh!

levitte commented Sep 5, 2018

Uh oh!

LionNatsu commented Sep 5, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

LionNatsu commented Sep 5, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

levitte commented Sep 5, 2018

Uh oh!

paulidale commented Sep 5, 2018

Uh oh!

levitte commented Sep 5, 2018

Uh oh!

LionNatsu commented Sep 6, 2018

Uh oh!

levitte commented Sep 6, 2018

Uh oh!

paulidale commented Sep 6, 2018 via email

Uh oh!

levitte commented Sep 6, 2018

Uh oh!

LionNatsu commented Sep 6, 2018

Uh oh!

levitte commented Sep 6, 2018

Uh oh!

mattcaswell commented Sep 7, 2018

Uh oh!

BridgeAR commented Jun 4, 2019

Uh oh!

lal12 commented Jun 4, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mattcaswell commented Jun 5, 2019

Uh oh!

lal12 commented Jun 5, 2019

Uh oh!

openssl-machine commented Jun 2, 2020

Uh oh!

t8m commented May 12, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

levitte commented Sep 3, 2018 •

edited

Loading

LionNatsu commented Sep 5, 2018 •

edited

Loading

LionNatsu commented Sep 5, 2018 •

edited

Loading

LionNatsu commented Sep 5, 2018 •

edited

Loading

lal12 commented Jun 4, 2019 •

edited

Loading