Analyzing some encrypted ARJ archives

There's an item in the Internet Archive's software collection, which someone uploaded and named "EPS CD 2 German 1994". https://archive.org/details/eps-cd-2-german-1994 It contains a few hundred .EXE files, organized in a handful of directories. And that's all. There's no explanation of what it is. No documentation. Not even a picture of the CD-ROM that it was … Continue reading Analyzing some encrypted ARJ archives

Notes on writing “magic” patterns for the “file” command

The file command is a classic Unix-centric utility that guesses the format of a file, based only on the file's contents. It may also print some additional information about the contents of the file. The modern open-source version of it can be found at <https://darwinsys.com/file/&gt;. Other posts about the file command: A script to analyze … Continue reading Notes on writing “magic” patterns for the “file” command

Win32 I/O character encoding supplement 3: UTF-8 manifest

For a list of other posts in this series, refer to the first post. A relatively recent Windows software development feature, affecting character encoding, is the ability to request a specific "ANSI" character encoding (or "code page"), presumably UTF-8, using a manifest. I decided to investigate what this really does. This "manifest method" is independent … Continue reading Win32 I/O character encoding supplement 3: UTF-8 manifest

Updated survey of LHarc and LHA

Since my first post on DOS versions of LHarc/LHA, I've found a few more versions of the software. Six of them appear to be original/official, and all of those are Japanese-language: 1.13d, 2.05b, 2.13, 2.52, 2.54, and 2.55. And I found quite a few new modified or hacked versions, two of which I'll discuss: "v1.14a" … Continue reading Updated survey of LHarc and LHA

Survey of EXPAND/DECOMP utilities

If you look at old DOS and Windows software distribution disks, you may see a lot of files whose names have the last character replaced by a "_" character, or sometimes a "$" character. For example: mplayer.ex_ mplayer.hl_ mplayer.re_ msacm.dl_ msacm.dr_ msadpcm.ac_ mscomstf.dl_ ... Many such files belong to a family of compressed file formats … Continue reading Survey of EXPAND/DECOMP utilities

What is LZSS compression?

I'm not asking how to implement LZSS. I'm asking how to distinguish things-that-are-LZSS from things-that-are-not-LZSS. It's generally understood that LZSS is a kind of data compression. It's supposedly a derivative of LZ77. But you may struggle to find out anything definitive or verifiable about LZSS. The name LZSS is more than likely derived from Lempel–Ziv–Storer–Szymanski, … Continue reading What is LZSS compression?