Mnemeric

Mnemeric is a data encoding algorithm for turning lengthy, unintelligible codes like device serial numbers and passcodes into natural language phrases that you can remember and communicate easily.

This prototype was developed in 39 hours for the SYNCSHACK 2021 hackathon and won the competition's Best Algorithm Prize.

See a working prototype here.

Watch a video demo here.

Background

This project is the product of our accumulated frustration at having to remember lengthy and confusing credit card details, passwords, student IDs and phone numbers, and at having to communicate gift card codes and device serial numbers to technical support over the phone. Humans aren't naturally good at remembering unintelligible sequences of unrelated symbols; they just aren't as meaningful as words.

Reducing this information overload without sacrificing our privacy will declutter our heads and enable all individuals who participate in modern life to focus on what really matters.

What is Mnemeric

Mnemeric is a cipher that translates between binary data and natural language. Users can supply complex sequences of ASCII characters on our website and encode them into simple, memorable words. They can customise the settings to cater for various code types (numeric, alphanumeric, UPC, ISBN, etc.). Conversely, natural language phrases can be decoded back into the original ASCII characters.
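To make the round trip concrete, here is a minimal sketch. It assumes a toy four-word list (so each word carries only 2 bits) and hypothetical `encode` and `decode` helpers; it is not the project's actual implementation, which uses a much larger word list and a control word.

```javascript
// Toy stand-in for the real word list: 4 words, so each word carries 2 bits.
const WORDS = ["apple", "river", "stone", "cloud"];
const BITS_PER_WORD = 2; // log2(WORDS.length)

// ASCII text -> space-separated words.
function encode(text) {
  let bits = "";
  for (const ch of text) {
    bits += ch.charCodeAt(0).toString(2).padStart(8, "0"); // 8 bits per character
  }
  const words = [];
  for (let i = 0; i < bits.length; i += BITS_PER_WORD) {
    words.push(WORDS[parseInt(bits.slice(i, i + BITS_PER_WORD), 2)]);
  }
  return words.join(" ");
}

// Space-separated words -> ASCII text.
function decode(phrase) {
  let bits = "";
  for (const word of phrase.split(" ")) {
    const index = WORDS.indexOf(word);
    if (index < 0) throw new Error(`unknown word: ${word}`);
    bits += index.toString(2).padStart(BITS_PER_WORD, "0");
  }
  let text = "";
  for (let i = 0; i + 8 <= bits.length; i += 8) {
    text += String.fromCharCode(parseInt(bits.slice(i, i + 8), 2));
  }
  return text;
}

const phrase = encode("A7");
// phrase is "river apple apple river apple cloud river cloud"
const roundTrip = decode(phrase); // "A7"
```

With only four words the phrase is longer than the input. The benefit appears once each word carries many more bits, because far fewer words are needed per character.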

The Filtered Dictionary of Words for Encoding

We calculated the frequencies of all the words in the OANC (Open American National Corpus), applied length and character filtering (lowercasing, removing hyphens, etc.), filtered out profanity using a profanity list from Carnegie Mellon University, and then took the 8192 most frequent words. This allows us to encode 13 bits into a single dictionary word, since 2^13 = 8192.
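The filtering pipeline could look roughly like the sketch below. The `buildDictionary` helper, the 3 to 8 letter length bounds and the choice to discard (rather than repair) tokens containing non-letters are all assumptions made for illustration; the project's real filters may differ.

```javascript
// Hypothetical helper: build a fixed-size word list from a tokenised corpus.
// The length bounds (3 to 8 letters) are assumed for illustration only.
function buildDictionary(tokens, profanity, size = 1 << 13) {
  const counts = new Map();
  for (const raw of tokens) {
    const word = raw.toLowerCase();
    if (!/^[a-z]{3,8}$/.test(word)) continue; // character and length filter
    if (profanity.has(word)) continue;        // profanity filter
    counts.set(word, (counts.get(word) || 0) + 1);
  }
  return [...counts.entries()]
    .sort((a, b) => b[1] - a[1] || a[0].localeCompare(b[0])) // most frequent first, ties alphabetical
    .slice(0, size)
    .map(([word]) => word);
}

// Tiny example corpus; a real run would stream the whole corpus instead.
const sample = ["The", "cat", "the", "well-known", "cat", "the", "darn", "dog"];
const dict = buildDictionary(sample, new Set(["darn"]), 2);
// dict is ["the", "cat"]
```

Keeping the list size at a power of two means every word index maps to a fixed number of bits, with no unused indices.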

Encoding Scheme and Control Word

Encoding directly between ASCII and dictionary words gives only 13/8 ≈ 1.6 characters per word, which is quite poor, and many types of real-world data use a limited alphabet of characters. We therefore implemented an encoding scheme that optimises the encoding of different types of data. Our scheme uses a **13-bit control word** at the start of the code phrase to store the encoding scheme (5 bits), a message length offset (4 bits) which is necessary for accurate decoding, a small checksum (2 bits) for validation, and an encoding number (2 bits) which allows the user to select between four possible encodings. The encoding number lets users avoid rare code phrases that contain several duplicate words or specific unwanted words.

| Bits | 0-4 | 5-8 | 9-10 | 11-12 |
| --- | --- | --- | --- | --- |
| Field | Encoding Scheme | End of Message Offset | Checksum | Hash |
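The layout above can be packed into a single 13-bit integer, which is then usable as a dictionary index. The sketch below assumes that bit 0 is the most significant bit; the `packControlWord` and `unpackControlWord` names are hypothetical and not taken from the project's source.

```javascript
// Field widths follow the layout: scheme (5), offset (4), checksum (2), hash (2).
// Assumption: bit 0 in the layout is the most significant bit of the 13-bit value.
function packControlWord({ scheme, offset, checksum = 0, hash = 0 }) {
  const fields = [[scheme, 31], [offset, 15], [checksum, 3], [hash, 3]];
  for (const [value, max] of fields) {
    if (!Number.isInteger(value) || value < 0 || value > max) {
      throw new RangeError(`field value ${value} is outside 0..${max}`);
    }
  }
  return (scheme << 8) | (offset << 4) | (checksum << 2) | hash;
}

function unpackControlWord(value) {
  return {
    scheme: (value >> 8) & 0b11111,
    offset: (value >> 4) & 0b1111,
    checksum: (value >> 2) & 0b11,
    hash: value & 0b11,
  };
}

const controlWord = packControlWord({ scheme: 0b00001, offset: 5 });
// controlWord is 336, i.e. (1 << 8) | (5 << 4)
const fields = unpackControlWord(controlWord);
// fields is { scheme: 1, offset: 5, checksum: 0, hash: 0 }
```

Because the four fields total exactly 13 bits, the largest possible value is 8191, so every control word fits in one dictionary word.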

Please note that the checksum and hash components are not implemented in the demo. The Encoding Scheme is a set of five boolean values, each indicating whether a common set of characters is present in the encoded data. This makes the encoding of data with a limited character set (such as numeric-only data) much more efficient.

Encoding Scheme (huffman encoding)

| Bit | 0 | 1 | 2 | 3 | 4 |
| --- | --- | --- | --- | --- | --- |
| Character set | Separators | Special | Capital | Lowercase | Numeric |

Note that the Separators character set is a subset of the Special character set, so these two values are mutually exclusive. If both are set to true, we instead interpret the following three bits as an extended encoding type.
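As a rough illustration of why the flags help, the sketch below derives an alphabet from the five flags and estimates how many characters a 13-bit word can carry. The character sets in `SETS` are placeholders (the README does not list the real ones), bit 0 is assumed to be the most significant bit, and the extended encoding case is only detected, not modelled.

```javascript
// Flag positions, assuming bit 0 (Separators) is the most significant of the 5 bits.
const FLAGS = {
  separators: 0b10000,
  special: 0b01000,
  capital: 0b00100,
  lowercase: 0b00010,
  numeric: 0b00001,
};

// Hypothetical character sets; the project's actual sets may differ.
const SETS = {
  separators: " -_./",
  special: " !\"#$%&'()*+,-./:;<=>?@[\\]^_`{|}~",
  capital: "ABCDEFGHIJKLMNOPQRSTUVWXYZ",
  lowercase: "abcdefghijklmnopqrstuvwxyz",
  numeric: "0123456789",
};

// Returns the combined alphabet, or null when both Separators and Special
// are set (the extended encoding case, which this sketch does not model).
function alphabetFor(scheme) {
  if ((scheme & FLAGS.separators) && (scheme & FLAGS.special)) return null;
  let alphabet = "";
  for (const name of Object.keys(FLAGS)) {
    if (scheme & FLAGS[name]) alphabet += SETS[name];
  }
  return alphabet;
}

// Upper bound on characters carried by one 13-bit word for a given alphabet size.
function charsPerWord(alphabetSize) {
  return 13 / Math.log2(alphabetSize);
}

const digits = alphabetFor(0b00001);       // "0123456789"
const upperAlnum = alphabetFor(0b00101);   // capitals followed by digits, 36 characters
const numericRate = charsPerWord(digits.length); // about 3.9 characters per word
const asciiRate = charsPerWord(256);             // about 1.6 characters per word
```

Under these assumptions a numeric-only code needs well under half as many words as the same code treated as raw 8-bit ASCII.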

The Web Application

We used Next.js, bootstrapped with create-next-app. The front end was built with React.js and various component libraries such as Material UI. The app runs completely client-side (see the source code), so you can be sure that none of your input into the website is exposed elsewhere on the web.
