
Welcome to Text Recovery Project 👋

A Python library for distributed training of a Transformer neural network across the Internet to solve the Running Key Cipher, a problem widely known in the field of cryptography.

Hugging Face demo | Visualize in W&B | MkDocs link | Python version | PyPI version | PyPI downloads | License: Apache 2.0

The main goal of the project is to study whether a Transformer neural network can “read” meaningful text in the columns that can be compiled for a Running Key Cipher. You can read more about the problem here. The second goal is to train a fairly large model in a distributed manner with the help of volunteers from around the globe 🌎.
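For readers unfamiliar with the cipher, here is a minimal sketch of the textbook Running Key Cipher, assuming lowercase letters only and letter-wise addition modulo 26. The function names `running_key_encrypt` and `candidate_pairs` are hypothetical and are not part of this library's API. The sketch also shows why the problem is ambiguous: every ciphertext letter is consistent with 26 different (plaintext, key) letter pairs.

```python
import string

ALPHABET = string.ascii_lowercase


def running_key_encrypt(plaintext: str, key: str) -> str:
    """Add key letters to plaintext letters modulo 26 (lowercase letters only)."""
    if len(key) < len(plaintext):
        raise ValueError("the running key must be at least as long as the plaintext")
    return "".join(
        ALPHABET[(ALPHABET.index(p) + ALPHABET.index(k)) % 26]
        for p, k in zip(plaintext, key)
    )


def candidate_pairs(cipher_letter: str) -> list[tuple[str, str]]:
    """Return all 26 (plaintext, key) letter pairs that yield this ciphertext letter."""
    c = ALPHABET.index(cipher_letter)
    return [(p, ALPHABET[(c - i) % 26]) for i, p in enumerate(ALPHABET)]


# Illustrative inputs only: both the plaintext and the key are ordinary words.
ciphertext = running_key_encrypt("hello", "world")
pairs_for_first_letter = candidate_pairs(ciphertext[0])
```

Because both the plaintext and the key are meaningful text, a solver has to pick, position by position, the candidate letters that form a readable sentence.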

In addition, a rather fun 😅 goal is to train a model large enough to handle the case described below.
Consider the original sentence:

Hello, my name is Zendaya Maree Stoermer Coleman but you can just call me Zendaya.

The columns for this sentence will be compiled so that each of the last seven contains ten to thirteen letters of the English alphabet, and each of the others contains two to five. The last seven characters will therefore be much harder to "read" than the rest. However, the meaning of the sentence suggests that they spell the name Zendaya.

In other words, the goal is also to train a model that can understand and correctly “read” the last word.
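The column layout for this example can be sketched as follows. This is an illustration under stated assumptions, not the project's actual data pipeline: `build_columns` and its parameters are hypothetical, only letters are kept (spaces and punctuation are dropped), and the distractor letters in each column are drawn uniformly at random rather than derived from a real ciphertext.

```python
import random
import string

SENTENCE = (
    "Hello, my name is Zendaya Maree Stoermer Coleman "
    "but you can just call me Zendaya."
)


def build_columns(text, hard_tail=7, easy=(2, 5), hard=(10, 13), seed=0):
    """Build one column of candidate letters per letter of `text`.

    Each column holds the true letter plus random distractors. The last
    `hard_tail` columns get a size drawn from `hard`, all others from `easy`.
    """
    rng = random.Random(seed)
    letters = [ch for ch in text.lower() if ch in string.ascii_lowercase]
    columns = []
    for i, true_letter in enumerate(letters):
        lo, hi = hard if i >= len(letters) - hard_tail else easy
        size = rng.randint(lo, hi)
        others = [c for c in string.ascii_lowercase if c != true_letter]
        # Sorting hides the position of the true letter inside the column.
        columns.append(sorted(rng.sample(others, size - 1) + [true_letter]))
    return letters, columns


letters, columns = build_columns(SENTENCE)
for true_letter, column in list(zip(letters, columns))[-7:]:
    print(true_letter, "".join(column))
```

With ten to thirteen candidates per position, the final word cannot be recovered from the columns alone, so a model has to rely on the earlier mention of the name in the sentence.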