Aaron Kelly's Blog

Indexing an 8GB MBOX file

This is a post series on how I exported, converted, and made searchable a gigantic mailbox, containing ~15 years worth of email and attachments.

To do this, I make use of a number of techniques in the following posts. I’ve placed them in a logical order here, but they can also be read individually:

[[Exfiltrating data from cloud services]]

[[Journey to the Center of the MBOX]]

[[Sorting and formatting MBOX output]]

[[Preparing text files for indexing]]

[[Searching for an email indexing tool]]