Todo-list

Todo

Rewrite the diff algorithm to not rely on naive unified diff result parsing.

(The original entry is located in /home/runner/work/mail-deduplicate/mail-deduplicate/mail_deduplicate/deduplicate.py:docstring of mail_deduplicate.deduplicate.DuplicateSet.diff, line 3.)

Todo

Investigate what mailbox.MaildirMessage.get_date() does and if we can use it.

(The original entry is located in /home/runner/work/mail-deduplicate/mail-deduplicate/mail_deduplicate/mail.py:docstring of mail_deduplicate.mail.DedupMail.timestamp, line 11.)

Todo

Allow customization of the way the size is computed, by getting the file size instead for example: `python size = os.path.getsize(mail_file) `

(The original entry is located in /home/runner/work/mail-deduplicate/mail-deduplicate/mail_deduplicate/mail.py:docstring of mail_deduplicate.mail.DedupMail.size, line 7.)

Todo

Future finer autodetection heuristics should be implemented here. Some ideas:

  • single mail from a maildir

  • plain text mail content

  • other mailbox formats supported in Python’s standard library:

    • MH

    • Babyl

    • MMDF

(The original entry is located in /home/runner/work/mail-deduplicate/mail-deduplicate/mail_deduplicate/mail_box.py:docstring of mail_deduplicate.mail_box.autodetect_box_type, line 12.)