mbox

mbox

mbox is a generic term for a family of related file formats used for holding collections of electronic mail messages. All messages in an mbox mailbox are concatenated and stored as plain text in a single file. The beginning of each message is indicated by a line whose first five characters consist of "From" followed by a space (the so named "From_ line" or "'From ' line" or simply "From line") and the return path e-mail address. A blank line is appended to the end of each message. For a while, the mbox format was popular because text processing tools can be readily used on the plain text files used to store the e-mail messages.

Unlike the Internet protocols used for the exchange of e-mail, the format used for the storage of e-mail has never been formally defined through the RFC standardization mechanism and has been entirely left to the developer of an e-mail client.

mbox (RFC 4155) stores mailbox messages in their original Internet Message (RFC 2822) format, usually in files directly accessible to users. A similar format is the MH Message Handling System. Other systems, such as Microsoft Exchange Server and the Cyrus IMAP server store mailboxes in centralised databases managed by the mail system and not directly accessible by individual users.

The maildir mailbox format is often cited as an alternative to the mbox format for network e-mail storage systems.

Contents

Family

Four popular but incompatible variants on the same idea comprise a family of mbox formats: mboxo, mboxrd, mboxcl, and mboxcl2. The naming scheme was developed by Daniel J. Bernstein, Rahul Dhesi, and others in 1996. Each originated from a different version of Unix. mboxcl and mboxcl2 originated from the file format used by Unix System V Release 4 mail tools. mboxrd was invented by Rahul Dhesi et al. as a rationalisation of mboxo and subsequently adopted by some Unix mail tools including qmail.

mboxo and mboxrd locate the message start by scanning for From lines that are typically found in the e-mail message header. If a "From " string occurs at the beginning of a line in either the headers or the body of a message (unlikely for the former for correctly formatted messages, but likely for the latter), the e-mail message must be modified before the message is stored in an mbox mailbox file or the line will be taken as a message boundary. This is typically done by prepending a greater-than sign:

   >From my point of view...

In the mboxo format, this can lead to corruption of the message. If a line already contained >From  at the beginning (such as in a quotation), it is unchanged when written. When subsequently read by the mail software, the leading > is erroneously removed. The mboxrd format solves this by converting From  to >From  and converting >From  to >>From , etc. The transformation is then always reversible.[1]

The mboxcl and mboxcl2 formats do not scan for the From  line. Instead, they use a Content-Length: header to determine each message's length.

Modified mbox

Some e-mail clients use a modification of the mbox format for their mail folders.

  • Eudora uses an mboxo variation where a sender's e-mail address is replaced by the constant string "???@???". Most mbox clients store incoming messages as received. Eudora separates out attachments embedded in the message, storing the attachments as separate individual files in one folder.[citation needed]
  • The Mozilla family of MUAs (Mozilla, Netscape, Thunderbird, et al.) use an mboxrd variation with more complex From line quoting rules.[2]

File locking

Various mutually incompatible mechanisms have been used by different mbox formats to enable message file locking, including fcntl(), lockf(), and "dot locking". This does not work well with network mounted file systems, such as the Network File System (NFS).[citation needed]

Because more than one message is stored in a single file, some form of file locking is needed to avoid the corruption that can result from two or more processes modifying the mailbox simultaneously. This could happen if a network e-mail delivery program delivers a new message at the same time as a mail reader is deleting an existing message.

mbox files should be locked also while they are being read. Otherwise the reader may see corrupted message contents if another process is modifying the mbox at the same time, even though no actual file corruption occurs.

See also

References

  1. ^ http://homepage.ntlworld.com./jonathan.deboynepollard/FGA/mail-mbox-formats.html
  2. ^ "Importing and exporting your mail"

Further reading


Wikimedia Foundation. 2010.

Игры ⚽ Поможем написать реферат

Look at other dictionaries:

  • Mbox — is a generic term for a family of related file formats used for holding collections of electronic mail messages. All messages in an mbox mailbox are concatenated and stored as plain text in a single file. The beginning of each message is… …   Wikipedia

  • Mbox — nennt man eine Gruppe von Dateiformaten, mit denen mehrere E Mails in einer Datei gespeichert werden. mbox Dateien tragen für gewöhnlich keine Endung, gelegentlich wird aber mbox oder mbx verwendet. Das mbox Dateiformat ist das klassische… …   Deutsch Wikipedia

  • mbox — mbox, von englisch „mailbox“ („Postfach“), ist ein Dateiformat, mit dem mehrere E Mails zusammen in einer Datei gespeichert werden. Das Format ist meist von Unix bekannt. Inhaltsverzeichnis 1 Aufbau 1.1 Vorteile 1.2 Nachteile …   Deutsch Wikipedia

  • Mbox — Saltar a navegación, búsqueda mbox es un término genérico para una familia de formatos de fichero que se usan para almacenar conjuntos de correos electrónicos. Todos los mensajes en un buzón mailbox están concatenados en un único fichero. El… …   Wikipedia Español

  • mbox — es un término genérico para una familia de formatos de fichero que se usan para almacenar conjuntos de correos electrónicos. Todos los mensajes en un buzón mailbox están concatenados en un único fichero. El principio de cada mensaje está marcado… …   Wikipedia Español

  • Mbox — Pour les articles homonymes, voir box. Le format mbox est un format ouvert de stockage de courriel couramment utilisé. Il repose sur les trois principes suivants : attribuer un fichier à chaque dossier (au lieu d un fichier par message, ou d …   Wikipédia en Français

  • Mboxo — Mbox Pour les articles homonymes, voir box. Le format mbox est un format ouvert de stockage de courriel couramment utilisé. Il repose sur les trois principes suivants : attribuer un fichier à chaque dossier (au lieu d un fichier par message …   Wikipédia en Français

  • Mboxrd — Mbox Pour les articles homonymes, voir box. Le format mbox est un format ouvert de stockage de courriel couramment utilisé. Il repose sur les trois principes suivants : attribuer un fichier à chaque dossier (au lieu d un fichier par message …   Wikipédia en Français

  • Jeremy Jones (racer) — mbox type = style text = This article may require to meet Wikipedia s . Please [fullurl:FULLPAGENAME|action=edit improve this article] if you can. #if:September 2008| (September 2008) #switch:NAMESPACE ns:0 Talk Image Template = #if:September… …   Wikipedia

  • Алкины — (иначе ацетиленовые углеводороды)  углеводороды, содержащие тройную связь между атомами углерода, образующие гомологический ряд с общей формулой CnH2n 2. Атомы углерода при тройной связи находятся в состоянии sp гибридизации …   Википедия

Share the article and excerpts

Direct link
Do a right-click on the link above
and select “Copy Link”