email.tex 16.5 KB
Newer Older
1
% Copyright (C) 2001-2007 Python Software Foundation
2
% Author: barry@python.org (Barry Warsaw)
3

4
\section{\module{email} ---
5 6 7 8 9
	 An email and MIME handling package}

\declaremodule{standard}{email}
\modulesynopsis{Package supporting the parsing, manipulating, and
    generating email messages, including MIME documents.}
10 11
\moduleauthor{Barry A. Warsaw}{barry@python.org}
\sectionauthor{Barry A. Warsaw}{barry@python.org}
12 13 14 15 16 17

\versionadded{2.2}

The \module{email} package is a library for managing email messages,
including MIME and other \rfc{2822}-based message documents.  It
subsumes most of the functionality in several older standard modules
18 19
such as \refmodule{rfc822}, \refmodule{mimetools},
\refmodule{multifile}, and other non-standard packages such as
20
\module{mimecntl}.  It is specifically \emph{not} designed to do any
21 22 23 24
sending of email messages to SMTP (\rfc{2821}), NNTP, or other servers; those
are functions of modules such as \refmodule{smtplib} and \refmodule{nntplib}.
The \module{email} package attempts to be as RFC-compliant as possible,
supporting in addition to \rfc{2822}, such MIME-related RFCs as
25
\rfc{2045}, \rfc{2046}, \rfc{2047}, and \rfc{2231}.
26 27 28 29 30 31 32 33

The primary distinguishing feature of the \module{email} package is
that it splits the parsing and generating of email messages from the
internal \emph{object model} representation of email.  Applications
using the \module{email} package deal primarily with objects; you can
add sub-objects to messages, remove sub-objects from messages,
completely re-arrange the contents, etc.  There is a separate parser
and a separate generator which handles the transformation from flat
34
text to the object model, and then back to flat text again.  There
35 36 37 38 39 40 41
are also handy subclasses for some common MIME object types, and a few
miscellaneous utilities that help with such common tasks as extracting
and parsing message field values, creating RFC-compliant dates, etc.

The following sections describe the functionality of the
\module{email} package.  The ordering follows a progression that
should be common in applications: an email message is read as flat
42 43
text from a file or other source, the text is parsed to produce the
object structure of the email message, this structure is manipulated,
44
and finally, the object tree is rendered back into flat text.
45

46 47 48
It is perfectly feasible to create the object structure out of whole
cloth --- i.e. completely from scratch.  From there, a similar
progression can be taken as above.
49 50 51 52 53

Also included are detailed specifications of all the classes and
modules that the \module{email} package provides, the exception
classes you might encounter while using the \module{email} package,
some auxiliary utilities, and a few examples.  For users of the older
54 55
\module{mimelib} package, or previous versions of the \module{email}
package, a section on differences and porting is provided.
56

57 58
\begin{seealso}
    \seemodule{smtplib}{SMTP protocol client}
59
    \seemodule{nntplib}{NNTP protocol client}
60 61
\end{seealso}

62
\subsection{Representing an email message}
63
\input{emailmessage}
64 65

\subsection{Parsing email messages}
66
\input{emailparser}
67 68

\subsection{Generating MIME documents}
69
\input{emailgenerator}
70 71

\subsection{Creating email and MIME objects from scratch}
72
\input{emailmimebase}
73

74
\subsection{Internationalized headers}
75
\input{emailheaders}
76

77 78 79
\subsection{Representing character sets}
\input{emailcharsets}

80 81
\subsection{Encoders}
\input{emailencoders}
82

83
\subsection{Exception and Defect classes}
84
\input{emailexc}
85

86 87
\subsection{Miscellaneous utilities}
\input{emailutil}
88

89 90
\subsection{Iterators}
\input{emailiter}
91

92
\subsection{Package History\label{email-pkg-history}}
93

94 95 96
This table describes the release history of the email package, corresponding
to the version of Python that the package was released with.  For purposes of
this document, when you see a note about change or added versions, these refer
97
to the Python version the change was made in, \emph{not} the email package
98 99
version.  This table also describes the Python compatibility of each version
of the package.
100

101 102 103 104 105 106
\begin{tableiii}{l|l|l}{constant}{email version}{distributed with}{compatible with}
\lineiii{1.x}{Python 2.2.0 to Python 2.2.1}{\emph{no longer supported}}
\lineiii{2.5}{Python 2.2.2+ and Python 2.3}{Python 2.1 to 2.5}
\lineiii{3.0}{Python 2.4}{Python 2.3 to 2.5}
\lineiii{4.0}{Python 2.5}{Python 2.3 to 2.5}
\end{tableiii}
107

108
Here are the major differences between \module{email} version 4 and version 3:
109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128

\begin{itemize}
\item All modules have been renamed according to \pep{8} standards.  For
      example, the version 3 module \module{email.Message} was renamed to
      \module{email.message} in version 4.

\item A new subpackage \module{email.mime} was added and all the version 3
      \module{email.MIME*} modules were renamed and situated into the
      \module{email.mime} subpackage.  For example, the version 3 module
      \module{email.MIMEText} was renamed to \module{email.mime.text}.

      \emph{Note that the version 3 names will continue to work until Python
      2.6}.

\item The \module{email.mime.application} module was added, which contains the
      \class{MIMEApplication} class.

\item Methods that were deprecated in version 3 have been removed.  These
      include \method{Generator.__call__()}, \method{Message.get_type()},
      \method{Message.get_main_type()}, \method{Message.get_subtype()}.
129 130 131 132 133 134 135 136 137

\item Fixes have been added for \rfc{2231} support which can change some of
      the return types for \function{Message.get_param()} and friends.  Under
      some circumstances, values which used to return a 3-tuple now return
      simple strings (specifically, if all extended parameter segments were
      unencoded, there is no language and charset designation expected, so the
      return type is now a simple string).  Also, \%-decoding used to be done
      for both encoded and unencoded segments; this decoding is now done only
      for encoded segments.
138 139 140
\end{itemize}

Here are the major differences between \module{email} version 3 and version 2:
141 142 143 144

\begin{itemize}
\item The \class{FeedParser} class was introduced, and the \class{Parser}
      class was implemented in terms of the \class{FeedParser}.  All parsing
145
      therefore is non-strict, and parsing will make a best effort never to
146 147 148 149 150 151 152 153 154 155 156 157 158
      raise an exception.  Problems found while parsing messages are stored in
      the message's \var{defect} attribute.

\item All aspects of the API which raised \exception{DeprecationWarning}s in
      version 2 have been removed.  These include the \var{_encoder} argument
      to the \class{MIMEText} constructor, the \method{Message.add_payload()}
      method, the \function{Utils.dump_address_pair()} function, and the
      functions \function{Utils.decode()} and \function{Utils.encode()}.

\item New \exception{DeprecationWarning}s have been added to:
      \method{Generator.__call__()}, \method{Message.get_type()},
      \method{Message.get_main_type()}, \method{Message.get_subtype()}, and
      the \var{strict} argument to the \class{Parser} class.  These are
159
      expected to be removed in future versions.
160 161 162 163 164

\item Support for Pythons earlier than 2.3 has been removed.
\end{itemize}

Here are the differences between \module{email} version 2 and version 1:
165 166 167 168

\begin{itemize}
\item The \module{email.Header} and \module{email.Charset} modules
      have been added.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
169

170 171
\item The pickle format for \class{Message} instances has changed.
      Since this was never (and still isn't) formally defined, this
172
      isn't considered a backward incompatibility.  However if your
173 174 175 176
      application pickles and unpickles \class{Message} instances, be
      aware that in \module{email} version 2, \class{Message}
      instances now have private variables \var{_charset} and
      \var{_default_type}.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
177

178
\item Several methods in the \class{Message} class have been
179
      deprecated, or their signatures changed.  Also, many new methods
180
      have been added.  See the documentation for the \class{Message}
181
      class for details.  The changes should be completely backward
182
      compatible.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
183

184 185 186 187
\item The object structure has changed in the face of
      \mimetype{message/rfc822} content types.  In \module{email}
      version 1, such a type would be represented by a scalar payload,
      i.e. the container message's \method{is_multipart()} returned
188 189
      false, \method{get_payload()} was not a list object, but a single
      \class{Message} instance.
190 191 192

      This structure was inconsistent with the rest of the package, so
      the object representation for \mimetype{message/rfc822} content
193
      types was changed.  In \module{email} version 2, the container
194 195 196 197
      \emph{does} return \code{True} from \method{is_multipart()}, and
      \method{get_payload()} returns a list containing a single
      \class{Message} item.

198
      Note that this is one place that backward compatibility could
199 200 201 202 203
      not be completely maintained.  However, if you're already
      testing the return type of \method{get_payload()}, you should be
      fine.  You just need to make sure your code doesn't do a
      \method{set_payload()} with a \class{Message} instance on a
      container with a content type of \mimetype{message/rfc822}.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
204

205 206 207 208 209
\item The \class{Parser} constructor's \var{strict} argument was
      added, and its \method{parse()} and \method{parsestr()} methods
      grew a \var{headersonly} argument.  The \var{strict} flag was
      also added to functions \function{email.message_from_file()}
      and \function{email.message_from_string()}.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
210

211 212 213
\item \method{Generator.__call__()} is deprecated; use
      \method{Generator.flatten()} instead.  The \class{Generator}
      class has also grown the \method{clone()} method.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
214

215 216
\item The \class{DecodedGenerator} class in the
      \module{email.Generator} module was added.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
217

218 219
\item The intermediate base classes \class{MIMENonMultipart} and
      \class{MIMEMultipart} have been added, and interposed in the
220
      class hierarchy for most of the other MIME-related derived
221
      classes.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
222

223 224 225
\item The \var{_encoder} argument to the \class{MIMEText} constructor
      has been deprecated.  Encoding  now happens implicitly based
      on the \var{_charset} argument.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
226

227 228 229 230 231 232
\item The following functions in the \module{email.Utils} module have
      been deprecated: \function{dump_address_pairs()},
      \function{decode()}, and \function{encode()}.  The following
      functions have been added to the module:
      \function{make_msgid()}, \function{decode_rfc2231()},
      \function{encode_rfc2231()}, and \function{decode_params()}.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
233

234 235 236 237
\item The non-public function \function{email.Iterators._structure()}
      was added.
\end{itemize}

238 239 240
\subsection{Differences from \module{mimelib}}

The \module{email} package was originally prototyped as a separate
241
library called
242
\ulink{\texttt{mimelib}}{http://mimelib.sf.net/}.
243
Changes have been made so that
244 245 246
method names are more consistent, and some methods or modules have
either been added or removed.  The semantics of some of the methods
have also changed.  For the most part, any functionality available in
Fred Drake's avatar
Fred Drake committed
247
\module{mimelib} is still available in the \refmodule{email} package,
248 249 250
albeit often in a different way.  Backward compatibility between
the \module{mimelib} package and the \module{email} package was not a
priority.
251 252

Here is a brief description of the differences between the
Fred Drake's avatar
Fred Drake committed
253
\module{mimelib} and the \refmodule{email} packages, along with hints on
254 255 256
how to port your applications.

Of course, the most visible difference between the two packages is
Fred Drake's avatar
Fred Drake committed
257
that the package name has been changed to \refmodule{email}.  In
258 259 260 261 262
addition, the top-level package has the following differences:

\begin{itemize}
\item \function{messageFromString()} has been renamed to
      \function{message_from_string()}.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
263

264 265
\item \function{messageFromFile()} has been renamed to
      \function{message_from_file()}.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
266

267 268 269 270 271 272
\end{itemize}

The \class{Message} class has the following differences:

\begin{itemize}
\item The method \method{asString()} was renamed to \method{as_string()}.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
273

274 275
\item The method \method{ismultipart()} was renamed to
      \method{is_multipart()}.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
276

277 278
\item The \method{get_payload()} method has grown a \var{decode}
      optional argument.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
279

280
\item The method \method{getall()} was renamed to \method{get_all()}.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
281

282
\item The method \method{addheader()} was renamed to \method{add_header()}.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
283

284
\item The method \method{gettype()} was renamed to \method{get_type()}.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
285

286
\item The method \method{getmaintype()} was renamed to
287
      \method{get_main_type()}.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
288

289 290
\item The method \method{getsubtype()} was renamed to
      \method{get_subtype()}.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
291

292 293 294 295
\item The method \method{getparams()} was renamed to
      \method{get_params()}.
      Also, whereas \method{getparams()} returned a list of strings,
      \method{get_params()} returns a list of 2-tuples, effectively
296
      the key/value pairs of the parameters, split on the \character{=}
297
      sign.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
298

299
\item The method \method{getparam()} was renamed to \method{get_param()}.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
300

301 302
\item The method \method{getcharsets()} was renamed to
      \method{get_charsets()}.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
303

304 305
\item The method \method{getfilename()} was renamed to
      \method{get_filename()}.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
306

307 308
\item The method \method{getboundary()} was renamed to
      \method{get_boundary()}.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
309

310 311
\item The method \method{setboundary()} was renamed to
      \method{set_boundary()}.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
312

313 314 315
\item The method \method{getdecodedpayload()} was removed.  To get
      similar functionality, pass the value 1 to the \var{decode} flag
      of the {get_payload()} method.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
316

317 318 319
\item The method \method{getpayloadastext()} was removed.  Similar
      functionality
      is supported by the \class{DecodedGenerator} class in the
320
      \refmodule{email.generator} module.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
321

322 323 324
\item The method \method{getbodyastext()} was removed.  You can get
      similar functionality by creating an iterator with
      \function{typed_subpart_iterator()} in the
325
      \refmodule{email.iterators} module.
326 327 328 329
\end{itemize}

The \class{Parser} class has no differences in its public interface.
It does have some additional smarts to recognize
330
\mimetype{message/delivery-status} type messages, which it represents as
331 332 333
a \class{Message} instance containing separate \class{Message}
subparts for each header block in the delivery status
notification\footnote{Delivery Status Notifications (DSN) are defined
Fred Drake's avatar
Fred Drake committed
334
in \rfc{1894}.}.
335 336

The \class{Generator} class has no differences in its public
337
interface.  There is a new class in the \refmodule{email.generator}
338 339 340 341 342 343 344 345 346 347
module though, called \class{DecodedGenerator} which provides most of
the functionality previously available in the
\method{Message.getpayloadastext()} method.

The following modules and classes have been changed:

\begin{itemize}
\item The \class{MIMEBase} class constructor arguments \var{_major}
      and \var{_minor} have changed to \var{_maintype} and
      \var{_subtype} respectively.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
348

349 350 351
\item The \code{Image} class/module has been renamed to
      \code{MIMEImage}.  The \var{_minor} argument has been renamed to
      \var{_subtype}.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
352

353 354 355
\item The \code{Text} class/module has been renamed to
      \code{MIMEText}.  The \var{_minor} argument has been renamed to
      \var{_subtype}.
Barry Warsaw's avatar
nits  
Barry Warsaw committed
356

357 358 359 360 361 362 363
\item The \code{MessageRFC822} class/module has been renamed to
      \code{MIMEMessage}.  Note that an earlier version of
      \module{mimelib} called this class/module \code{RFC822}, but
      that clashed with the Python standard library module
      \refmodule{rfc822} on some case-insensitive file systems.

      Also, the \class{MIMEMessage} class now represents any kind of
364
      MIME message with main type \mimetype{message}.  It takes an
365
      optional argument \var{_subtype} which is used to set the MIME
366
      subtype.  \var{_subtype} defaults to \mimetype{rfc822}.
367 368 369 370
\end{itemize}

\module{mimelib} provided some utility functions in its
\module{address} and \module{date} modules.  All of these functions
371
have been moved to the \refmodule{email.utils} module.
372 373 374

The \code{MsgReader} class/module has been removed.  Its functionality
is most closely supported in the \function{body_line_iterator()}
375
function in the \refmodule{email.iterators} module.
376 377 378

\subsection{Examples}

379 380 381 382 383 384
Here are a few examples of how to use the \module{email} package to
read, write, and send simple email messages, as well as more complex
MIME messages.

First, let's see how to create and send a simple text message:

385
\verbatiminput{email-simple.py}
386 387

Here's an example of how to send a MIME message containing a bunch of
Barry Warsaw's avatar
Barry Warsaw committed
388
family pictures that may be residing in a directory:
389

390
\verbatiminput{email-mime.py}
391

392 393 394 395
Here's an example of how to send the entire contents of a directory as
an email message:
\footnote{Thanks to Matthew Dixon Cowles for the original inspiration
          and examples.}
396

397
\verbatiminput{email-dir.py}
398 399 400 401

And finally, here's an example of how to unpack a MIME message like
the one above, into a directory of files:

402
\verbatiminput{email-unpack.py}