mirror of
https://github.com/python/cpython.git
synced 2025-11-17 09:30:10 +00:00
svn+ssh://svn.python.org/python/branches/py3k
................
r73941 | georg.brandl | 2009-07-11 12:39:00 +0200 (Sa, 11 Jul 2009) | 9 lines
Merged revisions 73940 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk
........
r73940 | georg.brandl | 2009-07-11 12:37:38 +0200 (Sa, 11 Jul 2009) | 1 line
#6430: add note about size of "u" type.
........
................
r73942 | georg.brandl | 2009-07-11 12:39:23 +0200 (Sa, 11 Jul 2009) | 1 line
#6430: remove mention of "w" array typecode.
................
r73943 | georg.brandl | 2009-07-11 12:43:08 +0200 (Sa, 11 Jul 2009) | 1 line
#6421: The self argument of module-level PyCFunctions is now a reference to the module object.
................
r74076 | georg.brandl | 2009-07-18 11:07:48 +0200 (Sa, 18 Jul 2009) | 1 line
#6502: add missing comma in docstring.
................
r74094 | georg.brandl | 2009-07-19 09:25:56 +0200 (So, 19 Jul 2009) | 10 lines
Recorded merge of revisions 74089 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk
........
r74089 | senthil.kumaran | 2009-07-19 04:43:43 +0200 (So, 19 Jul 2009) | 3 lines
Fix for issue5102, timeout value propages between redirects, proxy, digest and
auth handlers. Fixed tests to reflect the same.
........
................
r74186 | georg.brandl | 2009-07-23 11:19:09 +0200 (Do, 23 Jul 2009) | 9 lines
Recorded merge of revisions 74185 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk
........
r74185 | georg.brandl | 2009-07-23 11:17:09 +0200 (Do, 23 Jul 2009) | 1 line
Fix the "pylocals" gdb command.
........
................
r74211 | georg.brandl | 2009-07-26 16:48:09 +0200 (So, 26 Jul 2009) | 9 lines
Recorded merge of revisions 74210 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk
........
r74210 | georg.brandl | 2009-07-26 16:44:23 +0200 (So, 26 Jul 2009) | 1 line
Move member descriptions inside the classes.
........
................
r74212 | georg.brandl | 2009-07-26 16:54:51 +0200 (So, 26 Jul 2009) | 9 lines
Merged revisions 74209 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk
........
r74209 | georg.brandl | 2009-07-26 16:37:28 +0200 (So, 26 Jul 2009) | 1 line
builtin -> built-in.
........
................
r74213 | georg.brandl | 2009-07-26 17:02:41 +0200 (So, 26 Jul 2009) | 9 lines
Merged revisions 74207 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk
........
r74207 | georg.brandl | 2009-07-26 16:19:57 +0200 (So, 26 Jul 2009) | 1 line
#6577: fix (hopefully) all links to builtin instead of module/class-specific objects.
........
................
r74214 | georg.brandl | 2009-07-26 17:03:49 +0200 (So, 26 Jul 2009) | 9 lines
Merged revisions 74205 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk
........
r74205 | georg.brandl | 2009-07-26 15:36:39 +0200 (So, 26 Jul 2009) | 1 line
#6576: fix cross-refs in re docs.
........
................
r74247 | georg.brandl | 2009-07-29 09:27:08 +0200 (Mi, 29 Jul 2009) | 9 lines
Merged revisions 74239 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk
........
r74239 | georg.brandl | 2009-07-28 18:55:32 +0000 (Di, 28 Jul 2009) | 1 line
Clarify quote_plus() usage.
........
................
r74254 | georg.brandl | 2009-07-29 18:14:16 +0200 (Mi, 29 Jul 2009) | 1 line
#6586: fix return/argument type doc for os.read() and os.write().
................
r74262 | alexandre.vassalotti | 2009-07-29 21:54:39 +0200 (Mi, 29 Jul 2009) | 57 lines
Merged revisions 74074,74077,74111,74188,74192-74193,74200,74252-74253,74258-74261 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk
........
r74074 | georg.brandl | 2009-07-18 05:03:10 -0400 (Sat, 18 Jul 2009) | 1 line
#6513: fix example code: warning categories are classes, not instances.
........
r74077 | georg.brandl | 2009-07-18 05:43:40 -0400 (Sat, 18 Jul 2009) | 1 line
#6489: fix an ambiguity in getiterator() documentation.
........
r74111 | benjamin.peterson | 2009-07-20 09:30:10 -0400 (Mon, 20 Jul 2009) | 1 line
remove docs for deprecated -p option
........
r74188 | benjamin.peterson | 2009-07-23 10:25:31 -0400 (Thu, 23 Jul 2009) | 1 line
use bools
........
r74192 | georg.brandl | 2009-07-24 12:28:38 -0400 (Fri, 24 Jul 2009) | 1 line
Fix arg types of et#.
........
r74193 | georg.brandl | 2009-07-24 12:46:38 -0400 (Fri, 24 Jul 2009) | 1 line
Dont put "void" in signature for nullary functions.
........
r74200 | georg.brandl | 2009-07-25 09:02:15 -0400 (Sat, 25 Jul 2009) | 1 line
#6571: add index entries for more operators.
........
r74252 | georg.brandl | 2009-07-29 12:06:31 -0400 (Wed, 29 Jul 2009) | 1 line
#6593: fix link targets.
........
r74253 | georg.brandl | 2009-07-29 12:09:17 -0400 (Wed, 29 Jul 2009) | 1 line
#6591: add reference to ioctl in fcntl module for platforms other than Windows.
........
r74258 | georg.brandl | 2009-07-29 12:57:05 -0400 (Wed, 29 Jul 2009) | 1 line
Add a link to readline, and mention IPython and bpython.
........
r74259 | georg.brandl | 2009-07-29 13:07:21 -0400 (Wed, 29 Jul 2009) | 1 line
Fix some markup and small factual glitches found by M. Markert.
........
r74260 | georg.brandl | 2009-07-29 13:15:20 -0400 (Wed, 29 Jul 2009) | 1 line
Fix a few markup glitches.
........
r74261 | georg.brandl | 2009-07-29 13:50:25 -0400 (Wed, 29 Jul 2009) | 1 line
Rewrite the section about classes a bit; mostly tidbits, and a larger update to the section about "private" variables to reflect the Pythonic consensus better.
........
................
r74311 | georg.brandl | 2009-08-04 22:29:27 +0200 (Di, 04 Aug 2009) | 1 line
Slightly improve buffer-related error message.
................
r74334 | georg.brandl | 2009-08-06 19:51:03 +0200 (Do, 06 Aug 2009) | 1 line
#6648: mention surrogateescape handler where all standard handlers are listed.
................
r74368 | georg.brandl | 2009-08-13 09:56:35 +0200 (Do, 13 Aug 2009) | 21 lines
Merged revisions 74328,74332-74333,74365 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk
........
r74328 | georg.brandl | 2009-08-06 17:06:25 +0200 (Do, 06 Aug 2009) | 1 line
Fix base keyword arg name for int() and long().
........
r74332 | georg.brandl | 2009-08-06 19:23:21 +0200 (Do, 06 Aug 2009) | 1 line
Fix punctuation and one copy-paste error.
........
r74333 | georg.brandl | 2009-08-06 19:43:55 +0200 (Do, 06 Aug 2009) | 1 line
#6658: fix two typos.
........
r74365 | georg.brandl | 2009-08-13 09:48:05 +0200 (Do, 13 Aug 2009) | 1 line
#6679: Remove mention that sub supports no flags.
........
................
464 lines
17 KiB
ReStructuredText
464 lines
17 KiB
ReStructuredText
:mod:`csv` --- CSV File Reading and Writing
|
|
===========================================
|
|
|
|
.. module:: csv
|
|
:synopsis: Write and read tabular data to and from delimited files.
|
|
.. sectionauthor:: Skip Montanaro <skip@pobox.com>
|
|
|
|
|
|
.. index::
|
|
single: csv
|
|
pair: data; tabular
|
|
|
|
The so-called CSV (Comma Separated Values) format is the most common import and
|
|
export format for spreadsheets and databases. There is no "CSV standard", so
|
|
the format is operationally defined by the many applications which read and
|
|
write it. The lack of a standard means that subtle differences often exist in
|
|
the data produced and consumed by different applications. These differences can
|
|
make it annoying to process CSV files from multiple sources. Still, while the
|
|
delimiters and quoting characters vary, the overall format is similar enough
|
|
that it is possible to write a single module which can efficiently manipulate
|
|
such data, hiding the details of reading and writing the data from the
|
|
programmer.
|
|
|
|
The :mod:`csv` module implements classes to read and write tabular data in CSV
|
|
format. It allows programmers to say, "write this data in the format preferred
|
|
by Excel," or "read data from this file which was generated by Excel," without
|
|
knowing the precise details of the CSV format used by Excel. Programmers can
|
|
also describe the CSV formats understood by other applications or define their
|
|
own special-purpose CSV formats.
|
|
|
|
The :mod:`csv` module's :class:`reader` and :class:`writer` objects read and
|
|
write sequences. Programmers can also read and write data in dictionary form
|
|
using the :class:`DictReader` and :class:`DictWriter` classes.
|
|
|
|
.. seealso::
|
|
|
|
:pep:`305` - CSV File API
|
|
The Python Enhancement Proposal which proposed this addition to Python.
|
|
|
|
|
|
.. _csv-contents:
|
|
|
|
Module Contents
|
|
---------------
|
|
|
|
The :mod:`csv` module defines the following functions:
|
|
|
|
|
|
.. function:: reader(csvfile, dialect='excel', **fmtparams)
|
|
|
|
Return a reader object which will iterate over lines in the given *csvfile*.
|
|
*csvfile* can be any object which supports the :term:`iterator` protocol and returns a
|
|
string each time its :meth:`!next` method is called --- file objects and list
|
|
objects are both suitable. If *csvfile* is a file object, it should be opened
|
|
with ``newline=''``. [#]_ An optional
|
|
*dialect* parameter can be given which is used to define a set of parameters
|
|
specific to a particular CSV dialect. It may be an instance of a subclass of
|
|
the :class:`Dialect` class or one of the strings returned by the
|
|
:func:`list_dialects` function. The other optional *fmtparams* keyword arguments
|
|
can be given to override individual formatting parameters in the current
|
|
dialect. For full details about the dialect and formatting parameters, see
|
|
section :ref:`csv-fmt-params`.
|
|
|
|
Each row read from the csv file is returned as a list of strings. No
|
|
automatic data type conversion is performed unless the ``QUOTE_NONNUMERIC`` format
|
|
option is specified (in which case unquoted fields are transformed into floats).
|
|
|
|
A short usage example::
|
|
|
|
>>> import csv
|
|
>>> spamReader = csv.reader(open('eggs.csv', newline=''), delimiter=' ', quotechar='|')
|
|
>>> for row in spamReader:
|
|
... print(', '.join(row))
|
|
Spam, Spam, Spam, Spam, Spam, Baked Beans
|
|
Spam, Lovely Spam, Wonderful Spam
|
|
|
|
|
|
.. function:: writer(csvfile, dialect='excel', **fmtparams)
|
|
|
|
Return a writer object responsible for converting the user's data into delimited
|
|
strings on the given file-like object. *csvfile* can be any object with a
|
|
:func:`write` method. An optional *dialect*
|
|
parameter can be given which is used to define a set of parameters specific to a
|
|
particular CSV dialect. It may be an instance of a subclass of the
|
|
:class:`Dialect` class or one of the strings returned by the
|
|
:func:`list_dialects` function. The other optional *fmtparams* keyword arguments
|
|
can be given to override individual formatting parameters in the current
|
|
dialect. For full details about the dialect and formatting parameters, see
|
|
section :ref:`csv-fmt-params`. To make it
|
|
as easy as possible to interface with modules which implement the DB API, the
|
|
value :const:`None` is written as the empty string. While this isn't a
|
|
reversible transformation, it makes it easier to dump SQL NULL data values to
|
|
CSV files without preprocessing the data returned from a ``cursor.fetch*`` call.
|
|
All other non-string data are stringified with :func:`str` before being written.
|
|
|
|
A short usage example::
|
|
|
|
>>> import csv
|
|
>>> spamWriter = csv.writer(open('eggs.csv', 'w'), delimiter=' ',
|
|
... quotechar='|', quoting=QUOTE_MINIMAL)
|
|
>>> spamWriter.writerow(['Spam'] * 5 + ['Baked Beans'])
|
|
>>> spamWriter.writerow(['Spam', 'Lovely Spam', 'Wonderful Spam'])
|
|
|
|
|
|
.. function:: register_dialect(name[, dialect], **fmtparams)
|
|
|
|
Associate *dialect* with *name*. *name* must be a string. The
|
|
dialect can be specified either by passing a sub-class of :class:`Dialect`, or
|
|
by *fmtparams* keyword arguments, or both, with keyword arguments overriding
|
|
parameters of the dialect. For full details about the dialect and formatting
|
|
parameters, see section :ref:`csv-fmt-params`.
|
|
|
|
|
|
.. function:: unregister_dialect(name)
|
|
|
|
Delete the dialect associated with *name* from the dialect registry. An
|
|
:exc:`Error` is raised if *name* is not a registered dialect name.
|
|
|
|
|
|
.. function:: get_dialect(name)
|
|
|
|
Return the dialect associated with *name*. An :exc:`Error` is raised if
|
|
*name* is not a registered dialect name. This function returns an immutable
|
|
:class:`Dialect`.
|
|
|
|
.. function:: list_dialects()
|
|
|
|
Return the names of all registered dialects.
|
|
|
|
|
|
.. function:: field_size_limit([new_limit])
|
|
|
|
Returns the current maximum field size allowed by the parser. If *new_limit* is
|
|
given, this becomes the new limit.
|
|
|
|
|
|
The :mod:`csv` module defines the following classes:
|
|
|
|
.. class:: DictReader(csvfile, fieldnames=None, restkey=None, restval=None, dialect='excel', *args, **kwds)
|
|
|
|
Create an object which operates like a regular reader but maps the information
|
|
read into a dict whose keys are given by the optional *fieldnames* parameter.
|
|
If the *fieldnames* parameter is omitted, the values in the first row of the
|
|
*csvfile* will be used as the fieldnames. If the row read has fewer fields than
|
|
the fieldnames sequence, the value of *restval* will be used as the default
|
|
value. If the row read has more fields than the fieldnames sequence, the
|
|
remaining data is added as a sequence keyed by the value of *restkey*. If the
|
|
row read has fewer fields than the fieldnames sequence, the remaining keys take
|
|
the value of the optional *restval* parameter. Any other optional or keyword
|
|
arguments are passed to the underlying :class:`reader` instance.
|
|
|
|
|
|
.. class:: DictWriter(csvfile, fieldnames, restval='', extrasaction='raise', dialect='excel', *args, **kwds)
|
|
|
|
Create an object which operates like a regular writer but maps dictionaries onto
|
|
output rows. The *fieldnames* parameter identifies the order in which values in
|
|
the dictionary passed to the :meth:`writerow` method are written to the
|
|
*csvfile*. The optional *restval* parameter specifies the value to be written
|
|
if the dictionary is missing a key in *fieldnames*. If the dictionary passed to
|
|
the :meth:`writerow` method contains a key not found in *fieldnames*, the
|
|
optional *extrasaction* parameter indicates what action to take. If it is set
|
|
to ``'raise'`` a :exc:`ValueError` is raised. If it is set to ``'ignore'``,
|
|
extra values in the dictionary are ignored. Any other optional or keyword
|
|
arguments are passed to the underlying :class:`writer` instance.
|
|
|
|
Note that unlike the :class:`DictReader` class, the *fieldnames* parameter of
|
|
the :class:`DictWriter` is not optional. Since Python's :class:`dict` objects
|
|
are not ordered, there is not enough information available to deduce the order
|
|
in which the row should be written to the *csvfile*.
|
|
|
|
|
|
.. class:: Dialect
|
|
|
|
The :class:`Dialect` class is a container class relied on primarily for its
|
|
attributes, which are used to define the parameters for a specific
|
|
:class:`reader` or :class:`writer` instance.
|
|
|
|
|
|
.. class:: excel()
|
|
|
|
The :class:`excel` class defines the usual properties of an Excel-generated CSV
|
|
file. It is registered with the dialect name ``'excel'``.
|
|
|
|
|
|
.. class:: excel_tab()
|
|
|
|
The :class:`excel_tab` class defines the usual properties of an Excel-generated
|
|
TAB-delimited file. It is registered with the dialect name ``'excel-tab'``.
|
|
|
|
|
|
.. class:: Sniffer()
|
|
|
|
The :class:`Sniffer` class is used to deduce the format of a CSV file.
|
|
|
|
The :class:`Sniffer` class provides two methods:
|
|
|
|
.. method:: sniff(sample, delimiters=None)
|
|
|
|
Analyze the given *sample* and return a :class:`Dialect` subclass
|
|
reflecting the parameters found. If the optional *delimiters* parameter
|
|
is given, it is interpreted as a string containing possible valid
|
|
delimiter characters.
|
|
|
|
|
|
.. method:: has_header(sample)
|
|
|
|
Analyze the sample text (presumed to be in CSV format) and return
|
|
:const:`True` if the first row appears to be a series of column headers.
|
|
|
|
An example for :class:`Sniffer` use::
|
|
|
|
csvfile = open("example.csv")
|
|
dialect = csv.Sniffer().sniff(csvfile.read(1024))
|
|
csvfile.seek(0)
|
|
reader = csv.reader(csvfile, dialect)
|
|
# ... process CSV file contents here ...
|
|
|
|
|
|
The :mod:`csv` module defines the following constants:
|
|
|
|
.. data:: QUOTE_ALL
|
|
|
|
Instructs :class:`writer` objects to quote all fields.
|
|
|
|
|
|
.. data:: QUOTE_MINIMAL
|
|
|
|
Instructs :class:`writer` objects to only quote those fields which contain
|
|
special characters such as *delimiter*, *quotechar* or any of the characters in
|
|
*lineterminator*.
|
|
|
|
|
|
.. data:: QUOTE_NONNUMERIC
|
|
|
|
Instructs :class:`writer` objects to quote all non-numeric fields.
|
|
|
|
Instructs the reader to convert all non-quoted fields to type *float*.
|
|
|
|
|
|
.. data:: QUOTE_NONE
|
|
|
|
Instructs :class:`writer` objects to never quote fields. When the current
|
|
*delimiter* occurs in output data it is preceded by the current *escapechar*
|
|
character. If *escapechar* is not set, the writer will raise :exc:`Error` if
|
|
any characters that require escaping are encountered.
|
|
|
|
Instructs :class:`reader` to perform no special processing of quote characters.
|
|
|
|
The :mod:`csv` module defines the following exception:
|
|
|
|
|
|
.. exception:: Error
|
|
|
|
Raised by any of the functions when an error is detected.
|
|
|
|
.. _csv-fmt-params:
|
|
|
|
Dialects and Formatting Parameters
|
|
----------------------------------
|
|
|
|
To make it easier to specify the format of input and output records, specific
|
|
formatting parameters are grouped together into dialects. A dialect is a
|
|
subclass of the :class:`Dialect` class having a set of specific methods and a
|
|
single :meth:`validate` method. When creating :class:`reader` or
|
|
:class:`writer` objects, the programmer can specify a string or a subclass of
|
|
the :class:`Dialect` class as the dialect parameter. In addition to, or instead
|
|
of, the *dialect* parameter, the programmer can also specify individual
|
|
formatting parameters, which have the same names as the attributes defined below
|
|
for the :class:`Dialect` class.
|
|
|
|
Dialects support the following attributes:
|
|
|
|
|
|
.. attribute:: Dialect.delimiter
|
|
|
|
A one-character string used to separate fields. It defaults to ``','``.
|
|
|
|
|
|
.. attribute:: Dialect.doublequote
|
|
|
|
Controls how instances of *quotechar* appearing inside a field should be
|
|
themselves be quoted. When :const:`True`, the character is doubled. When
|
|
:const:`False`, the *escapechar* is used as a prefix to the *quotechar*. It
|
|
defaults to :const:`True`.
|
|
|
|
On output, if *doublequote* is :const:`False` and no *escapechar* is set,
|
|
:exc:`Error` is raised if a *quotechar* is found in a field.
|
|
|
|
|
|
.. attribute:: Dialect.escapechar
|
|
|
|
A one-character string used by the writer to escape the *delimiter* if *quoting*
|
|
is set to :const:`QUOTE_NONE` and the *quotechar* if *doublequote* is
|
|
:const:`False`. On reading, the *escapechar* removes any special meaning from
|
|
the following character. It defaults to :const:`None`, which disables escaping.
|
|
|
|
|
|
.. attribute:: Dialect.lineterminator
|
|
|
|
The string used to terminate lines produced by the :class:`writer`. It defaults
|
|
to ``'\r\n'``.
|
|
|
|
.. note::
|
|
|
|
The :class:`reader` is hard-coded to recognise either ``'\r'`` or ``'\n'`` as
|
|
end-of-line, and ignores *lineterminator*. This behavior may change in the
|
|
future.
|
|
|
|
|
|
.. attribute:: Dialect.quotechar
|
|
|
|
A one-character string used to quote fields containing special characters, such
|
|
as the *delimiter* or *quotechar*, or which contain new-line characters. It
|
|
defaults to ``'"'``.
|
|
|
|
|
|
.. attribute:: Dialect.quoting
|
|
|
|
Controls when quotes should be generated by the writer and recognised by the
|
|
reader. It can take on any of the :const:`QUOTE_\*` constants (see section
|
|
:ref:`csv-contents`) and defaults to :const:`QUOTE_MINIMAL`.
|
|
|
|
|
|
.. attribute:: Dialect.skipinitialspace
|
|
|
|
When :const:`True`, whitespace immediately following the *delimiter* is ignored.
|
|
The default is :const:`False`.
|
|
|
|
|
|
Reader Objects
|
|
--------------
|
|
|
|
Reader objects (:class:`DictReader` instances and objects returned by the
|
|
:func:`reader` function) have the following public methods:
|
|
|
|
.. method:: csvreader.__next__()
|
|
|
|
Return the next row of the reader's iterable object as a list, parsed according
|
|
to the current dialect. Usually you should call this as ``next(reader)``.
|
|
|
|
|
|
Reader objects have the following public attributes:
|
|
|
|
.. attribute:: csvreader.dialect
|
|
|
|
A read-only description of the dialect in use by the parser.
|
|
|
|
|
|
.. attribute:: csvreader.line_num
|
|
|
|
The number of lines read from the source iterator. This is not the same as the
|
|
number of records returned, as records can span multiple lines.
|
|
|
|
|
|
DictReader objects have the following public attribute:
|
|
|
|
.. attribute:: csvreader.fieldnames
|
|
|
|
If not passed as a parameter when creating the object, this attribute is
|
|
initialized upon first access or when the first record is read from the
|
|
file.
|
|
|
|
|
|
|
|
Writer Objects
|
|
--------------
|
|
|
|
:class:`Writer` objects (:class:`DictWriter` instances and objects returned by
|
|
the :func:`writer` function) have the following public methods. A *row* must be
|
|
a sequence of strings or numbers for :class:`Writer` objects and a dictionary
|
|
mapping fieldnames to strings or numbers (by passing them through :func:`str`
|
|
first) for :class:`DictWriter` objects. Note that complex numbers are written
|
|
out surrounded by parens. This may cause some problems for other programs which
|
|
read CSV files (assuming they support complex numbers at all).
|
|
|
|
|
|
.. method:: csvwriter.writerow(row)
|
|
|
|
Write the *row* parameter to the writer's file object, formatted according to
|
|
the current dialect.
|
|
|
|
|
|
.. method:: csvwriter.writerows(rows)
|
|
|
|
Write all the *rows* parameters (a list of *row* objects as described above) to
|
|
the writer's file object, formatted according to the current dialect.
|
|
|
|
Writer objects have the following public attribute:
|
|
|
|
|
|
.. attribute:: csvwriter.dialect
|
|
|
|
A read-only description of the dialect in use by the writer.
|
|
|
|
|
|
.. _csv-examples:
|
|
|
|
Examples
|
|
--------
|
|
|
|
The simplest example of reading a CSV file::
|
|
|
|
import csv
|
|
reader = csv.reader(open("some.csv", newline=''))
|
|
for row in reader:
|
|
print(row)
|
|
|
|
Reading a file with an alternate format::
|
|
|
|
import csv
|
|
reader = csv.reader(open("passwd"), delimiter=':', quoting=csv.QUOTE_NONE)
|
|
for row in reader:
|
|
print(row)
|
|
|
|
The corresponding simplest possible writing example is::
|
|
|
|
import csv
|
|
writer = csv.writer(open("some.csv", "w"))
|
|
writer.writerows(someiterable)
|
|
|
|
Since :func:`open` is used to open a CSV file for reading, the file
|
|
will by default be decoded into unicode using the system default
|
|
encoding (see :func:`locale.getpreferredencoding`). To decode a file
|
|
using a different encoding, use the ``encoding`` argument of open::
|
|
|
|
import csv
|
|
reader = csv.reader(open("some.csv", newline='', encoding='utf-8'))
|
|
for row in reader:
|
|
print(row)
|
|
|
|
The same applies to writing in something other than the system default
|
|
encoding: specify the encoding argument when opening the output file.
|
|
|
|
Registering a new dialect::
|
|
|
|
import csv
|
|
csv.register_dialect('unixpwd', delimiter=':', quoting=csv.QUOTE_NONE)
|
|
reader = csv.reader(open("passwd"), 'unixpwd')
|
|
|
|
A slightly more advanced use of the reader --- catching and reporting errors::
|
|
|
|
import csv, sys
|
|
filename = "some.csv"
|
|
reader = csv.reader(open(filename, newline=''))
|
|
try:
|
|
for row in reader:
|
|
print(row)
|
|
except csv.Error as e:
|
|
sys.exit('file {}, line {}: {}'.format(filename, reader.line_num, e))
|
|
|
|
And while the module doesn't directly support parsing strings, it can easily be
|
|
done::
|
|
|
|
import csv
|
|
for row in csv.reader(['one,two,three']):
|
|
print(row)
|
|
|
|
|
|
.. rubric:: Footnotes
|
|
|
|
.. [#] If ``newline=''`` is not specified, newlines embedded inside quoted fields
|
|
will not be interpreted correctly. It should always be safe to specify
|
|
``newline=''``, since the csv module does its own universal newline handling
|
|
on input.
|