mirror of
				https://github.com/python/cpython.git
				synced 2025-11-04 03:44:55 +00:00 
			
		
		
		
	
		
			
				
	
	
		
			451 lines
		
	
	
	
		
			16 KiB
		
	
	
	
		
			ReStructuredText
		
	
	
	
	
	
			
		
		
	
	
			451 lines
		
	
	
	
		
			16 KiB
		
	
	
	
		
			ReStructuredText
		
	
	
	
	
	
=======================
 | 
						|
Extending/Embedding FAQ
 | 
						|
=======================
 | 
						|
 | 
						|
.. only:: html
 | 
						|
 | 
						|
   .. contents::
 | 
						|
 | 
						|
.. highlight:: c
 | 
						|
 | 
						|
 | 
						|
.. XXX need review for Python 3.
 | 
						|
 | 
						|
 | 
						|
Can I create my own functions in C?
 | 
						|
-----------------------------------
 | 
						|
 | 
						|
Yes, you can create built-in modules containing functions, variables, exceptions
 | 
						|
and even new types in C.  This is explained in the document
 | 
						|
:ref:`extending-index`.
 | 
						|
 | 
						|
Most intermediate or advanced Python books will also cover this topic.
 | 
						|
 | 
						|
 | 
						|
Can I create my own functions in C++?
 | 
						|
-------------------------------------
 | 
						|
 | 
						|
Yes, using the C compatibility features found in C++.  Place ``extern "C" {
 | 
						|
... }`` around the Python include files and put ``extern "C"`` before each
 | 
						|
function that is going to be called by the Python interpreter.  Global or static
 | 
						|
C++ objects with constructors are probably not a good idea.
 | 
						|
 | 
						|
 | 
						|
.. _c-wrapper-software:
 | 
						|
 | 
						|
Writing C is hard; are there any alternatives?
 | 
						|
----------------------------------------------
 | 
						|
 | 
						|
There are a number of alternatives to writing your own C extensions, depending
 | 
						|
on what you're trying to do.
 | 
						|
 | 
						|
.. XXX make sure these all work
 | 
						|
 | 
						|
`Cython <http://cython.org>`_ and its relative `Pyrex
 | 
						|
<http://www.cosc.canterbury.ac.nz/~greg/python/Pyrex/>`_ are compilers
 | 
						|
that accept a slightly modified form of Python and generate the corresponding
 | 
						|
C code.  Cython and Pyrex make it possible to write an extension without having
 | 
						|
to learn Python's C API.
 | 
						|
 | 
						|
If you need to interface to some C or C++ library for which no Python extension
 | 
						|
currently exists, you can try wrapping the library's data types and functions
 | 
						|
with a tool such as `SWIG <http://www.swig.org>`_.  `SIP
 | 
						|
<http://www.riverbankcomputing.co.uk/software/sip/>`__, `CXX
 | 
						|
<http://cxx.sourceforge.net/>`_ `Boost
 | 
						|
<http://www.boost.org/libs/python/doc/index.html>`_, or `Weave
 | 
						|
<http://www.scipy.org/Weave>`_ are also alternatives for wrapping
 | 
						|
C++ libraries.
 | 
						|
 | 
						|
 | 
						|
How can I execute arbitrary Python statements from C?
 | 
						|
-----------------------------------------------------
 | 
						|
 | 
						|
The highest-level function to do this is :c:func:`PyRun_SimpleString` which takes
 | 
						|
a single string argument to be executed in the context of the module
 | 
						|
``__main__`` and returns 0 for success and -1 when an exception occurred
 | 
						|
(including ``SyntaxError``).  If you want more control, use
 | 
						|
:c:func:`PyRun_String`; see the source for :c:func:`PyRun_SimpleString` in
 | 
						|
``Python/pythonrun.c``.
 | 
						|
 | 
						|
 | 
						|
How can I evaluate an arbitrary Python expression from C?
 | 
						|
---------------------------------------------------------
 | 
						|
 | 
						|
Call the function :c:func:`PyRun_String` from the previous question with the
 | 
						|
start symbol :c:data:`Py_eval_input`; it parses an expression, evaluates it and
 | 
						|
returns its value.
 | 
						|
 | 
						|
 | 
						|
How do I extract C values from a Python object?
 | 
						|
-----------------------------------------------
 | 
						|
 | 
						|
That depends on the object's type.  If it's a tuple, :c:func:`PyTuple_Size`
 | 
						|
returns its length and :c:func:`PyTuple_GetItem` returns the item at a specified
 | 
						|
index.  Lists have similar functions, :c:func:`PyListSize` and
 | 
						|
:c:func:`PyList_GetItem`.
 | 
						|
 | 
						|
For bytes, :c:func:`PyBytes_Size` returns its length and
 | 
						|
:c:func:`PyBytes_AsStringAndSize` provides a pointer to its value and its
 | 
						|
length.  Note that Python bytes objects may contain null bytes so C's
 | 
						|
:c:func:`strlen` should not be used.
 | 
						|
 | 
						|
To test the type of an object, first make sure it isn't *NULL*, and then use
 | 
						|
:c:func:`PyBytes_Check`, :c:func:`PyTuple_Check`, :c:func:`PyList_Check`, etc.
 | 
						|
 | 
						|
There is also a high-level API to Python objects which is provided by the
 | 
						|
so-called 'abstract' interface -- read ``Include/abstract.h`` for further
 | 
						|
details.  It allows interfacing with any kind of Python sequence using calls
 | 
						|
like :c:func:`PySequence_Length`, :c:func:`PySequence_GetItem`, etc.) as well
 | 
						|
as many other useful protocols such as numbers (:c:func:`PyNumber_Index` et.
 | 
						|
al.) and mappings in the PyMapping APIs.
 | 
						|
 | 
						|
 | 
						|
How do I use Py_BuildValue() to create a tuple of arbitrary length?
 | 
						|
-------------------------------------------------------------------
 | 
						|
 | 
						|
You can't.  Use :c:func:`PyTuple_Pack` instead.
 | 
						|
 | 
						|
 | 
						|
How do I call an object's method from C?
 | 
						|
----------------------------------------
 | 
						|
 | 
						|
The :c:func:`PyObject_CallMethod` function can be used to call an arbitrary
 | 
						|
method of an object.  The parameters are the object, the name of the method to
 | 
						|
call, a format string like that used with :c:func:`Py_BuildValue`, and the
 | 
						|
argument values::
 | 
						|
 | 
						|
   PyObject *
 | 
						|
   PyObject_CallMethod(PyObject *object, char *method_name,
 | 
						|
                       char *arg_format, ...);
 | 
						|
 | 
						|
This works for any object that has methods -- whether built-in or user-defined.
 | 
						|
You are responsible for eventually :c:func:`Py_DECREF`\ 'ing the return value.
 | 
						|
 | 
						|
To call, e.g., a file object's "seek" method with arguments 10, 0 (assuming the
 | 
						|
file object pointer is "f")::
 | 
						|
 | 
						|
   res = PyObject_CallMethod(f, "seek", "(ii)", 10, 0);
 | 
						|
   if (res == NULL) {
 | 
						|
           ... an exception occurred ...
 | 
						|
   }
 | 
						|
   else {
 | 
						|
           Py_DECREF(res);
 | 
						|
   }
 | 
						|
 | 
						|
Note that since :c:func:`PyObject_CallObject` *always* wants a tuple for the
 | 
						|
argument list, to call a function without arguments, pass "()" for the format,
 | 
						|
and to call a function with one argument, surround the argument in parentheses,
 | 
						|
e.g. "(i)".
 | 
						|
 | 
						|
 | 
						|
How do I catch the output from PyErr_Print() (or anything that prints to stdout/stderr)?
 | 
						|
----------------------------------------------------------------------------------------
 | 
						|
 | 
						|
In Python code, define an object that supports the ``write()`` method.  Assign
 | 
						|
this object to :data:`sys.stdout` and :data:`sys.stderr`.  Call print_error, or
 | 
						|
just allow the standard traceback mechanism to work. Then, the output will go
 | 
						|
wherever your ``write()`` method sends it.
 | 
						|
 | 
						|
The easiest way to do this is to use the :class:`io.StringIO` class::
 | 
						|
 | 
						|
   >>> import io, sys
 | 
						|
   >>> sys.stdout = io.StringIO()
 | 
						|
   >>> print('foo')
 | 
						|
   >>> print('hello world!')
 | 
						|
   >>> sys.stderr.write(sys.stdout.getvalue())
 | 
						|
   foo
 | 
						|
   hello world!
 | 
						|
 | 
						|
A custom object to do the same would look like this::
 | 
						|
 | 
						|
   >>> import io, sys
 | 
						|
   >>> class StdoutCatcher(io.TextIOBase):
 | 
						|
   ...     def __init__(self):
 | 
						|
   ...         self.data = []
 | 
						|
   ...     def write(self, stuff):
 | 
						|
   ...         self.data.append(stuff)
 | 
						|
   ...
 | 
						|
   >>> import sys
 | 
						|
   >>> sys.stdout = StdoutCatcher()
 | 
						|
   >>> print('foo')
 | 
						|
   >>> print('hello world!')
 | 
						|
   >>> sys.stderr.write(''.join(sys.stdout.data))
 | 
						|
   foo
 | 
						|
   hello world!
 | 
						|
 | 
						|
 | 
						|
How do I access a module written in Python from C?
 | 
						|
--------------------------------------------------
 | 
						|
 | 
						|
You can get a pointer to the module object as follows::
 | 
						|
 | 
						|
   module = PyImport_ImportModule("<modulename>");
 | 
						|
 | 
						|
If the module hasn't been imported yet (i.e. it is not yet present in
 | 
						|
:data:`sys.modules`), this initializes the module; otherwise it simply returns
 | 
						|
the value of ``sys.modules["<modulename>"]``.  Note that it doesn't enter the
 | 
						|
module into any namespace -- it only ensures it has been initialized and is
 | 
						|
stored in :data:`sys.modules`.
 | 
						|
 | 
						|
You can then access the module's attributes (i.e. any name defined in the
 | 
						|
module) as follows::
 | 
						|
 | 
						|
   attr = PyObject_GetAttrString(module, "<attrname>");
 | 
						|
 | 
						|
Calling :c:func:`PyObject_SetAttrString` to assign to variables in the module
 | 
						|
also works.
 | 
						|
 | 
						|
 | 
						|
How do I interface to C++ objects from Python?
 | 
						|
----------------------------------------------
 | 
						|
 | 
						|
Depending on your requirements, there are many approaches.  To do this manually,
 | 
						|
begin by reading :ref:`the "Extending and Embedding" document
 | 
						|
<extending-index>`.  Realize that for the Python run-time system, there isn't a
 | 
						|
whole lot of difference between C and C++ -- so the strategy of building a new
 | 
						|
Python type around a C structure (pointer) type will also work for C++ objects.
 | 
						|
 | 
						|
For C++ libraries, see :ref:`c-wrapper-software`.
 | 
						|
 | 
						|
 | 
						|
I added a module using the Setup file and the make fails; why?
 | 
						|
--------------------------------------------------------------
 | 
						|
 | 
						|
Setup must end in a newline, if there is no newline there, the build process
 | 
						|
fails.  (Fixing this requires some ugly shell script hackery, and this bug is so
 | 
						|
minor that it doesn't seem worth the effort.)
 | 
						|
 | 
						|
 | 
						|
How do I debug an extension?
 | 
						|
----------------------------
 | 
						|
 | 
						|
When using GDB with dynamically loaded extensions, you can't set a breakpoint in
 | 
						|
your extension until your extension is loaded.
 | 
						|
 | 
						|
In your ``.gdbinit`` file (or interactively), add the command::
 | 
						|
 | 
						|
   br _PyImport_LoadDynamicModule
 | 
						|
 | 
						|
Then, when you run GDB::
 | 
						|
 | 
						|
   $ gdb /local/bin/python
 | 
						|
   gdb) run myscript.py
 | 
						|
   gdb) continue # repeat until your extension is loaded
 | 
						|
   gdb) finish   # so that your extension is loaded
 | 
						|
   gdb) br myfunction.c:50
 | 
						|
   gdb) continue
 | 
						|
 | 
						|
I want to compile a Python module on my Linux system, but some files are missing. Why?
 | 
						|
--------------------------------------------------------------------------------------
 | 
						|
 | 
						|
Most packaged versions of Python don't include the
 | 
						|
:file:`/usr/lib/python2.{x}/config/` directory, which contains various files
 | 
						|
required for compiling Python extensions.
 | 
						|
 | 
						|
For Red Hat, install the python-devel RPM to get the necessary files.
 | 
						|
 | 
						|
For Debian, run ``apt-get install python-dev``.
 | 
						|
 | 
						|
 | 
						|
What does "SystemError: _PyImport_FixupExtension: module yourmodule not loaded" mean?
 | 
						|
-------------------------------------------------------------------------------------
 | 
						|
 | 
						|
This means that you have created an extension module named "yourmodule", but
 | 
						|
your module init function does not initialize with that name.
 | 
						|
 | 
						|
Every module init function will have a line similar to::
 | 
						|
 | 
						|
   module = Py_InitModule("yourmodule", yourmodule_functions);
 | 
						|
 | 
						|
If the string passed to this function is not the same name as your extension
 | 
						|
module, the :exc:`SystemError` exception will be raised.
 | 
						|
 | 
						|
 | 
						|
How do I tell "incomplete input" from "invalid input"?
 | 
						|
------------------------------------------------------
 | 
						|
 | 
						|
Sometimes you want to emulate the Python interactive interpreter's behavior,
 | 
						|
where it gives you a continuation prompt when the input is incomplete (e.g. you
 | 
						|
typed the start of an "if" statement or you didn't close your parentheses or
 | 
						|
triple string quotes), but it gives you a syntax error message immediately when
 | 
						|
the input is invalid.
 | 
						|
 | 
						|
In Python you can use the :mod:`codeop` module, which approximates the parser's
 | 
						|
behavior sufficiently.  IDLE uses this, for example.
 | 
						|
 | 
						|
The easiest way to do it in C is to call :c:func:`PyRun_InteractiveLoop` (perhaps
 | 
						|
in a separate thread) and let the Python interpreter handle the input for
 | 
						|
you. You can also set the :c:func:`PyOS_ReadlineFunctionPointer` to point at your
 | 
						|
custom input function. See ``Modules/readline.c`` and ``Parser/myreadline.c``
 | 
						|
for more hints.
 | 
						|
 | 
						|
However sometimes you have to run the embedded Python interpreter in the same
 | 
						|
thread as your rest application and you can't allow the
 | 
						|
:c:func:`PyRun_InteractiveLoop` to stop while waiting for user input.  The one
 | 
						|
solution then is to call :c:func:`PyParser_ParseString` and test for ``e.error``
 | 
						|
equal to ``E_EOF``, which means the input is incomplete).  Here's a sample code
 | 
						|
fragment, untested, inspired by code from Alex Farber::
 | 
						|
 | 
						|
   #include <Python.h>
 | 
						|
   #include <node.h>
 | 
						|
   #include <errcode.h>
 | 
						|
   #include <grammar.h>
 | 
						|
   #include <parsetok.h>
 | 
						|
   #include <compile.h>
 | 
						|
 | 
						|
   int testcomplete(char *code)
 | 
						|
     /* code should end in \n */
 | 
						|
     /* return -1 for error, 0 for incomplete, 1 for complete */
 | 
						|
   {
 | 
						|
     node *n;
 | 
						|
     perrdetail e;
 | 
						|
 | 
						|
     n = PyParser_ParseString(code, &_PyParser_Grammar,
 | 
						|
                              Py_file_input, &e);
 | 
						|
     if (n == NULL) {
 | 
						|
       if (e.error == E_EOF)
 | 
						|
         return 0;
 | 
						|
       return -1;
 | 
						|
     }
 | 
						|
 | 
						|
     PyNode_Free(n);
 | 
						|
     return 1;
 | 
						|
   }
 | 
						|
 | 
						|
Another solution is trying to compile the received string with
 | 
						|
:c:func:`Py_CompileString`. If it compiles without errors, try to execute the
 | 
						|
returned code object by calling :c:func:`PyEval_EvalCode`. Otherwise save the
 | 
						|
input for later. If the compilation fails, find out if it's an error or just
 | 
						|
more input is required - by extracting the message string from the exception
 | 
						|
tuple and comparing it to the string "unexpected EOF while parsing".  Here is a
 | 
						|
complete example using the GNU readline library (you may want to ignore
 | 
						|
**SIGINT** while calling readline())::
 | 
						|
 | 
						|
   #include <stdio.h>
 | 
						|
   #include <readline.h>
 | 
						|
 | 
						|
   #include <Python.h>
 | 
						|
   #include <object.h>
 | 
						|
   #include <compile.h>
 | 
						|
   #include <eval.h>
 | 
						|
 | 
						|
   int main (int argc, char* argv[])
 | 
						|
   {
 | 
						|
     int i, j, done = 0;                          /* lengths of line, code */
 | 
						|
     char ps1[] = ">>> ";
 | 
						|
     char ps2[] = "... ";
 | 
						|
     char *prompt = ps1;
 | 
						|
     char *msg, *line, *code = NULL;
 | 
						|
     PyObject *src, *glb, *loc;
 | 
						|
     PyObject *exc, *val, *trb, *obj, *dum;
 | 
						|
 | 
						|
     Py_Initialize ();
 | 
						|
     loc = PyDict_New ();
 | 
						|
     glb = PyDict_New ();
 | 
						|
     PyDict_SetItemString (glb, "__builtins__", PyEval_GetBuiltins ());
 | 
						|
 | 
						|
     while (!done)
 | 
						|
     {
 | 
						|
       line = readline (prompt);
 | 
						|
 | 
						|
       if (NULL == line)                          /* CTRL-D pressed */
 | 
						|
       {
 | 
						|
         done = 1;
 | 
						|
       }
 | 
						|
       else
 | 
						|
       {
 | 
						|
         i = strlen (line);
 | 
						|
 | 
						|
         if (i > 0)
 | 
						|
           add_history (line);                    /* save non-empty lines */
 | 
						|
 | 
						|
         if (NULL == code)                        /* nothing in code yet */
 | 
						|
           j = 0;
 | 
						|
         else
 | 
						|
           j = strlen (code);
 | 
						|
 | 
						|
         code = realloc (code, i + j + 2);
 | 
						|
         if (NULL == code)                        /* out of memory */
 | 
						|
           exit (1);
 | 
						|
 | 
						|
         if (0 == j)                              /* code was empty, so */
 | 
						|
           code[0] = '\0';                        /* keep strncat happy */
 | 
						|
 | 
						|
         strncat (code, line, i);                 /* append line to code */
 | 
						|
         code[i + j] = '\n';                      /* append '\n' to code */
 | 
						|
         code[i + j + 1] = '\0';
 | 
						|
 | 
						|
         src = Py_CompileString (code, "<stdin>", Py_single_input);
 | 
						|
 | 
						|
         if (NULL != src)                         /* compiled just fine - */
 | 
						|
         {
 | 
						|
           if (ps1  == prompt ||                  /* ">>> " or */
 | 
						|
               '\n' == code[i + j - 1])           /* "... " and double '\n' */
 | 
						|
           {                                               /* so execute it */
 | 
						|
             dum = PyEval_EvalCode (src, glb, loc);
 | 
						|
             Py_XDECREF (dum);
 | 
						|
             Py_XDECREF (src);
 | 
						|
             free (code);
 | 
						|
             code = NULL;
 | 
						|
             if (PyErr_Occurred ())
 | 
						|
               PyErr_Print ();
 | 
						|
             prompt = ps1;
 | 
						|
           }
 | 
						|
         }                                        /* syntax error or E_EOF? */
 | 
						|
         else if (PyErr_ExceptionMatches (PyExc_SyntaxError))
 | 
						|
         {
 | 
						|
           PyErr_Fetch (&exc, &val, &trb);        /* clears exception! */
 | 
						|
 | 
						|
           if (PyArg_ParseTuple (val, "sO", &msg, &obj) &&
 | 
						|
               !strcmp (msg, "unexpected EOF while parsing")) /* E_EOF */
 | 
						|
           {
 | 
						|
             Py_XDECREF (exc);
 | 
						|
             Py_XDECREF (val);
 | 
						|
             Py_XDECREF (trb);
 | 
						|
             prompt = ps2;
 | 
						|
           }
 | 
						|
           else                                   /* some other syntax error */
 | 
						|
           {
 | 
						|
             PyErr_Restore (exc, val, trb);
 | 
						|
             PyErr_Print ();
 | 
						|
             free (code);
 | 
						|
             code = NULL;
 | 
						|
             prompt = ps1;
 | 
						|
           }
 | 
						|
         }
 | 
						|
         else                                     /* some non-syntax error */
 | 
						|
         {
 | 
						|
           PyErr_Print ();
 | 
						|
           free (code);
 | 
						|
           code = NULL;
 | 
						|
           prompt = ps1;
 | 
						|
         }
 | 
						|
 | 
						|
         free (line);
 | 
						|
       }
 | 
						|
     }
 | 
						|
 | 
						|
     Py_XDECREF(glb);
 | 
						|
     Py_XDECREF(loc);
 | 
						|
     Py_Finalize();
 | 
						|
     exit(0);
 | 
						|
   }
 | 
						|
 | 
						|
 | 
						|
How do I find undefined g++ symbols __builtin_new or __pure_virtual?
 | 
						|
--------------------------------------------------------------------
 | 
						|
 | 
						|
To dynamically load g++ extension modules, you must recompile Python, relink it
 | 
						|
using g++ (change LINKCC in the Python Modules Makefile), and link your
 | 
						|
extension module using g++ (e.g., ``g++ -shared -o mymodule.so mymodule.o``).
 | 
						|
 | 
						|
 | 
						|
Can I create an object class with some methods implemented in C and others in Python (e.g. through inheritance)?
 | 
						|
----------------------------------------------------------------------------------------------------------------
 | 
						|
 | 
						|
In Python 2.2, you can inherit from built-in classes such as :class:`int`,
 | 
						|
:class:`list`, :class:`dict`, etc.
 | 
						|
 | 
						|
The Boost Python Library (BPL, http://www.boost.org/libs/python/doc/index.html)
 | 
						|
provides a way of doing this from C++ (i.e. you can inherit from an extension
 | 
						|
class written in C++ using the BPL).
 |