mirror of
				https://github.com/python/cpython.git
				synced 2025-10-24 15:36:26 +00:00 
			
		
		
		
	
		
			
				
	
	
		
			479 lines
		
	
	
	
		
			18 KiB
		
	
	
	
		
			ReStructuredText
		
	
	
	
	
	
			
		
		
	
	
			479 lines
		
	
	
	
		
			18 KiB
		
	
	
	
		
			ReStructuredText
		
	
	
	
	
	
| =======================
 | |
| Extending/Embedding FAQ
 | |
| =======================
 | |
| 
 | |
| .. contents::
 | |
| 
 | |
| .. highlight:: c
 | |
| 
 | |
| 
 | |
| .. XXX need review for Python 3.
 | |
| 
 | |
| 
 | |
| Can I create my own functions in C?
 | |
| -----------------------------------
 | |
| 
 | |
| Yes, you can create built-in modules containing functions, variables, exceptions
 | |
| and even new types in C.  This is explained in the document
 | |
| :ref:`extending-index`.
 | |
| 
 | |
| Most intermediate or advanced Python books will also cover this topic.
 | |
| 
 | |
| 
 | |
| Can I create my own functions in C++?
 | |
| -------------------------------------
 | |
| 
 | |
| Yes, using the C compatibility features found in C++.  Place ``extern "C" {
 | |
| ... }`` around the Python include files and put ``extern "C"`` before each
 | |
| function that is going to be called by the Python interpreter.  Global or static
 | |
| C++ objects with constructors are probably not a good idea.
 | |
| 
 | |
| 
 | |
| .. _c-wrapper-software:
 | |
| 
 | |
| Writing C is hard; are there any alternatives?
 | |
| ----------------------------------------------
 | |
| 
 | |
| There are a number of alternatives to writing your own C extensions, depending
 | |
| on what you're trying to do.
 | |
| 
 | |
| .. XXX make sure these all work; mention Cython
 | |
| 
 | |
| If you need more speed, `Psyco <http://psyco.sourceforge.net/>`_ generates x86
 | |
| assembly code from Python bytecode.  You can use Psyco to compile the most
 | |
| time-critical functions in your code, and gain a significant improvement with
 | |
| very little effort, as long as you're running on a machine with an
 | |
| x86-compatible processor.
 | |
| 
 | |
| `Pyrex <http://www.cosc.canterbury.ac.nz/~greg/python/Pyrex/>`_ is a compiler
 | |
| that accepts a slightly modified form of Python and generates the corresponding
 | |
| C code.  Pyrex makes it possible to write an extension without having to learn
 | |
| Python's C API.
 | |
| 
 | |
| If you need to interface to some C or C++ library for which no Python extension
 | |
| currently exists, you can try wrapping the library's data types and functions
 | |
| with a tool such as `SWIG <http://www.swig.org>`_.  `SIP
 | |
| <http://www.riverbankcomputing.co.uk/software/sip/>`__, `CXX
 | |
| <http://cxx.sourceforge.net/>`_ `Boost
 | |
| <http://www.boost.org/libs/python/doc/index.html>`_, or `Weave
 | |
| <http://www.scipy.org/Weave>`_ are also alternatives for wrapping
 | |
| C++ libraries.
 | |
| 
 | |
| 
 | |
| How can I execute arbitrary Python statements from C?
 | |
| -----------------------------------------------------
 | |
| 
 | |
| The highest-level function to do this is :c:func:`PyRun_SimpleString` which takes
 | |
| a single string argument to be executed in the context of the module
 | |
| ``__main__`` and returns 0 for success and -1 when an exception occurred
 | |
| (including ``SyntaxError``).  If you want more control, use
 | |
| :c:func:`PyRun_String`; see the source for :c:func:`PyRun_SimpleString` in
 | |
| ``Python/pythonrun.c``.
 | |
| 
 | |
| 
 | |
| How can I evaluate an arbitrary Python expression from C?
 | |
| ---------------------------------------------------------
 | |
| 
 | |
| Call the function :c:func:`PyRun_String` from the previous question with the
 | |
| start symbol :c:data:`Py_eval_input`; it parses an expression, evaluates it and
 | |
| returns its value.
 | |
| 
 | |
| 
 | |
| How do I extract C values from a Python object?
 | |
| -----------------------------------------------
 | |
| 
 | |
| That depends on the object's type.  If it's a tuple, :c:func:`PyTuple_Size`
 | |
| returns its length and :c:func:`PyTuple_GetItem` returns the item at a specified
 | |
| index.  Lists have similar functions, :c:func:`PyListSize` and
 | |
| :c:func:`PyList_GetItem`.
 | |
| 
 | |
| For strings, :c:func:`PyString_Size` returns its length and
 | |
| :c:func:`PyString_AsString` a pointer to its value.  Note that Python strings may
 | |
| contain null bytes so C's :c:func:`strlen` should not be used.
 | |
| 
 | |
| To test the type of an object, first make sure it isn't *NULL*, and then use
 | |
| :c:func:`PyString_Check`, :c:func:`PyTuple_Check`, :c:func:`PyList_Check`, etc.
 | |
| 
 | |
| There is also a high-level API to Python objects which is provided by the
 | |
| so-called 'abstract' interface -- read ``Include/abstract.h`` for further
 | |
| details.  It allows interfacing with any kind of Python sequence using calls
 | |
| like :c:func:`PySequence_Length`, :c:func:`PySequence_GetItem`, etc.)  as well as
 | |
| many other useful protocols.
 | |
| 
 | |
| 
 | |
| How do I use Py_BuildValue() to create a tuple of arbitrary length?
 | |
| -------------------------------------------------------------------
 | |
| 
 | |
| You can't.  Use ``t = PyTuple_New(n)`` instead, and fill it with objects using
 | |
| ``PyTuple_SetItem(t, i, o)`` -- note that this "eats" a reference count of
 | |
| ``o``, so you have to :c:func:`Py_INCREF` it.  Lists have similar functions
 | |
| ``PyList_New(n)`` and ``PyList_SetItem(l, i, o)``.  Note that you *must* set all
 | |
| the tuple items to some value before you pass the tuple to Python code --
 | |
| ``PyTuple_New(n)`` initializes them to NULL, which isn't a valid Python value.
 | |
| 
 | |
| 
 | |
| How do I call an object's method from C?
 | |
| ----------------------------------------
 | |
| 
 | |
| The :c:func:`PyObject_CallMethod` function can be used to call an arbitrary
 | |
| method of an object.  The parameters are the object, the name of the method to
 | |
| call, a format string like that used with :c:func:`Py_BuildValue`, and the
 | |
| argument values::
 | |
| 
 | |
|    PyObject *
 | |
|    PyObject_CallMethod(PyObject *object, char *method_name,
 | |
|                        char *arg_format, ...);
 | |
| 
 | |
| This works for any object that has methods -- whether built-in or user-defined.
 | |
| You are responsible for eventually :c:func:`Py_DECREF`\ 'ing the return value.
 | |
| 
 | |
| To call, e.g., a file object's "seek" method with arguments 10, 0 (assuming the
 | |
| file object pointer is "f")::
 | |
| 
 | |
|    res = PyObject_CallMethod(f, "seek", "(ii)", 10, 0);
 | |
|    if (res == NULL) {
 | |
|            ... an exception occurred ...
 | |
|    }
 | |
|    else {
 | |
|            Py_DECREF(res);
 | |
|    }
 | |
| 
 | |
| Note that since :c:func:`PyObject_CallObject` *always* wants a tuple for the
 | |
| argument list, to call a function without arguments, pass "()" for the format,
 | |
| and to call a function with one argument, surround the argument in parentheses,
 | |
| e.g. "(i)".
 | |
| 
 | |
| 
 | |
| How do I catch the output from PyErr_Print() (or anything that prints to stdout/stderr)?
 | |
| ----------------------------------------------------------------------------------------
 | |
| 
 | |
| In Python code, define an object that supports the ``write()`` method.  Assign
 | |
| this object to :data:`sys.stdout` and :data:`sys.stderr`.  Call print_error, or
 | |
| just allow the standard traceback mechanism to work. Then, the output will go
 | |
| wherever your ``write()`` method sends it.
 | |
| 
 | |
| The easiest way to do this is to use the StringIO class in the standard library.
 | |
| 
 | |
| Sample code and use for catching stdout:
 | |
| 
 | |
|    >>> class StdoutCatcher:
 | |
|    ...     def __init__(self):
 | |
|    ...         self.data = ''
 | |
|    ...     def write(self, stuff):
 | |
|    ...         self.data = self.data + stuff
 | |
|    ...
 | |
|    >>> import sys
 | |
|    >>> sys.stdout = StdoutCatcher()
 | |
|    >>> print('foo')
 | |
|    >>> print('hello world!')
 | |
|    >>> sys.stderr.write(sys.stdout.data)
 | |
|    foo
 | |
|    hello world!
 | |
| 
 | |
| 
 | |
| How do I access a module written in Python from C?
 | |
| --------------------------------------------------
 | |
| 
 | |
| You can get a pointer to the module object as follows::
 | |
| 
 | |
|    module = PyImport_ImportModule("<modulename>");
 | |
| 
 | |
| If the module hasn't been imported yet (i.e. it is not yet present in
 | |
| :data:`sys.modules`), this initializes the module; otherwise it simply returns
 | |
| the value of ``sys.modules["<modulename>"]``.  Note that it doesn't enter the
 | |
| module into any namespace -- it only ensures it has been initialized and is
 | |
| stored in :data:`sys.modules`.
 | |
| 
 | |
| You can then access the module's attributes (i.e. any name defined in the
 | |
| module) as follows::
 | |
| 
 | |
|    attr = PyObject_GetAttrString(module, "<attrname>");
 | |
| 
 | |
| Calling :c:func:`PyObject_SetAttrString` to assign to variables in the module
 | |
| also works.
 | |
| 
 | |
| 
 | |
| How do I interface to C++ objects from Python?
 | |
| ----------------------------------------------
 | |
| 
 | |
| Depending on your requirements, there are many approaches.  To do this manually,
 | |
| begin by reading :ref:`the "Extending and Embedding" document
 | |
| <extending-index>`.  Realize that for the Python run-time system, there isn't a
 | |
| whole lot of difference between C and C++ -- so the strategy of building a new
 | |
| Python type around a C structure (pointer) type will also work for C++ objects.
 | |
| 
 | |
| For C++ libraries, see :ref:`c-wrapper-software`.
 | |
| 
 | |
| 
 | |
| I added a module using the Setup file and the make fails; why?
 | |
| --------------------------------------------------------------
 | |
| 
 | |
| Setup must end in a newline, if there is no newline there, the build process
 | |
| fails.  (Fixing this requires some ugly shell script hackery, and this bug is so
 | |
| minor that it doesn't seem worth the effort.)
 | |
| 
 | |
| 
 | |
| How do I debug an extension?
 | |
| ----------------------------
 | |
| 
 | |
| When using GDB with dynamically loaded extensions, you can't set a breakpoint in
 | |
| your extension until your extension is loaded.
 | |
| 
 | |
| In your ``.gdbinit`` file (or interactively), add the command::
 | |
| 
 | |
|    br _PyImport_LoadDynamicModule
 | |
| 
 | |
| Then, when you run GDB::
 | |
| 
 | |
|    $ gdb /local/bin/python
 | |
|    gdb) run myscript.py
 | |
|    gdb) continue # repeat until your extension is loaded
 | |
|    gdb) finish   # so that your extension is loaded
 | |
|    gdb) br myfunction.c:50
 | |
|    gdb) continue
 | |
| 
 | |
| I want to compile a Python module on my Linux system, but some files are missing. Why?
 | |
| --------------------------------------------------------------------------------------
 | |
| 
 | |
| Most packaged versions of Python don't include the
 | |
| :file:`/usr/lib/python2.{x}/config/` directory, which contains various files
 | |
| required for compiling Python extensions.
 | |
| 
 | |
| For Red Hat, install the python-devel RPM to get the necessary files.
 | |
| 
 | |
| For Debian, run ``apt-get install python-dev``.
 | |
| 
 | |
| 
 | |
| What does "SystemError: _PyImport_FixupExtension: module yourmodule not loaded" mean?
 | |
| -------------------------------------------------------------------------------------
 | |
| 
 | |
| This means that you have created an extension module named "yourmodule", but
 | |
| your module init function does not initialize with that name.
 | |
| 
 | |
| Every module init function will have a line similar to::
 | |
| 
 | |
|    module = Py_InitModule("yourmodule", yourmodule_functions);
 | |
| 
 | |
| If the string passed to this function is not the same name as your extension
 | |
| module, the :exc:`SystemError` exception will be raised.
 | |
| 
 | |
| 
 | |
| How do I tell "incomplete input" from "invalid input"?
 | |
| ------------------------------------------------------
 | |
| 
 | |
| Sometimes you want to emulate the Python interactive interpreter's behavior,
 | |
| where it gives you a continuation prompt when the input is incomplete (e.g. you
 | |
| typed the start of an "if" statement or you didn't close your parentheses or
 | |
| triple string quotes), but it gives you a syntax error message immediately when
 | |
| the input is invalid.
 | |
| 
 | |
| In Python you can use the :mod:`codeop` module, which approximates the parser's
 | |
| behavior sufficiently.  IDLE uses this, for example.
 | |
| 
 | |
| The easiest way to do it in C is to call :c:func:`PyRun_InteractiveLoop` (perhaps
 | |
| in a separate thread) and let the Python interpreter handle the input for
 | |
| you. You can also set the :c:func:`PyOS_ReadlineFunctionPointer` to point at your
 | |
| custom input function. See ``Modules/readline.c`` and ``Parser/myreadline.c``
 | |
| for more hints.
 | |
| 
 | |
| However sometimes you have to run the embedded Python interpreter in the same
 | |
| thread as your rest application and you can't allow the
 | |
| :c:func:`PyRun_InteractiveLoop` to stop while waiting for user input.  The one
 | |
| solution then is to call :c:func:`PyParser_ParseString` and test for ``e.error``
 | |
| equal to ``E_EOF``, which means the input is incomplete).  Here's a sample code
 | |
| fragment, untested, inspired by code from Alex Farber::
 | |
| 
 | |
|    #include <Python.h>
 | |
|    #include <node.h>
 | |
|    #include <errcode.h>
 | |
|    #include <grammar.h>
 | |
|    #include <parsetok.h>
 | |
|    #include <compile.h>
 | |
| 
 | |
|    int testcomplete(char *code)
 | |
|      /* code should end in \n */
 | |
|      /* return -1 for error, 0 for incomplete, 1 for complete */
 | |
|    {
 | |
|      node *n;
 | |
|      perrdetail e;
 | |
| 
 | |
|      n = PyParser_ParseString(code, &_PyParser_Grammar,
 | |
|                               Py_file_input, &e);
 | |
|      if (n == NULL) {
 | |
|        if (e.error == E_EOF)
 | |
|          return 0;
 | |
|        return -1;
 | |
|      }
 | |
| 
 | |
|      PyNode_Free(n);
 | |
|      return 1;
 | |
|    }
 | |
| 
 | |
| Another solution is trying to compile the received string with
 | |
| :c:func:`Py_CompileString`. If it compiles without errors, try to execute the
 | |
| returned code object by calling :c:func:`PyEval_EvalCode`. Otherwise save the
 | |
| input for later. If the compilation fails, find out if it's an error or just
 | |
| more input is required - by extracting the message string from the exception
 | |
| tuple and comparing it to the string "unexpected EOF while parsing".  Here is a
 | |
| complete example using the GNU readline library (you may want to ignore
 | |
| **SIGINT** while calling readline())::
 | |
| 
 | |
|    #include <stdio.h>
 | |
|    #include <readline.h>
 | |
| 
 | |
|    #include <Python.h>
 | |
|    #include <object.h>
 | |
|    #include <compile.h>
 | |
|    #include <eval.h>
 | |
| 
 | |
|    int main (int argc, char* argv[])
 | |
|    {
 | |
|      int i, j, done = 0;                          /* lengths of line, code */
 | |
|      char ps1[] = ">>> ";
 | |
|      char ps2[] = "... ";
 | |
|      char *prompt = ps1;
 | |
|      char *msg, *line, *code = NULL;
 | |
|      PyObject *src, *glb, *loc;
 | |
|      PyObject *exc, *val, *trb, *obj, *dum;
 | |
| 
 | |
|      Py_Initialize ();
 | |
|      loc = PyDict_New ();
 | |
|      glb = PyDict_New ();
 | |
|      PyDict_SetItemString (glb, "__builtins__", PyEval_GetBuiltins ());
 | |
| 
 | |
|      while (!done)
 | |
|      {
 | |
|        line = readline (prompt);
 | |
| 
 | |
|        if (NULL == line)                          /* CTRL-D pressed */
 | |
|        {
 | |
|          done = 1;
 | |
|        }
 | |
|        else
 | |
|        {
 | |
|          i = strlen (line);
 | |
| 
 | |
|          if (i > 0)
 | |
|            add_history (line);                    /* save non-empty lines */
 | |
| 
 | |
|          if (NULL == code)                        /* nothing in code yet */
 | |
|            j = 0;
 | |
|          else
 | |
|            j = strlen (code);
 | |
| 
 | |
|          code = realloc (code, i + j + 2);
 | |
|          if (NULL == code)                        /* out of memory */
 | |
|            exit (1);
 | |
| 
 | |
|          if (0 == j)                              /* code was empty, so */
 | |
|            code[0] = '\0';                        /* keep strncat happy */
 | |
| 
 | |
|          strncat (code, line, i);                 /* append line to code */
 | |
|          code[i + j] = '\n';                      /* append '\n' to code */
 | |
|          code[i + j + 1] = '\0';
 | |
| 
 | |
|          src = Py_CompileString (code, "<stdin>", Py_single_input);
 | |
| 
 | |
|          if (NULL != src)                         /* compiled just fine - */
 | |
|          {
 | |
|            if (ps1  == prompt ||                  /* ">>> " or */
 | |
|                '\n' == code[i + j - 1])           /* "... " and double '\n' */
 | |
|            {                                               /* so execute it */
 | |
|              dum = PyEval_EvalCode (src, glb, loc);
 | |
|              Py_XDECREF (dum);
 | |
|              Py_XDECREF (src);
 | |
|              free (code);
 | |
|              code = NULL;
 | |
|              if (PyErr_Occurred ())
 | |
|                PyErr_Print ();
 | |
|              prompt = ps1;
 | |
|            }
 | |
|          }                                        /* syntax error or E_EOF? */
 | |
|          else if (PyErr_ExceptionMatches (PyExc_SyntaxError))
 | |
|          {
 | |
|            PyErr_Fetch (&exc, &val, &trb);        /* clears exception! */
 | |
| 
 | |
|            if (PyArg_ParseTuple (val, "sO", &msg, &obj) &&
 | |
|                !strcmp (msg, "unexpected EOF while parsing")) /* E_EOF */
 | |
|            {
 | |
|              Py_XDECREF (exc);
 | |
|              Py_XDECREF (val);
 | |
|              Py_XDECREF (trb);
 | |
|              prompt = ps2;
 | |
|            }
 | |
|            else                                   /* some other syntax error */
 | |
|            {
 | |
|              PyErr_Restore (exc, val, trb);
 | |
|              PyErr_Print ();
 | |
|              free (code);
 | |
|              code = NULL;
 | |
|              prompt = ps1;
 | |
|            }
 | |
|          }
 | |
|          else                                     /* some non-syntax error */
 | |
|          {
 | |
|            PyErr_Print ();
 | |
|            free (code);
 | |
|            code = NULL;
 | |
|            prompt = ps1;
 | |
|          }
 | |
| 
 | |
|          free (line);
 | |
|        }
 | |
|      }
 | |
| 
 | |
|      Py_XDECREF(glb);
 | |
|      Py_XDECREF(loc);
 | |
|      Py_Finalize();
 | |
|      exit(0);
 | |
|    }
 | |
| 
 | |
| 
 | |
| How do I find undefined g++ symbols __builtin_new or __pure_virtual?
 | |
| --------------------------------------------------------------------
 | |
| 
 | |
| To dynamically load g++ extension modules, you must recompile Python, relink it
 | |
| using g++ (change LINKCC in the Python Modules Makefile), and link your
 | |
| extension module using g++ (e.g., ``g++ -shared -o mymodule.so mymodule.o``).
 | |
| 
 | |
| 
 | |
| Can I create an object class with some methods implemented in C and others in Python (e.g. through inheritance)?
 | |
| ----------------------------------------------------------------------------------------------------------------
 | |
| 
 | |
| In Python 2.2, you can inherit from built-in classes such as :class:`int`,
 | |
| :class:`list`, :class:`dict`, etc.
 | |
| 
 | |
| The Boost Python Library (BPL, http://www.boost.org/libs/python/doc/index.html)
 | |
| provides a way of doing this from C++ (i.e. you can inherit from an extension
 | |
| class written in C++ using the BPL).
 | |
| 
 | |
| 
 | |
| When importing module X, why do I get "undefined symbol: PyUnicodeUCS2*"?
 | |
| -------------------------------------------------------------------------
 | |
| 
 | |
| You are using a version of Python that uses a 4-byte representation for Unicode
 | |
| characters, but some C extension module you are importing was compiled using a
 | |
| Python that uses a 2-byte representation for Unicode characters (the default).
 | |
| 
 | |
| If instead the name of the undefined symbol starts with ``PyUnicodeUCS4``, the
 | |
| problem is the reverse: Python was built using 2-byte Unicode characters, and
 | |
| the extension module was compiled using a Python with 4-byte Unicode characters.
 | |
| 
 | |
| This can easily occur when using pre-built extension packages.  RedHat Linux
 | |
| 7.x, in particular, provided a "python2" binary that is compiled with 4-byte
 | |
| Unicode.  This only causes the link failure if the extension uses any of the
 | |
| ``PyUnicode_*()`` functions.  It is also a problem if an extension uses any of
 | |
| the Unicode-related format specifiers for :c:func:`Py_BuildValue` (or similar) or
 | |
| parameter specifications for :c:func:`PyArg_ParseTuple`.
 | |
| 
 | |
| You can check the size of the Unicode character a Python interpreter is using by
 | |
| checking the value of sys.maxunicode:
 | |
| 
 | |
|    >>> import sys
 | |
|    >>> if sys.maxunicode > 65535:
 | |
|    ...     print('UCS4 build')
 | |
|    ... else:
 | |
|    ...     print('UCS2 build')
 | |
| 
 | |
| The only way to solve this problem is to use extension modules compiled with a
 | |
| Python binary built using the same size for Unicode characters.
 | 
