mirror of
				https://github.com/python/cpython.git
				synced 2025-10-31 10:26:02 +00:00 
			
		
		
		
	
		
			
				
	
	
		
			422 lines
		
	
	
	
		
			14 KiB
		
	
	
	
		
			Markdown
		
	
	
	
	
	
			
		
		
	
	
			422 lines
		
	
	
	
		
			14 KiB
		
	
	
	
		
			Markdown
		
	
	
	
	
	
| # Python WebAssembly (WASM) build
 | |
| 
 | |
| **WARNING: WASM support is highly experimental! Lots of features are not working yet.**
 | |
| 
 | |
| This directory contains configuration and helpers to facilitate cross
 | |
| compilation of CPython to WebAssembly (WASM). For now we support
 | |
| *wasm32-emscripten* builds for modern browser and for *Node.js*. WASI
 | |
| (*wasm32-wasi*) is work-in-progress
 | |
| 
 | |
| ## wasm32-emscripten build
 | |
| 
 | |
| For now the build system has two target flavors. The ``Emscripten/browser``
 | |
| target (``--with-emscripten-target=browser``) is optimized for browsers.
 | |
| It comes with a reduced and preloaded stdlib without tests and threading
 | |
| support. The ``Emscripten/node`` target has threading enabled and can
 | |
| access the file system directly.
 | |
| 
 | |
| Cross compiling to the wasm32-emscripten platform needs the
 | |
| [Emscripten](https://emscripten.org/) SDK and a build Python interpreter.
 | |
| Emscripten 3.1.8 or newer are recommended. All commands below are relative
 | |
| to a repository checkout.
 | |
| 
 | |
| Christian Heimes maintains a container image with Emscripten SDK, Python
 | |
| build dependencies, WASI-SDK, wasmtime, and several additional tools.
 | |
| 
 | |
| ```
 | |
| # Fedora, RHEL, CentOS
 | |
| podman run --rm -ti -v $(pwd):/python-wasm/cpython:Z quay.io/tiran/cpythonbuild:emsdk3
 | |
| 
 | |
| # other
 | |
| docker run --rm -ti -v $(pwd):/python-wasm/cpython quay.io/tiran/cpythonbuild:emsdk3
 | |
| ```
 | |
| 
 | |
| ### Compile a build Python interpreter
 | |
| 
 | |
| ```shell
 | |
| mkdir -p builddir/build
 | |
| pushd builddir/build
 | |
| ../../configure -C
 | |
| make -j$(nproc)
 | |
| popd
 | |
| ```
 | |
| 
 | |
| ### Fetch and build additional emscripten ports
 | |
| 
 | |
| ```shell
 | |
| embuilder build zlib bzip2
 | |
| ```
 | |
| 
 | |
| ### Cross compile to wasm32-emscripten for browser
 | |
| 
 | |
| ```shell
 | |
| mkdir -p builddir/emscripten-browser
 | |
| pushd builddir/emscripten-browser
 | |
| 
 | |
| CONFIG_SITE=../../Tools/wasm/config.site-wasm32-emscripten \
 | |
|   emconfigure ../../configure -C \
 | |
|     --host=wasm32-unknown-emscripten \
 | |
|     --build=$(../../config.guess) \
 | |
|     --with-emscripten-target=browser \
 | |
|     --with-build-python=$(pwd)/../build/python
 | |
| 
 | |
| emmake make -j$(nproc)
 | |
| popd
 | |
| ```
 | |
| 
 | |
| Serve `python.html` with a local webserver and open the file in a browser.
 | |
| 
 | |
| ```shell
 | |
| emrun builddir/emscripten-browser/python.html
 | |
| ```
 | |
| 
 | |
| or
 | |
| 
 | |
| ```shell
 | |
| ./Tools/wasm/wasm_webserver.py
 | |
| ```
 | |
| 
 | |
| and open http://localhost:8000/builddir/emscripten-browser/python.html . This
 | |
| directory structure enables the *C/C++ DevTools Support (DWARF)* to load C
 | |
| and header files with debug builds.
 | |
| 
 | |
| ### Cross compile to wasm32-emscripten for node
 | |
| 
 | |
| ```shell
 | |
| mkdir -p builddir/emscripten-node
 | |
| pushd builddir/emscripten-node
 | |
| 
 | |
| CONFIG_SITE=../../Tools/wasm/config.site-wasm32-emscripten \
 | |
|   emconfigure ../../configure -C \
 | |
|     --host=wasm32-unknown-emscripten \
 | |
|     --build=$(../../config.guess) \
 | |
|     --with-emscripten-target=node \
 | |
|     --with-build-python=$(pwd)/../build/python
 | |
| 
 | |
| emmake make -j$(nproc)
 | |
| popd
 | |
| ```
 | |
| 
 | |
| ```shell
 | |
| node --experimental-wasm-threads --experimental-wasm-bulk-memory --experimental-wasm-bigint builddir/emscripten-node/python.js
 | |
| ```
 | |
| 
 | |
| (``--experimental-wasm-bigint`` is not needed with recent NodeJS versions)
 | |
| 
 | |
| # wasm32-emscripten limitations and issues
 | |
| 
 | |
| Emscripten before 3.1.8 has known bugs that can cause memory corruption and
 | |
| resource leaks. 3.1.8 contains several fixes for bugs in date and time
 | |
| functions.
 | |
| 
 | |
| ## Network stack
 | |
| 
 | |
| - Python's socket module does not work with Emscripten's emulated POSIX
 | |
|   sockets yet. Network modules like ``asyncio``, ``urllib``, ``selectors``,
 | |
|   etc. are not available.
 | |
| - Only ``AF_INET`` and ``AF_INET6`` with ``SOCK_STREAM`` (TCP) or
 | |
|   ``SOCK_DGRAM`` (UDP) are available. ``AF_UNIX`` is not supported.
 | |
| - ``socketpair`` does not work.
 | |
| - Blocking sockets are not available and non-blocking sockets don't work
 | |
|   correctly, e.g. ``socket.accept`` crashes the runtime. ``gethostbyname``
 | |
|   does not resolve to a real IP address. IPv6 is not available.
 | |
| - The ``select`` module is limited. ``select.select()`` crashes the runtime
 | |
|   due to lack of exectfd support.
 | |
| 
 | |
| ## processes, signals
 | |
| 
 | |
| - Processes are not supported. System calls like fork, popen, and subprocess
 | |
|   fail with ``ENOSYS`` or ``ENOSUP``.
 | |
| - Signal support is limited. ``signal.alarm``, ``itimer``, ``sigaction``
 | |
|   are not available or do not work correctly. ``SIGTERM`` exits the runtime.
 | |
| - Keyboard interrupt (CTRL+C) handling is not implemented yet.
 | |
| - Resource-related functions like ``os.nice`` and most functions of the
 | |
|   ``resource`` module are not available.
 | |
| 
 | |
| ## threading
 | |
| 
 | |
| - Threading is disabled by default. The ``configure`` option
 | |
|   ``--enable-wasm-pthreads`` adds compiler flag ``-pthread`` and
 | |
|   linker flags ``-sUSE_PTHREADS -sPROXY_TO_PTHREAD``. 
 | |
| - pthread support requires WASM threads and SharedArrayBuffer (bulk memory).
 | |
|   The Node.JS runtime keeps a pool of web workers around. Each web worker
 | |
|   uses several file descriptors (eventfd, epoll, pipe).
 | |
| - It's not advised to enable threading when building for browsers or with
 | |
|   dynamic linking support; there are performance and stability issues.
 | |
| 
 | |
| ## file system
 | |
| 
 | |
| - Most user, group, and permission related function and modules are not
 | |
|   supported or don't work as expected, e.g.``pwd`` module, ``grp`` module,
 | |
|   ``os.setgroups``, ``os.chown``, and so on. ``lchown`` and `lchmod`` are
 | |
|   not available.
 | |
| - ``umask`` is a no-op.
 | |
| - hard links (``os.link``) are not supported.
 | |
| - Offset and iovec I/O functions (e.g. ``os.pread``, ``os.preadv``) are not
 | |
|   available.
 | |
| - ``os.mknod`` and ``os.mkfifo``
 | |
|   [don't work](https://github.com/emscripten-core/emscripten/issues/16158)
 | |
|   and are disabled.
 | |
| - Large file support crashes the runtime and is disabled.
 | |
| - ``mmap`` module is unstable. flush (``msync``) can crash the runtime.
 | |
| 
 | |
| ## Misc
 | |
| 
 | |
| - Heap memory and stack size are limited. Recursion or extensive memory
 | |
|   consumption can crash Python.
 | |
| - Most stdlib modules with a dependency on external libraries are missing,
 | |
|   e.g. ``ctypes``, ``readline``, ``sqlite3``, ``ssl``, and more.
 | |
| - Shared extension modules are not implemented yet. All extension modules
 | |
|   are statically linked into the main binary. The experimental configure
 | |
|   option ``--enable-wasm-dynamic-linking`` enables dynamic extensions
 | |
|   supports. It's currently known to crash in combination with threading.
 | |
| - glibc extensions for date and time formatting are not available.
 | |
| - ``locales`` module is affected by musl libc issues,
 | |
|   [bpo-46390](https://bugs.python.org/issue46390).
 | |
| - Python's object allocator ``obmalloc`` is disabled by default.
 | |
| - ``ensurepip`` is not available.
 | |
| - Some ``ctypes`` features like ``c_longlong`` and ``c_longdouble`` may need
 | |
|    NodeJS option ``--experimental-wasm-bigint``.
 | |
| 
 | |
| ## wasm32-emscripten in browsers
 | |
| 
 | |
| - The interactive shell does not handle copy 'n paste and unicode support
 | |
|   well.
 | |
| - The bundled stdlib is limited. Network-related modules,
 | |
|   distutils, multiprocessing, dbm, tests and similar modules
 | |
|   are not shipped. All other modules are bundled as pre-compiled
 | |
|   ``pyc`` files.
 | |
| - In-memory file system (MEMFS) is not persistent and limited.
 | |
| - Test modules are disabled by default. Use ``--enable-test-modules`` build
 | |
|   test modules like ``_testcapi``.
 | |
| 
 | |
| ## wasm32-emscripten in node
 | |
| 
 | |
| Node builds use ``NODERAWFS``.
 | |
| 
 | |
| - Node RawFS allows direct access to the host file system without need to
 | |
|   perform ``FS.mount()`` call.
 | |
| 
 | |
| # Hosting Python WASM builds
 | |
| 
 | |
| The simple REPL terminal uses SharedArrayBuffer. For security reasons
 | |
| browsers only provide the feature in secure environents with cross-origin
 | |
| isolation. The webserver must send cross-origin headers and correct MIME types
 | |
| for the JavaScript and WASM files. Otherwise the terminal will fail to load
 | |
| with an error message like ``Browsers disable shared array buffer``.
 | |
| 
 | |
| ## Apache HTTP .htaccess
 | |
| 
 | |
| Place a ``.htaccess`` file in the same directory as ``python.wasm``.
 | |
| 
 | |
| ```
 | |
| # .htaccess
 | |
| Header set Cross-Origin-Opener-Policy same-origin
 | |
| Header set Cross-Origin-Embedder-Policy require-corp
 | |
| 
 | |
| AddType application/javascript js
 | |
| AddType application/wasm wasm
 | |
| 
 | |
| <IfModule mod_deflate.c>
 | |
|     AddOutputFilterByType DEFLATE text/html application/javascript application/wasm
 | |
| </IfModule>
 | |
| ```
 | |
| 
 | |
| # WASI (wasm32-wasi)
 | |
| 
 | |
| WASI builds require [WASI SDK](https://github.com/WebAssembly/wasi-sdk) 15.0+
 | |
| and currently [wasix](https://github.com/singlestore-labs/wasix) for POSIX
 | |
| compatibility stubs.
 | |
| 
 | |
| ## WASI limitations and issues (WASI SDK 15.0)
 | |
| 
 | |
| A lot of Emscripten limitations also apply to WASI. Noticable restrictions
 | |
| are:
 | |
| 
 | |
| - Call stack size is limited. Default recursion limit and parser stack size
 | |
|   are smaller than in regular Python builds.
 | |
| - ``socket(2)`` cannot create new socket file descriptors. WASI programs can
 | |
|   call read/write/accept on a file descriptor that is passed into the process.
 | |
| - ``socket.gethostname()`` and host name resolution APIs like
 | |
|   ``socket.gethostbyname()`` are not implemented and always fail.
 | |
| - ``open(2)`` checks flags more strictly. Caller must pass either
 | |
|   ``O_RDONLY``, ``O_RDWR``, or ``O_WDONLY`` to ``os.open``. Directory file
 | |
|   descriptors must be created with flags ``O_RDONLY | O_DIRECTORY``.
 | |
| - ``chmod(2)`` is not available. It's not possible to modify file permissions,
 | |
|   yet. A future version of WASI may provide a limited ``set_permissions`` API.
 | |
| - User/group related features like ``os.chown()``, ``os.getuid``, etc. are
 | |
|   stubs or fail with ``ENOTSUP``.
 | |
| - File locking (``fcntl``) is not available.
 | |
| - ``os.pipe()``, ``os.mkfifo()``, and ``os.mknod()`` are not supported.
 | |
| - ``process_time`` does not work as expected because it's implemented using
 | |
|   wall clock.
 | |
| - ``os.umask()`` is a stub.
 | |
| - ``sys.executable`` is empty.
 | |
| - ``/dev/null`` / ``os.devnull`` may not be available.
 | |
| - ``os.utime*()`` is buggy in WASM SDK 15.0, see
 | |
|   [utimensat() with timespec=NULL sets wrong time](https://github.com/bytecodealliance/wasmtime/issues/4184)
 | |
| - ``os.symlink()`` fails with ``PermissionError`` when attempting to create a
 | |
|   symlink with an absolute path with wasmtime 0.36.0. The wasmtime runtime
 | |
|   uses ``openat2(2)`` syscall with flag ``RESOLVE_BENEATH`` to open files.
 | |
|   The flag causes the syscall to reject symlinks with absolute paths.
 | |
| - ``os.curdir`` (aka ``.``) seems to behave differently, which breaks some
 | |
|   ``importlib`` tests that add ``.`` to ``sys.path`` and indirectly
 | |
|   ``sys.path_importer_cache``.
 | |
| - WASI runtime environments may not provide a dedicated temp directory.
 | |
| 
 | |
| 
 | |
| # Detect WebAssembly builds
 | |
| 
 | |
| ## Python code
 | |
| 
 | |
| ```python
 | |
| import os, sys
 | |
| 
 | |
| if sys.platform == "emscripten":
 | |
|     # Python on Emscripten
 | |
| if sys.platform == "wasi":
 | |
|     # Python on WASI
 | |
| 
 | |
| if os.name == "posix":
 | |
|     # WASM platforms identify as POSIX-like.
 | |
|     # Windows does not provide os.uname().
 | |
|     machine = os.uname().machine
 | |
|     if machine.startswith("wasm"):
 | |
|         # WebAssembly (wasm32, wasm64 in the future)
 | |
| ```
 | |
| 
 | |
| ```python
 | |
| >>> import os, sys
 | |
| >>> os.uname()
 | |
| posix.uname_result(sysname='Emscripten', nodename='emscripten', release='1.0', version='#1', machine='wasm32')
 | |
| >>> os.name
 | |
| 'posix'
 | |
| >>> sys.platform
 | |
| 'emscripten'
 | |
| >>> sys._emscripten_info
 | |
| sys._emscripten_info(
 | |
|     emscripten_version=(3, 1, 8),
 | |
|     runtime='Mozilla/5.0 (X11; Fedora; Linux x86_64; rv:99.0) Gecko/20100101 Firefox/99.0',
 | |
|     pthreads=False,
 | |
|     shared_memory=False
 | |
| )
 | |
| >>> sys._emscripten_info
 | |
| sys._emscripten_info(emscripten_version=(3, 1, 8), runtime='Node.js v14.18.2', pthreads=True, shared_memory=True)
 | |
| ```
 | |
| 
 | |
| ```python
 | |
| >>> import os, sys
 | |
| >>> os.uname()
 | |
| posix.uname_result(sysname='wasi', nodename='(none)', release='0.0.0', version='0.0.0', machine='wasm32')
 | |
| >>> os.name
 | |
| 'posix'
 | |
| >>> sys.platform
 | |
| 'wasi'
 | |
| ```
 | |
| 
 | |
| ## C code
 | |
| 
 | |
| Emscripten SDK and WASI SDK define several built-in macros. You can dump a
 | |
| full list of built-ins with ``emcc -dM -E - < /dev/null`` and
 | |
| ``/path/to/wasi-sdk/bin/clang -dM -E - < /dev/null``.
 | |
| 
 | |
| ```C
 | |
| #ifdef __EMSCRIPTEN__
 | |
|     // Python on Emscripten
 | |
| #endif
 | |
| ```
 | |
| 
 | |
| * WebAssembly ``__wasm__`` (also ``__wasm``)
 | |
| * wasm32 ``__wasm32__`` (also ``__wasm32``)
 | |
| * wasm64 ``__wasm64__``
 | |
| * Emscripten ``__EMSCRIPTEN__`` (also ``EMSCRIPTEN``)
 | |
| * Emscripten version ``__EMSCRIPTEN_major__``, ``__EMSCRIPTEN_minor__``, ``__EMSCRIPTEN_tiny__``
 | |
| * WASI ``__wasi__``
 | |
| 
 | |
| Feature detection flags:
 | |
| 
 | |
| * ``__EMSCRIPTEN_PTHREADS__``
 | |
| * ``__EMSCRIPTEN_SHARED_MEMORY__``
 | |
| * ``__wasm_simd128__``
 | |
| * ``__wasm_sign_ext__``
 | |
| * ``__wasm_bulk_memory__``
 | |
| * ``__wasm_atomics__``
 | |
| * ``__wasm_mutable_globals__``
 | |
| 
 | |
| ## Install SDKs and dependencies manually
 | |
| 
 | |
| In some cases (e.g. build bots) you may prefer to install build dependencies
 | |
| directly on the system instead of using the container image. Total disk size
 | |
| of SDKs and cached libraries is about 1.6 GB.
 | |
| 
 | |
| ### Install OS dependencies
 | |
| 
 | |
| ```shell
 | |
| # Debian/Ubuntu
 | |
| apt update
 | |
| apt install -y git make xz-utils bzip2 curl python3-minimal ccache
 | |
| ```
 | |
| 
 | |
| ```shell
 | |
| # Fedora
 | |
| dnf install -y git make xz bzip2 which ccache
 | |
| ```
 | |
| 
 | |
| ### Install [Emscripten SDK](https://emscripten.org/docs/getting_started/downloads.html)
 | |
| 
 | |
| **NOTE**: Follow the on-screen instructions how to add the SDK to ``PATH``.
 | |
| 
 | |
| ```shell
 | |
| git clone https://github.com/emscripten-core/emsdk.git /opt/emsdk
 | |
| /opt/emsdk/emsdk install latest
 | |
| /opt/emsdk/emsdk activate latest
 | |
| ```
 | |
| 
 | |
| ### Optionally: pre-build and cache static libraries
 | |
| 
 | |
| Emscripten SDK provides static builds of core libraries without PIC
 | |
| (position-independent code). Python builds with ``dlopen`` support require
 | |
| PIC. To populate the build cache, run:
 | |
| 
 | |
| ```shell
 | |
| . /opt/emsdk/emsdk_env.sh
 | |
| embuilder build --force zlib bzip2
 | |
| embuilder build --force --pic \
 | |
|     zlib bzip2 libc-mt libdlmalloc-mt libsockets-mt \
 | |
|     libstubs libcompiler_rt libcompiler_rt-mt crtbegin libhtml5 \
 | |
|     libc++-mt-noexcept libc++abi-mt-noexcept \
 | |
|     libal libGL-mt libstubs-debug libc-mt-debug
 | |
| ```
 | |
| 
 | |
| ### Install [WASI-SDK](https://github.com/WebAssembly/wasi-sdk)
 | |
| 
 | |
| **NOTE**: WASI-SDK's clang may show a warning on Fedora:
 | |
| ``/lib64/libtinfo.so.6: no version information available``,
 | |
| [RHBZ#1875587](https://bugzilla.redhat.com/show_bug.cgi?id=1875587).
 | |
| 
 | |
| ```shell
 | |
| export WASI_VERSION=16
 | |
| export WASI_VERSION_FULL=${WASI_VERSION}.0
 | |
| curl -sSf -L -O https://github.com/WebAssembly/wasi-sdk/releases/download/wasi-sdk-${WASI_VERSION}/wasi-sdk-${WASI_VERSION_FULL}-linux.tar.gz
 | |
| mkdir -p /opt/wasi-sdk
 | |
| tar --strip-components=1 -C /opt/wasi-sdk -xvf wasi-sdk-${WASI_VERSION_FULL}-linux.tar.gz
 | |
| rm -f wasi-sdk-${WASI_VERSION_FULL}-linux.tar.gz
 | |
| ```
 | |
| 
 | |
| ### Install [wasmtime](https://github.com/bytecodealliance/wasmtime) WASI runtime
 | |
| 
 | |
| **NOTE**: wasmtime 0.37 has a bug. Newer versions should be fine again.
 | |
| 
 | |
| ```shell
 | |
| curl -sSf -L -o ~/install-wasmtime.sh https://wasmtime.dev/install.sh
 | |
| chmod +x ~/install-wasmtime.sh
 | |
| ~/install-wasmtime.sh --version v0.36.0
 | |
| ln -srf -t /usr/local/bin/ ~/.wasmtime/bin/wasmtime
 | |
| ```
 | |
| 
 | |
| ### Install [WASIX](https://github.com/singlestore-labs/wasix)
 | |
| 
 | |
| ```shell
 | |
| git clone https://github.com/singlestore-labs/wasix.git ~/wasix
 | |
| make install -C ~/wasix
 | |
| ```
 | 
