Issue #18960: Fix bugs with Python source code encoding in the second line.

* The first line of Python script could be executed twice when the source encoding (not equal to 'utf-8') was specified on the second line. * Now the source encoding declaration on the second line isn't effective if the first line contains anything except a comment. * As a consequence, 'python -x' works now again with files with the source encoding declarations specified on the second file, and can be used again to make Python batch files on Windows. * The tokenize module now ignore the source encoding declaration on the second line if the first line contains anything except a comment. * IDLE now ignores the source encoding declaration on the second line if the first line contains anything except a comment. * 2to3 and the findnocoding.py script now ignore the source encoding declaration on the second line if the first line contains anything except a comment.
2025-08-04 00:48:58 +00:00 · 2014-01-09 18:41:59 +02:00 · 2014-01-09 18:41:59 +02:00 · 7282ff6d5b
commit 7282ff6d5b
parent 766e10c4a8 768c16ce02
7 changed files with 87 additions and 5 deletions
--- a/Lib/idlelib/IOBinding.py
+++ b/Lib/idlelib/IOBinding.py
@ -64,6 +64,7 @@ encoding = locale_encoding  ### KBK 07Sep07  This is used all over IDLE, check!
                            ### 'encoding' is used below in encode(), check!

 coding_re = re.compile(r'^[ \t\f]*#.*coding[:=][ \t]*([-\w.]+)', re.ASCII)
+blank_re = re.compile(r'^[ \t\f]*(?:[#\r\n]|$)', re.ASCII)

 def coding_spec(data):
    """Return the encoding declaration according to PEP 263.
@ -93,6 +94,8 @@ def coding_spec(data):
        match = coding_re.match(line)
        if match is not None:
            break
+        if not blank_re.match(line):
+            return None
    else:
        return None
    name = match.group(1)