Python Notes
These are my personal notes that I use as a quick reference in my work.
See separate file with Python code snippets
py --version       # Windows
python3 --version  # Linux, outside of a virtual environment
python --version   # in a virtual environment
python -V          # all environments
python -c command [arg]
... quote command with single quotes.
Exit with ^z or quit() or exit(). ^d on *nix
python -m module_name [arg]
Run a module that is somewhere in the system path. Note: no ".py" because this is a module name, not a file name
Sort of equivalent to:
python
import module_name
python -m searches sys.path for the named module (without .py) and executes it as the main module. You can also import it (import to do "help" and see the doc).
python module_name.py [arg]
Multi-line\
command \
with a backslash
# This is a comment
Complex type summary:
Type | Literal | Mutability | Ordered | Access | Empty
---|---|---|---|---|---
String | '...' "..." """...""" | immutable | Y | s[0] is 1st char | ""
List | [a,] or [a,b] | mutable | Y | by offset: l[0] is 1st element | []
Tuple | (a,) or (a, b) | immutable | Y | by offset: t[0] is 1st element | ()
Dict | {"a":b, "c":d} | mutable | N | by key: d["k"]; keys are unique | {}
Set | {a, b} | mutable | N | by iteration; elements are unique | set()
File | n/a | | | |
Note:
A tuple is immutable. But items it contains, such as lists, can be mutable.
User-defined classes are mutable.
Never pass a mutable object as a default value for a parameter.
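A minimal sketch of the gotcha (the function names here are just for illustration):
def bad_append(item, bucket=[]):     # the default list is created once and shared across calls
    bucket.append(item)
    return bucket

def good_append(item, bucket=None):  # use None as the default and create the list inside
    if bucket is None:
        bucket = []
    bucket.append(item)
    return bucket

print(bad_append(1))   # [1]
print(bad_append(2))   # [1, 2]  <-- surprise: the same list is reused
print(good_append(1))  # [1]
print(good_append(2))  # [2]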
A module is basically a file with a .py extension: a_module.py.
Import into another module with import a_module.
Access the objects in the module with a_module.obj.
The module should be in the current directory or in the PYTHONPATH environment variable (see with sys.path).
Add a path: sys.path.append("full path")
See where the module was found: a_module.__file__
Provide an alias for the module: import a_module as the_module_alias
Import another module's objects into the current module's namespace/symbol table with from a_module import name1, name2.
A module can be executed as a script: python a_module.py. In this case, the "__name__" dunder variable is "__main__". The following skips code when the module is imported, meaning when it is not run as a stand-alone script:
if __name__ == "__main__":
In the file that I am executing: __name__ == "__main__"
In imported modules: __name__ == "module_name"  # the module name is the file name without .py
In functions: the_function_name.__name__ == "the_function_name"
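A minimal sketch of the guard (the module and function names are hypothetical):
# file a_module.py
def main():
    print("running as", __name__)

if __name__ == "__main__":
    main()  # runs with "python a_module.py", but not on "import a_module"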
A package is basically a sub-directory, which we will call a_package.
Be sure to put a file called "__init__.py" in the sub-directory a_package.
Import a module from the package with one of the two following lines:
from a_package import a_module
or
import a_package.a_module as a_mod
With __all__ = ["module1", "module2"] in the __init__.py, the listed modules are automatically loaded when doing from a_package import *
It is considered bad practice to use from a_package import *
Python adds the current directory to sys.path when running a script. See about PYTHONPATH below.
Troubleshooting for package structure:
Is there an __init__.py file in the sub-directory?
Is the PYTHONPATH environment variable set? When using pipenv, the easiest is to add export PYTHONPATH=. to the .env in the root of the project.
Is import pkg_name.module_name added to the top of the file? pkg_name is the name of the sub-directory or sub-directories. There is no .py after the module name.
It looks like you have to think in terms of the Python path sys.path, which is a list of directories. When running a script, the local directory is automatically one of the directories in sys.path. To import modules from another directory, here are the options:
import sys; sys.path.append("full path")
Add an __init__.py to the directory, then import as a package.
Put sys.path.append(r".") at the top of the first module when having difficulty with packaging.
Execute from the root. This works without __init__.py in the root.
File a.py ------------
import b.bmod
print(b.bmod.b_fctn()) # bmod module imports c.cmod
end a.py ------------
File b/__init__.py
File b/bmod.py ------------
import c.cmod
def b_fctn():
    return "calling: " + c.cmod.c_fctn()
end bmod.py ------------
File c/__init__.py
File c/cmod.py ------------
def c_fctn():
    return "Executing " + c_fctn.__name__
end cmod.py ------------
Alternative for file a.py above:
File a.py ------------
from b import bmod
print(bmod.b_fctn()) # Notice: no 'b.'
end a.py ------------
If you are in the b sub-directory, and if the PYTHONPATH is not set to the parent directory, then add the parent to sys.path.
The b/bmod.py file stays the same.
File b/b.py ------------
import sys
sys.path.append("..")
import bmod
print(bmod.b_fctn()) # bmod module imports c.cmod
end b.py ------------
The variable sys.path is a list of strings that determines the interpreter's search path for modules.
It is initialized from the directory containing the input script, the PYTHONPATH environment variable (a list of directory names, with the same syntax as the shell variable PATH), and an installation-dependent default.
Add directories to sys.path with the following code:
import sys
sys.path.append('/some/thing/python')
The best is to set the PYTHONPATH to the root of the project: export PYTHONPATH=.
With this, import the modules with the dot notation:
import folder.module
If you are in a sub-folder of the project's root folder, do: export PYTHONPATH=..
In pipenv, set it in the .env file:
### .env file
export PYTHONPATH=.
See details on package installation: https://packaging.python.org/tutorials/installing-packages/
README.md
.gitignore
LICENSE
Pipfile
app/
    __init__.py
    app.py
docs/
    conf.py
tests/
    test_basic.py
    test_advanced.py
Another structure:
Root:
.git
README.md
.gitignore
Pipfile
Pipfile.lock
abc # dir with source
common # folder
transformers # folder
tests # folder
common # folder for unit tests
transformers # folder for unit tests
integration_test # folder
configs # folder
print(" ", end='')
# end
supresses the end of line
dir(module_name)   # --> sorted list of strings with the names defined in the module
dir()              # --> currently defined names
dir(__builtins__)  # --> built-in names
Start interactive python shell:
import module
help()             # starts the interactive help utility
help(module)       # Shows information on the module (import first). Put docstrings in the .py file too, it helps a lot
help(object)       # Shows information on the object
help(object())     # Shows information on what the object returns
object.__dict__    # Also shows information on the object (note: __dict__, not __dir__)
object().__dict__  # Information on what the object returns
In interactive shell, the _ is the last value
My thoughts on styling:
If pip is not installed, do:
python3 -m ensurepip --default-pip
py -m ensurepip --default-pip
If necessary:
sudo apt install python3-pip
or
python3 -m pip --user
On Windows
py -m pip install ...
On Linux and Mac, do pip3:
pip3 install ...
sudo apt install python-numpy
sudo apt install python-matplotlib
c:\Python34\Scripts\pip.exe install matplotlib
c:\Python27\Scripts\pip.exe install matplotlib
Preferably install in a virtual environment.
Environment variables:
PYTHONHOME: location of the standard Python libraries (default: prefix/lib/pythonversion and exec_prefix/lib/pythonversion)
Set PYTHONHOME to prefix:exec_prefix
Install from a requirements file:
python3 -m pip install -r requirements.txt
Behind a firewall, you may have to add "--trusted-host pypi.org --trusted-host files.pythonhosted.org" as follows:
pip install --trusted-host pypi.org --trusted-host files.pythonhosted.org package...
Upgrade pip:
python3 -m pip install --upgrade pip setuptools wheel
Upgrade any package (notice same syntax as upgrade of pip):
python3 -m pip install --upgrade SomeProject
Replace python
with python3
in a virtual environment
Add these to the PATH environment variable:
C:\Users\..user..\AppData\Local\Programs\Python\Python38-32
C:\Users\..user..\AppData\Local\Packages\PythonSoftwareFoundation.Python. ....\LocalCache\local-packages\Python310\Scripts
If you set the PYTHONPATH env var to the root of the project, all imports will find the modules in the subdirectories.
set PYTHONSTARTUP=PYTHONSTARTUP.txt  # this is a file executed before the first prompt
See https://docs.python.org/3/using/cmdline.html#envvar-PYTHONHOME
Installing a version that is not current
Go to python.org, and look for the list of versions.
Choose a version with an installer, otherwise you will have to run the install scripts.
Or, install with the regular installer, with one of the options:
sudo apt install python3.8
sudo yum install python38
sudo amazon-linux-extras install python3.8
The version seems to be written with the "." for apt (python3.8) and without for yum (python38).
import sys
sys.argv[0]  # this is the command
sys.argv[1]  # first argument
s.py:
import sys
sys.exit(0)
# 0 is successful, 1 or more means error
s.py another option:
raise SystemExit(1)
python3 s.py
ret=$?  # get the return code now (or it is lost when another command is issued)
if [ ${ret} -ne 0 ]
then
# handle error
fi
Am I in a virtual environment? Run where python in Windows, or which python in Linux.
Online coding space: colab.research.google.com
Start by installing:
pip install pipenv
(on linux: pip3)
On Windows, add the following to the PATH environment variable (this assumes installation of python from python.org):
C:\Users\<username>\AppData\Roaming\Python\Python38\Site-Packages
C:\Users\<username>\AppData\Roaming\Python\Python38\Scripts
pip3 list  # list of installed packages
cd directory
pipenv shell --python .....python.exe
pipenv shell --python /usr/bin/python3
#This creates a new file "Pipfile"
#and creates the virtual env: see pipenv --venv
# in case of error, try re-installing: pip install pipenv
pipenv --venv # see where virtual env is stored
pipenv install pandas
pip list # shows all installed packages. Notice "pip" not "pipenv"
deactivate # deactivate the venv
Edit the Pipfile if needed
Move dev packages from [packages] to a section called [dev-packages], such as pytest, pylint, jupyter, ...
Pipfile.lock contains the exact versions of what was installed
pipenv install --ignore-pipfile # this installs the software from the pipfile.lock file instead
# in this way, I can reproduce the environment exactly as I tested it
pipenv install --dev # load with the dev-packages
# delete by removing the directory that is given by 'pipenv --venv'
# or:
cd project_directory_where_the_Pipenv_file_is_located
pipenv --rm
# existing installation:
# After first time, activate simply with :
pipenv shell # restart shell
pipenv install # installs everything in the Pipfile
deactivate # or exit
pipenv graph # shows the installed packages and the dependencies
If necessary, do python3 -m pipenv ...
Exit pipenv: exit or deactivate
For a different version of Python: edit the Pipfile and put version 3.7, then pipenv --python 3.7.
Or better: use virtualenv
Doc: https://pipenv.pypa.io/en/latest/basics/
If I get "Shell for UNKNOWN_VIRTUAL_ENVIRONMENT already activated
" then do "exit
" because I am still in a virtual environment
Run a file without opening a shell
pipenv run python a-py.py
Run in the virtual environment, but without having to do "exit" when done
pipenv run python
python -m venv name_of_virtual_env
cd name_of_virtual_env
Scripts\activate.bat
Scripts\pip install . . .
Scripts\python # to start shell
deactivate # when done
(Set the slashes appropriate for the operating system)
The venv module is standard, meaning that no installation is needed.
Don't put scripts into directory with virtual environment. And add to .gitignore
Delete the env by deleting the directory
pip list  # see all packages
pip freeze > requirements.txt
In another installation, use
pip install -r requirements.txt
Seems better for alternate version of Python
pip install virtualenv
Create environment
virtualenv env1
cd env1
source bin/activate  # Linux/Mac
Scripts\activate     # Windows
Exit:
deactivate
Virtual env with specific version of python. This requires installation of the
specific version (good luck: I have had varying degrees of success).
Initialize the virtual environment:
virtualenv -p path/python.exe a_dir
virtualenv -p C:\path\python.exe a_dir
virtualenv -p /Library/Frameworks/Python.framework/Versions/3.8/bin/python3 the_env
Use req file
pip install -r requirements.txt
List packages:
pip list
pipenv shell and pipenv run automatically load the .env file.
By default, it is at the root of the project. Set another location with PIPENV_DOTENV_LOCATION.
######
# file .env
THE_ENV_VAR=abc
THE_PATH=${HOME}/...:/etc/another/path
######
file aaa.py
import os
env_var = os.environ['THE_ENV_VAR']
the_path = os.environ['THE_PATH']
View all:
for k, v in os.environ.items():
    print(f"{k}: {v}")
pipenv run pytest
pipenv run lint
pipenv run tidy
pipenv run pytest
pylint file.py
pipenv run tidy
pipenv install pytest
pipenv install pylint
https://packaging.python.org/guides/tool-recommendations/
https://realpython.com/pipenv-guide
applications:
pip install -r requirements.txt
package:
setup.py
Configuration for vi:
:set syntax=python
or set the following (to be verified):
syntax enable
set tabstop=4
set shiftwidth=4
set expandtab
filetype indent on
set autoindent
or:
set sw=4 et ts=4 ai
set smartindent cinwords=if,elif,else,for,while,try,except,finally,def,class
Go to the directory where I want to create the egg:
python.exe setup.py bdist_egg
The second to last line of the output has the location of the resulting file
Open a .egg with 7zip to view
Options for passwords, access keys, and secrets
Improve code:
Documentation, being intuitive
Logging to std_out: use lib logging
Options for exception handling:
Orchestration tool: calls the run.py, which has configuration, initialization, and execution
Configuration in config.yaml. What is executed is the tuple (config, executable)
Meta file for job control: the source files that were successfully loaded
install pyyaml
Execute /Applications/Python 3.7/Install Certificates.command
this command replaces the root certificates of the default Python installation with the ones shipped through the certifi package
No declaration needed or even possible
Assignment: a_var = 3
Note: _ (underscore) is a variable. Generally used for a throw-away value
Names:
type(x)             # gives the type of variable x
if type(x) is str:  # tests if the type is string, list, ...
id()                # identity function
However, object type comparisons should always use isinstance() instead of comparing types directly:
Correct:
if isinstance(obj, int):
Wrong:
if type(obj) is type(1):
The "==
" operator tells us if the objects have the same value:
a == b
The "is" keyword tells us if the underlying objects are the same:
a is b
The preceding line is equivalent to
id(a) == id(b)
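Small illustration with two separately built lists:
a = [1, 2, 3]
b = [1, 2, 3]
print(a == b)  # True: same value
print(a is b)  # False: two distinct objects
c = a
print(a is c)  # True: both names refer to the same object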
0o (zero, letter o, in lower or upper case) is octal
0x (zero x) is hexadecimal
0b (zero b) is binary
Convert with hex(), bin(), oct()
a + bj # complex number
Naming conventions:
global xyz  # makes the variable global. Generally considered sloppy programming
Subclass of int
Literals True and False (initial cap, then lower case)
True is equivalent to int 1, False to int 0
Special characters:
\\
\'
\"
\a          # ASCII Bell (BEL)
\b          # ASCII Backspace (BS)
\f          # ASCII Formfeed (FF)
\n          # ASCII Linefeed (LF)
\N{name}    # Character with name in the Unicode database
\r          # ASCII Carriage Return (CR)
\t          # Tab
\uxxxx      # Character with 16-bit hex value xxxx (Unicode only)
\Uxxxxxxxx  # Character with 32-bit hex value xxxxxxxx (Unicode only)
\v          # ASCII Vertical Tab (VT)
\ooo        # Character with octal value ooo
\xhh        # Character with hex value hh
r'literal or raw string'        # raw string: backslashes are not treated as escapes
f'formatted string {variable}'  # inserts the value of the variable
a_str.startswith("start_to_look_for")
a_str.startswith("start_to_look_for", n)  # comparison starts at position n
Unicode:
"\u0394"      # Using a 16-bit hex value
"\U00000394"  # Using a 32-bit hex value
str is the type for text (unicode)
chr(i)  # returns the character with code i. "\u0234" is unicode
ord(c)  # with c as a Unicode character: returns an integer
hex(i)  # returns a hexadecimal string
int(s)  # convert string to integer
ord(c)  # return the character code
a_string[0:2]  # first 2 characters: substring from index 0 (inclusive) to index 2 (exclusive)
print("12345"[0:1]) # --> "1" (first element)
print("12345"[0]) # --> "1" (first element)
print("12345"[0:3]) # --> "123"
print("12345"[0:30]) # --> "12345"
print("12345"[1:-1]) # --> "234"
print("12345"[3:]) # --> "45"
print("12345"[:3]) # --> "123"
print("12345"[:-2]) # --> "123"
print("12345"[2]) # --> "3"
print("12345"[-1]) # --> "5" (last element)
print("12345"[0:0]) # --> empty
print("1234567890"[::3]) # --> "1470" (first in slice, then every third)
print("1234567890"[::-1]) # --> "0987654321" (reverse)
print("1234567890"[4:9:3]) # --> "58" (first in slice, which is "56789", then every third )
print("1234567890"[:]) # --> Copy
"""multi-
line
string"""
len("asd") #
gives length
str1 is str2 #
true if both identifiers refer to the same location
str1 == str2 #
true if the content is the same. str1 is str2 implies str1 == str2, but not the other way
s = s1 + s2 #
concatenate with "+"
s = 'string' 'string2' #
or string literals side by side
s = 'string' \
or one string on each line, with backslashes (end-of-line indicator)
'string2' #
3*'string' #
-->repeats
See operators below
In py3, if s is a string:
s.split(',')    # if the argument is null, then splits based on whitespace. Consecutive white spaces are counted as one.
s.find('asdf')  # finds 'asdf' in the string. -1 if not found
"dog" in "the quick dog."  # determines if a string contains a string
In py2, import the string module and do the following:
string.split(s, ',')    # if null, then splits based on whitespace. Consecutive white spaces counted as one.
str.split()             # alternate
string.find(s, 'asdf')  # find the 'asdf' in the string. -1 if not found
escape: (need followup)
repr(s)
triple quotes
"\""   # contains one char: "
r"\""  # contains two chars: \ and " (this is a raw string)
b"\""  # bytes, see below
f"text {expr} text"  # formatting, see below
a_str.replace('"', '\\"').replace("'", "\\'")
a_str.rstrip('\r\n') #
remove combinations of trailing line feeds and carriage returns
a_str.startswith(begin_str) #
return true if a_str starts with beginstr
a_str.startswith(begin_str, n) #
comparison starts at position n
a_str.ljust(5, ' ') #
Left justify, and pad on the right
Prepare map for patterns:
mp = str.maketrans('abcdefghijklmnopqrstuvwxyz' + 'abcdefghijklmnopqrstuvwxyz'.upper() + '0123456789', 'a'*26 + 'A'*26 + '9'*10)
Translate:
a_string.translate(mp)
Show character code instead of character for special characters:
''.join([c if ord(c)<255 else "[" + str(ord(c)) + "]" for c in list(the_string)])
try out and complete documentation
f"text {var} or {expression}"
f"text {numeric:8.2f}" #
2 digits after the decimal point, and 8 characters in all
If the output does not fit, it is expanded.
f"text {expression=}" #
with "=" before the colon ":", the expression precedes the value.
Some options:
'>5.2f;'
Right align within 5 characters for a float with 2 after the decimal'^5'
Center within 5 characters for a string'0'
Left pad with zeros','
Add a comma for thousands separator'+'
Show sign for both positive and negative numbers'-'
Show sign for only negative numbers'5i', '5d'
integer with sign, 5 spaces or more 'o'
Unsigned octal'x', 'X'
hexadecimal (lower , upper case)'e', 'E'
exponential (lower, upper case).'g', 'G'
Equivalent to 'f'
, or to 'e'
for large or small numbers 'c'
single character'r'
string (based on repr()).'s'
string (based on str()).Literal curly brackets: {{
and }}
.
Note that single quotes have to be used inside the curly brackets when the f-string is surrounded by double quotes, and vice versa.
Note that you cannot use a backslash inside the curly brackets.
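A few of the options above in one quick sketch:
n = 1234567.891
s = "abc"
print(f"{n:>15,.2f}")  # right align in 15 chars, thousands separator, 2 decimals: '   1,234,567.89'
print(f"{s:^7}")       # center in 7 chars: '  abc  '
print(f"{42:08d}")     # pad with zeros: '00000042'
print(f"{n=:.1f}")     # '=' shows the expression: 'n=1234567.9'
print(f"{{literal braces}}")  # '{literal braces}'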
bytes is the type for data or 8-bit strings; a literal is b"asdf".
They can be considered lists of small integers (0-255).
"Déjà vu".encode('utf-8') gives b'D\xc3\xa9j\xc3\xa0 vu'
UTF-8 is the de-facto standard: it is generally safe to assume encoding="utf-8".
Convert string to bytes (both options are equivalent):
byts = bytes("abc", "utf-8")
byts = "abc".encode("utf-8")
To bytes (two hexadecimal digits per byte; ignores white space):
bytes.fromhex('2Ef0 F1f2 ')
From bytes to a hex string (the separator is optional, -2 groups every two hexadecimal digits, starting from the left):
b'2ef0 f1f2'.hex(" ", -2)
bytearray objects are a mutable counterpart to bytes objects.
Write bytes to and read bytes from files:
open(name, "rb")
open(name, "wb")
You cannot specify the encoding in binary mode.
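A round trip between str and bytes, plus writing and reading the raw bytes (the file name is hypothetical):
byts = "Déjà vu".encode("utf-8")   # str -> bytes
txt = byts.decode("utf-8")         # bytes -> str
with open("dump.bin", "wb") as f:  # write raw bytes
    f.write(byts)
with open("dump.bin", "rb") as f:  # read raw bytes
    assert f.read() == byts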
+   # concatenation
"abc" * 3  # repeat 3 times, referencing the same element. Leads to odd cases resembling pointer issues in C
//  # floor division (division of two integers with result as integer)
from __future__ import division  # Python 2 only: makes "/" behave as true division
x > 10 and x < 20 is equivalent to 10 < x < 20
+= -=  # a += b is a = a + b
Operator precedence (low to high)
Operators | Comments
---|---
lambda |
x if condition else y | Conditional expression
or |
and |
not x |
in, not in, is, is not, <, <=, >, >=, <>, !=, == | Comparisons, membership, identity
| | Bitwise OR
^ | Bitwise XOR
& | Bitwise AND
<<, >> | Shift operators
+, - | Addition and subtraction
*, /, //, % | Multiplication, division, remainder
+x, -x, ~x | Unary plus, unary minus, bitwise NOT
** | Exponentiation
x[index], x[index:index], x(arguments...), x.attribute | Subscription, slicing, call, attribute reference
(expressions...), [expressions...], {key: value...}, 'expressions...' | Binding or tuple display, list display, dictionary display, string conversion
Method overriding cases, comparing main.py, child_kls.py, and generic_kls.py (the code examples are not shown here):
ghost calls ghost: not an issue
non-ghost calls ghost: not an issue
non-ghost calls non-ghost: different results, but not sneaky
ghost calls method not in parent: not an issue
non-ghost calls method not in parent: not an issue. The different names make it clear.
TO DO:
Group by:
Convert from date or time or datetime to string, including timezone
Display a time or a date (what does not show in above)
Convert from string to date or time, including tz
Add, subtract days, hours
Display delta
Convert delta to int: first to seconds with timedelta_var.total_seconds()
Then to minutes, hours, etc.
Current date or time:
import datetime as dt
dt.datetime.now()
import time
time.strftime("%Y-%m-%d", time.localtime())
Naive means the object has no timezone information, aware means it does.
A datetime object d is aware if both of the following hold:
d.tzinfo is not None
d.tzinfo.utcoffset(d) does not return None. Notice "d" as the parameter
A time object t is aware if both of the following hold:
t.tzinfo is not None
t.tzinfo.utcoffset(None) does not return None. Notice "None" as the parameter
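Quick check of these rules with the built-in timezone.utc (no pytz needed):
from datetime import datetime, timezone
naive = datetime.now()
aware = datetime.now(timezone.utc)
print(naive.tzinfo)                                 # None --> naive
print(aware.tzinfo, aware.tzinfo.utcoffset(aware))  # UTC 0:00:00 --> aware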
import time
curr_dttm = time.strftime("%Y%m%d") + "_" + time.strftime("%H%M%S")
strptime(date_string, format)
import datetime as dt
curr_dttm = dt.datetime.strftime(dt.datetime.now(), "%Y%m%d_%H%M%S")
curr_dttm = dt.datetime.strftime(dt.datetime.now(), "%Y%m%d") + "_" + dt.datetime.strftime(dt.datetime.now(), "%H%M%S")
datetime.datetime.now().isoformat(sep='T', timespec='microseconds')
# timespec in ['auto', 'hours', 'minutes', 'seconds', 'milliseconds', 'microseconds']. Do not use utcnow() because some methods are naive.
datetime.fromtimestamp(timestamp, tz=timezone.utc)  # returns a datetime from a POSIX timestamp; datetime.timestamp() is the inverse
datetime.fromisoformat(date_string) and datetime.now().isoformat() are inverse operations
ISO format: YYYY-MM-DD[*HH[:MM[:SS[.fff[fff]]]][+HH:MM[:SS[.ffffff]]]]
where * can match any single character.
datetime.astimezone(tz=None)  # if tz is None, then return in the system timezone
import time; time.time()      # current epoch time
from module datetime:
class datetime.date      # naive date
class datetime.time      # time independent of date, includes attribute tzinfo
class datetime.datetime  # date and time, includes attribute tzinfo.
In the constructor, the year, month, and day are mandatory; hour, sec, etc. default to 0
class datetime.timedelta # constructor: datetime.timedelta(days=0, seconds=0, microseconds=0, milliseconds=0, minutes=0, hours=0, weeks=0). Arguments may be < 0 or > 0
class datetime.tzinfo
from datetime import datetime  # datetime objects
datetime.date(year, month, day) returns a date
datetime.time(hour, minute, second, microsecond, tzinfo)
datetime.datetime(year, month, day, hour, minute, second, microsecond, tzinfo)
Note: objects of the date type are always naive, meaning that they are not aware of the time zone
datetime.today()    # class method
datetime.now([tz])
datetime.utcnow()
datetime.combine(date, time)
dt of type datetime:
dt.strftime(format)
dt.year  # class attribute
dt.month
dt.day
dt.hour
dt.minute
dt.second
dt.microsecond
dt.tzinfo
dt.date()  # instance method
dt.time()
dt.timetz()
import time
t1=time.perf_counter()
.......
t2=time.perf_counter()
print (t2-t1) # in seconds
dt + timedelta
dt - timedelta
dt2 - dt1 --> timedelta
dt2 < dt1
from datetime import timedelta
timedelta([days[, seconds[, microseconds[, milliseconds[, minutes[, hours[, weeks]]]]]]])
All arguments optional / default to 0. Arguments may be ints, longs, or floats, and may be positive or negative. Down to microsecond resolution.
example: timedelta(weeks=40, days=84, hours=23)
Operations: + -
* integer or long
t2 // integer
+t
-t
abs(t)
str(t), repr(t)
from datetime import date
date.today() == date.fromtimestamp(time.time())
dt of type date:
dt.year dt.month dt.day
dt.weekday : 0=Mon, 6=Sun
dt.isoweekday : 1=Mon, 7=Sun
dt.isocalendar() --> 3-tuple, (ISO year, ISO week number, ISO weekday)
dt.isoformat() --> date in ISO 8601 format, 'YYYY-MM-DD'
dt.strftime(format)
dt + timedelta
dt - timedelta
dt2 - dt1 --> timedelta
dt2 < dt1
import time
time.sleep(5)  # sleep 5 seconds
Convert a delta to a number: first convert to seconds with timedelta_var.total_seconds(), then to minutes, hours, etc.
Or divide directly with timedelta(days=1), timedelta(hours=1), etc.:
timedelta_var / timedelta(days=1)
timedelta_var / timedelta(hours=1)
timedelta_var / timedelta(minutes=1)
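Example of both conversions on a made-up 90-minute delta:
from datetime import timedelta
td = timedelta(hours=1, minutes=30)
print(td.total_seconds())         # 5400.0
print(td / timedelta(minutes=1))  # 90.0
print(td / timedelta(hours=1))    # 1.5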
datetime.strptime(a_date, "%Y-%m-%d").date()
# .date() to get just the date, excluding time
# from date object to string:
date_obj.strftime("%Y-%m-%d")
# set a date object:
datetime(2022, 1, 1).date()
(dt.datetime.combine(dt.datetime.now().date(), dt.time(0,0)) + td).strftime("%H:%M")
Use time.perf_counter() (not time.time()) for time stamps when analyzing performance.
import datetime as dt
import pytz
# py -m pip install pytz
See all time zones with: pytz.all_timezones
Current time in a specific timezone:
print(dt.datetime.now(pytz.timezone('US/Eastern')))
Sample script:
my_tz_lst = [ {"abbr": "West", "nm": "US/Pacific"}, {"abbr": "Gal", "nm": "Pacific/Galapagos"}, {"abbr": "Andes", "nm": "America/Lima"}, {"abbr": "BOS", "nm": "US/Eastern"}, {"abbr": "NL", "nm": "America/St_Johns"}, # Newfoundland {"abbr": "IS", "nm": "Atlantic/Reykjavik"}, {"abbr": "UTC", "nm": "UTC"}, {"abbr": "Dtza", "nm": "Africa/Bamako"}, {"abbr": "Eur", "nm": "Europe/Paris"}, {"abbr": "ZNZ", "nm": "Africa/Dar_es_Salaam"}, {"abbr": "Ind", "nm": "Asia/Kolkata"}, {"abbr": "Phi", "nm": "Asia/Manila"}, ] for t in my_tz_lst: print(f"|{t['abbr']:^7}", end="") print("|") for t in my_tz_lst: print(f"| {dt.datetime.strftime(dt.datetime.now(pytz.timezone(t['nm'])), '%H:%M'):5} ", end="") print("|")
import csv
#https://docs.python.org/3.4/library/csv.html
fn_in = "test_csv_in.csv"
fn_out = "test_csv_out.csv"
with open(fn_in, newline='') as fi:
    with open(fn_out, 'w', newline='') as fo:
        lines_in = csv.reader(fi, delimiter=',', quotechar='"')
        lines_out = csv.writer(fo, delimiter=';', quotechar='|', quoting=csv.QUOTE_MINIMAL)
        for row_in in lines_in:
            lines_out.writerow(row_in)
CSV Format
Excel File
For sqlite, see Database Connections.
import pymysql, pandas as pd
Question: how do you escape a literal "%" in the query string? (See note below.)
db_conn = pymysql.connect(host=db_name, user=mysql_username, password=mysql_password, database=db_schema,charset="utf8")
the_params = {"abc":var1, "defg": var_b + "%"}
the_sql = "select ... where col1 = %(abc)s and col_b like %(defg)s ;"
just_assets = pd.read_sql_query(the_sql , db_conn, params=the_params)
# or (not tested yet)
the_params = (var1, var_b + "%")
the_sql = "select ... where col1 = %s and col_b like %s ;"
"%"
? "%%"
does not seem to work.
An error occurs using the connector from mysql.connector
Note that "--,"
throws an error in MySQL. Put a space after the dash-dash
A plot consists of a figure and one or more axes.
http://matplotlib.org/users/pyplot_tutorial.html
https://matplotlib.org/stable/gallery/index.html
c:\Python27\Scripts\pip.exe install matplotlib
sudo yum install python-matplotlib
import matplotlib.pyplot as plt
...
plt.plot(x, y)        # line plot
plt.scatter(x, y)     # scatter plot
plt.hist(x)           # histogram
plt.hist(x, bins=20)  # histogram with 20 buckets (default is 10)
plt.imshow(a_matrix)  # plots a matrix as an image
plt.imshow(a_matrix, cmap='gray')  # plots a matrix as an image in gray scale
plt.xlabel("...")
plt.ylabel("...")
plt.title("...")
plt.show()
r=np.random.randn(10000,2)
plt.scatter(r[:,0],r[:,1])
plt.savefig() to write to file
subplot(nrows, ncols, plot_number)
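A sketch of the same kind of calls in the figure/axes (object-oriented) form mentioned above; the data is made up:
import numpy as np
import matplotlib.pyplot as plt
x = np.linspace(0, 10, 100)
fig, (ax1, ax2) = plt.subplots(nrows=1, ncols=2)  # one figure, two axes
ax1.plot(x, np.sin(x))
ax1.set_title("line")
ax2.hist(np.random.randn(1000), bins=20)
ax2.set_title("histogram")
fig.savefig("two_plots.png")  # or plt.show()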
MNIST dataset: https://kaggle.com/c/digit-recognizer
http://effbot.org/tkinterbook/tkinter-index.htm
https://wiki.python.org/moin/TkInter
http://www.tcl.tk/man/tcl8.4/TkCmd/contents.htm
import Tkinter  # capital T in py 2, lowercase in py 3
import tkMessageBox
tkMessageBox.showinfo(title, text)
# instead of showinfo: showwarning, showerror, askquestion, askokcancel, askyesno, and askretryignore
#### simple GUI
from Tkinter import *
root = Tk()  # top level window is called root by convention
my_frame1 = Frame(root)
my_frame1.pack()  # 3 geometry managers: pack, grid, place
# add this after packing the container:
b1 = Button(my_frame1)
b1["text"] = "Hi there"
b1["background"] = "green"
b1.pack()
root.mainloop()  # waiting
#### same GUI, but with a class
from Tkinter import *
class App:
    def __init__(self, the_parent):
        self.parent = the_parent  # remember the parent
        self.my_frame1 = Frame(self.parent)
        self.my_frame1.pack()
        self.b1 = Button(self.my_frame1)
        self.b1["text"] = "Hello, World!"
        self.b1["background"] = "green"
        self.b1.pack()
        self.b1.bind("<Button-1>", self.b1Click)
        self.b2 = Button(self.my_frame1)
        self.b2.configure(text="Hello, World!", background="green")
        self.b2.pack()
        self.b2.bind("<Button-1>", self.b2Click)
        self.b3 = Button(self.my_frame1, text="Hello, World!", background="green")
        self.b3.pack()
        self.b3.bind("<Button-1>", self.b3Click)
root = Tk()
app = App(root)
root.mainloop()
# binding
widget.bind(event_type_name, event_handler_name)
# Buttons
width attribute is in characters
command handler expects button press AND button release
padx, pady: string with unit. "2m" is 2 mm
# Frame parameters
borderwidth=5
relief=RIDGE
height=50 width=50  # Note: often ignored when widgets are added
background="white"
# Frame packing
frames have internal and external padding: ipadx, ipady and padx, pady
The padding is specified when packing
Internal is around the widgets
side=TOP | BOTTOM | LEFT | RIGHT
fill=BOTH
expand=YES
Note:
when packing, there is a cavity
As widgets are added, they claim area in the cavity, but do not use it all.
With option expand=YES, the widget claims the whole area, but will not use all of it.
With option fill, it will also grow to use the whole area. fill=X, fill=Y, or fill=BOTH. fill=NONE does not let it grow.
With option anchor="N", the widget's position will be at top. Other values: NE (north east), CENTER, ...
See tt100.py in the thinking_in_tkinter directory
Code that is ready to use:
import sqlite3
sqlite_db_filename = "temp_test.db"
with sqlite3.connect(sqlite_db_filename) as cn:
    # The context manager handles the transaction: if there are no errors, it is committed
    # when leaving the context; if an error occurs, it is rolled back.
    # Note: the context manager does not close the connection itself.
    # "autocommit" is False by default.
    cr = cn.cursor()
    # no contexts with cursors unfortunately
    cr.execute("drop table if exists t")
    cr.execute("create table t (n integer, a text)")
    cr.execute("insert into t (n, a) values (1, 'aa')")
    cr.execute("insert into t (n, a) values (2, 'bb')")
    # Bulk insert:
    # cn.executemany("insert into a_table (a,b,c) values (?,?,?)", list_of_tuples)
    cn.commit()
    # force a commit so that a subsequent failed operation does not make this get rolled back.
    # Note that there is no "begin transaction". A transaction begins when the connection is created,
    # just after a commit, or just after a rollback.
    # If there is no "try except", then execution stops and the transaction is rolled back.
    # This is fine for ad-hoc scripts. For more robust applications, trap the error with "try except".
    try:
        cr.execute("insert into t (n, a) values (0, 'x')")  # this should be successful
    except Exception as e:
        cn.rollback()
        print("rollback 1")
    else:
        cn.commit()
        print("commit 1")
    try:
        cr.execute("insert into t (n, b) values (1, 'x')")  # this should fail (note the field name)
    except Exception as e:
        cn.rollback()
        print("rollback 2")
    else:
        cn.commit()
        print("commit 2")
    sql_q = """select n, a
               from t
               where n = ?
            """
    # sqlite also accepts named parameters with the :name syntax
    rslt_lst = [r for r in cr.execute(sql_q, (2, ))]
    # this runs through the cursor, so you can reuse the cursor for another query
    print(cr.description)  # show column names
    for fld_a, fld_b in rslt_lst:  # this puts a name on the fields so as not to use positions, which can be confusing
        # put "fld_..." to help identify the results of the query
        print(f"{fld_a} {fld_b}")
    cr.close()
    # no contexts with cursors unfortunately; close explicitly
When working with cursors:
Close cursors after use.
Do connection_object.commit() after any inserts, updates, or deletes.
Put the results in a list or create a second cursor if you are inside a loop that loops through the first cursor.
import pyodbc
sources = pyodbc.dataSources()
print(sources.keys())  # show the defined ODBC sources
# I had installed Python 32 bit and was trying to connect via ODBC 64 bit.
# I re-installed Python, but the 64 bit version. It is working.
C:\Python37\scripts\pip.exe install cx_oracle
import cx_Oracle  # upper case "O"
Note that the ocbc or oracle connection objects must have the same architecture (32 bit or 64 bit) as the version of Python
For some reason, the 32-bit version of Python was first installed (surprising in this day and age - 2019). I forced the installation of 64 bit Python and it worked
def is32or64():
    import sys
    if sys.maxsize == 2147483647:
        print("Python 32bit version")
    elif sys.maxsize == 9223372036854775807:
        print("Python 64bit version")
    else:
        print("sys.maxsize=", sys.maxsize)
Python code:
import cx_Oracle
def call_myquery(qry: str, params: dict):
    # params is a dictionary in the form {"bind1": data1, "bind2": data2}
    # The SQL has the bind variables as follows:
    # select ... from ... where col1 = :bind1 and col2 = :bind2
    # Do not put quotes around the bind variables
    # No ';' at the end of the query
    rslt_lst = []
    with cx_Oracle.connect(username, password, "host:port/database") as connection:
        with connection.cursor() as cursor:
            try:
                results = cursor.execute(qry, params)
                cols = [fld[0] for fld in cursor.description]
                # cursor.description contains
                # name, type, display_size, internal_size, precision, scale, null_ok
                # for each column; fld[0] is just the name
                rslt_lst = [{c: r for c, r in zip(cols, row)} for row in results]
            except Exception as e:
                print(f"Error on executing {qry} due to {e}")
            # cursor.close() not needed with the "with" statement
        # connection.close() not needed with the "with" statement
    return rslt_lst
qry = "select ... where a like :the_bind_var || '%' "
params = {"the_bind_var": "a value"}
rslt = call_myquery(qry, params)
Stored Procedure:
CREATE OR REPLACE PROCEDURE myproc (a IN VARCHAR2, c IN OUT SYS_REFCURSOR) AS
BEGIN
OPEN c FOR SELECT * FROM myschema.mytable WHERE aa = a;
END;
/
Python code:
import cx_Oracle
def call_myproc(a):
    with cx_Oracle.connect("username", "password", "hostname:port/service_name") as connection:
        with connection.cursor() as cursor:
            # Declare a cursor variable for the OUT parameter
            result_set = cursor.var(cx_Oracle.CURSOR)
            try:
                cursor.callproc('myproc', [a, result_set])
                result_cursor = result_set.getvalue()
                # Iterate over the result set and print the values
                for row in result_cursor:
                    print(row)
                # or do fetchall()
            except Exception as e:
                print(f"Error calling myproc: {e}")
call_myproc('value_for_a')
import numpy as np
a = np.array([...])  # Note: cannot do .append() with an np array. Access the first element with a[0]
a.shape
gives the shape
Just a few reminders: a vector has one dimension, a matrix has two.
Element-wise operations: + * ** np.sqrt(a) np.log(a) np.exp(a)
Element-wise product requires that operands have the same shape.
a*b # list where each element is ai * bi
This is an element-wise product.
A + 4  # this is "broadcasting", where 4 is added to all the elements of A
np.dot(a,b) = sum(ai * bi)
Dot product, or inner product
This is a dot product of two vectors and assumes that assumes both have the same length.
Dot product of two matrices A*B requires that the inner dimensions match.
np.dot(a,b) or a.dot(b) or b.dot(a) or a@b
# note that a * b is element-wise multiplication, not the dot product
np.inner(a,b) is the same as np.dot(a,b)
np.sum(array) or a.sum()
are the sum of the elements
np.outer(a,b)      # outer product (like a cartesian product)
np.linalg.norm(a)  # the magnitude of a: |a| or sqrt(sum(a * a))
np.linalg.inv(a)   # inverse of a. a.dot(np.linalg.inv(a)) gives the identity matrix (a must be square). The identity matrix has 1 on the diagonal, 0 elsewhere.
a.T is the transpose. Note: vector.T == vector
np.linalg.det(a)   # determinant of a
np.diag(2D_array)  # diagonal: a vector of the elements on the diagonal
np.diag(1D_array)  # a 2-D array with the elements on the diagonal, and 0 elsewhere
np.trace(a)        # sum of the diagonal: np.diag(a).sum()
np.allclose(a,b)   # the difference between the two parameters is small (but not necessarily zero)
two dimensional array (using matrix() is not recommended, use array() instead)
m = np.array([ [...] , [...] ])
Access with m[i][j] or m[i,j]
By convention, the first index is row, second is column
m[:,1]     # --> all rows, second column (column 1). Read ":" as "all"
m[:,:8]    # --> all rows, first 8 columns
m[:,:-2]   # --> all rows, all but the last 2 columns
m[:,-2:]   # --> all rows, last 2 columns
ary.ndim   # --> gives the number of dimensions
ary.reshape((3,4))  # --> gives a new shape
Random generator
np.random.normal(size=n, loc=mu, scale=stdev)  # --> array of n random numbers from a normal distribution with mean mu and standard deviation stdev (scale is the standard deviation, not the variance)
np.corrcoef(x,y)          # --> correlation matrix
np.random.default_rng(n)  # --> create a random Generator seeded with n
np.random.random((n,m))   # argument is a tuple
np.random.randn(n,m)      # gaussian distribution. Note that the argument is NOT a tuple as above
.mean() .var()
np.random.multivariate_normal(mean=np.array([1,2]), cov=np.array(2x2array), size=1000)
np.mean(x) or x.mean()
np.var(x) or x.var()
np.std(x) or x.std()
Generate arrays:
np.array(a list)
np.zeros(n) or np.zeros((n,m))  # argument is a tuple
np.ones(n) or np.ones((n,m))
np.eye(n)  # identity matrix
np.linspace(start_point, end_point, number_of_points)  # create a list of points. Example: np.linspace(0,1,11)
nparray.reshape(n,m)  # with n,m as the new dimensions
eigenvalue/vector (http://setosa.io/ev/eigenvectors-and-eigenvalues/)
Convention: rows are the samples, columns are the features. E.g. np.random.randn(100,3) : 100 samples, 3 features
np.cov(a) gives the covariance (you might have to do np.cov(a.T))
Reminder: symmetric means a = a.T; hermitian means a equals its conjugate transpose
eigenvalues, eigenvectors = np.linalg.eig(a)   # returns a tuple
eigenvalues, eigenvectors = np.linalg.eigh(a)  # for symmetric and Hermitian matrices
Solve linear system Ax=b
np.linalg.inv(A).dot(b)
or np.linalg.solve(A,b)
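Small example of both forms on a made-up 2x2 system; np.allclose confirms they agree:
import numpy as np
A = np.array([[3., 1.],
              [1., 2.]])
b = np.array([9., 8.])
x1 = np.linalg.solve(A, b)    # preferred
x2 = np.linalg.inv(A).dot(b)  # same result, but slower and less stable
print(x1, np.allclose(A.dot(x1), b), np.allclose(x1, x2))  # [2. 3.] True True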
read file
X = []
for line in open("the_file.csv"):
    row = line.split(',')
    sample = list(map(float, row))
    X.append(sample)
X = np.array(X)
import numpy as np
np.select([cond1, cond2, ...], [exp1, exp2, ...], default=exp3)
np.where(cond1, exp1, np.where(cond2, exp2, ...))
# use numpy.where:
df[col3] = np.where((df[col1]-df[col2])>0, df[col1]-df[col2], 0)
# equiv to if col1>col2 then return col1-col2 else return 0
df["aaaaaa"] = np.select(
[ df["col1"] == 'a value'
, np.logical_and(df["col2"] == 'a value', df["col3"].isin(('a','b')))
, np.logical_and(df["col2"] == 'a value', ~df["col3"].isin(('a','b'))) # not in
]
,
[ df["cola"]
, df["colb"]
, df["colc"]
]
, "default"
)
Fast Fourier Transform:
Y=np.fft.fft(y)
Convert to matrix:
import pandas as pd
The conversion to a matrix is optional. Note: as_matrix() was removed from recent pandas; use to_numpy() instead.
df = pd.read_csv("the_file.csv", header=None, sep=",").to_numpy()
Y=np.fft.fft(y)
nltk.download() #
manage resources for nltk package
import pandas as pd
import numpy as np
Read to a data frame, with a previously opened connection
df = pd.read_sql("SELECT top "+str(NBR_ROWS)+" * from the_table ;", cn)
Read and write CSV:
df = pd.read_csv("the_file.csv", header=None, sep=",")
df.to_csv("filename.csv", index = False, header=True, sep="\t", compression="gzip")
Parameter header is the row with the column headers: 0 for the first row, None for no headers.
If 1 or more, the previous rows are ignored.
Parameter sep is the column separator.
index=False removes the added index column.
Note: engine="python" (default is C) to get other options
df.to_excel('filename.xlsx')  # Export data frame to Excel. Note that Excel does not handle special characters very well.
df = pd.json_normalize(something_in_json_format)  # Normalize semi-structured JSON data into a flat table (gives structure.element notation)
In the department of useless error messages:
ValueError: only single character unicode strings can be converted to Py_UCS4, got length 0
It seems that this happens when loading a csv into Excel with sep="" (empty separator)!
import sqlite3
import pandas as pd
connlite = sqlite3.connect(the_sqlite_db_name)
r = pd.read_sql_query(sql_cmd, connlite, params=sql_params) # sql_params is a tuple like (1234,)
connlite.close()
d = pd.DataFrame([['a','r','t',3,5],['a','r','t',-1,-3],['a','r','s',2,4]])
# or
df = pd.DataFrame([[1,2.2,"abcd"],[2,3.5,"efgh"],[3,6.3,"ij"]])
df.columns=["a","b","c"]
df
df.info() # gives some basics about the data
df.head(10) # shows first 10 rows
df.tail(10) # shows last 10 rows
df.columns # the columns
df.shape # see the number of rows and columns
df.columns.values
df.index.values
len(df.index) # ==> number of rows
df.values.tolist()  # Transform into a list of lists, in the form of a list of rows (as opposed to dataframes, which are organized as a list of columns). I think this can be used for bulk SQL insert (but I have to test this)
df.columns.values.tolist()  # List of column headers
[df.columns.values.tolist()] + df.values.tolist()  # Make it look like a table (verify this snippet, not sure it works)
d = pd.DataFrame({ 'n': [1,2,3,4,5,6,7,8,9,10]
, 'a': ['boat','boat','boat','plane', 'plane','boat','boat','boat','plane', 'plane',]
, 'b': ['red','red','red','blue','blue','red','red','red','blue','blue']
, 'c': ['big','big','small','small','small','big','big','small','small','small']
, 'x': [3,4,-1,5,6,-1,5,6,3,4]
, 'y': [2,-3,-2,2,1,2,2,1,0,4]})
d.groupby(by=['a','b','c'])[['x','y']].sum()
d['x'].rolling(3).sum()
d.sort_values(by=['n'], ascending=False)
d.sort_values(by=['n'], ascending=False).groupby('a')[['x','y']].rolling(3).sum()
d.sort_values(by=['n'], ascending=False).groupby(['a','b'])[['x','y']].rolling(3).sum()
Get the element on the third row of a column 'AA'. Notice square brackets.
Notice index starts at 0. This assumes reset_index or numbering 0,1,2, ...
df['AA'].iloc[2]
df.iloc[:,0]  # returns the first column. Notice the "i" in "iloc"
df.iloc[0,:]  # returns the first row. Also with "iloc"
The following leads to "chained indexing":
df = df [ ['ISIN', 'Date']]
Do this instead:
df = df.loc[:,['ISIN', 'Date']]
Note:
df.loc[:,'col']    # returns a series, not a dataframe
df.loc[:,['col']]  # returns a dataframe (two square brackets)
df[["col2", "col5", "col1", "col2"]]  # Reorder columns, repeats are allowed (notice double square brackets)
Copy only the selected columns
col_lst = ["a", "b", "c", "src"]
df_copy = df[col_lst].copy()
or
df_copy = df[["a", "b", "c", "src"]].copy()
Rename columns:
df.rename(columns={'old_col1':'new_col1', 'old_col2':'new_col2'})
Append one to another (same structure)
df = pd.concat([df1,df2])
df = pd.concat([df, df1.loc[:,non_null_raw_columns_hs]])
Do NOT do this: df = df.append(df1[non_null_raw_columns_hs])
In place, vs assign to new dataframe (however, not all functions allow "inplace")
df = df.something(...)
df.something(..., inplace=True)
Assign to all rows
df['Country'] = "aaa"
Changes to a single column
df["str_col"] = df["str_col"].str.title()
Update Existing vs New Column:
df['c'] = df['c'].str.upper()   # modify existing column as upper case of existing column
df['cc'] = df['c'].str.upper()  # add a new column as upper case of existing column
Add a field
df['newcol'] = "the_source"
df['newcol'] = df['firstcol'] + df['secondcol']
df["meas_ratio"] = df["meas1"] / df["meas2"] #
no error if null
Replace nulls with something, but only for certain rows:
columns_for_which_null_should_be_na = {"A": 0, "B": 'N.A.'} #
"A" ... are the column headers
df.fillna(value=columns_for_which_null_should_be_na)
DataFrame.fillna(value=None, method=None, axis=None, inplace=False, limit=None, downcast=None)[source]
df.isna() equiv to df.isnull()
Set the data type:
df["col"] = df["col"].astype("str")
.apply(f, axis=1)  # f is a function that should have one parameter, which is a row
Note: according to one person on the web, apply is slower than a loop
df['newcol'] = df.apply(lambda row: row['firstcol'] * row['secondcol'], axis=1)
df['Discounted_Price'] = df.apply(lambda row: row.Cost - (row.Cost * 0.1), axis=1)
Equivalent without "apply":
df['Discounted_Price'] = df['Cost'] - (0.1 * df['Cost'])
Update columns in df based on indices in df_other. DOES NOT JOIN, it just copies data in order of the index
df.update(df_other)
By default: join='left', overwrite=True
If I want to only overwrite the NA, then set overwrite=False
A filter_func allows further filtering
DataFrame.update(other, join='left', overwrite=True, filter_func=None, errors='ignore')
Parameter "on" (type str or list of str): column(s) to join. If None, then joins index-on-index
Other parameters are : lsuffix='', rsuffix='', sort=False
Options for "how" are: 'left' (default), 'right', 'outer', 'inner'
When not joined, values show as NaN
df.join(df_other, on=None, how='left')
Force a key
df.join(df_other.set_index('key'), on='key', how='left')
Uses df_other's index but any column in df: this preserves the original DataFrame's index in the result.
df.join(df_other, on="original_index", rsuffix="_joined", how="left")
df.join(df_other.set_index('col1'), on='col1', how='left')
Join two dataframes by specified columns (as opposed to join, which joins on indexes)
df.merge(df_other, how="inner", left_on=["a", "b"], right_on=["aa", "bb"], suffixes=("_left", "_right"), copy=False, indicator=True)
Parameter copy=True by default, but I am guessing False leads to fewer issues, based on the doc.
The indicator adds a column giving the source of each row.
df.merge( df_other \
, left_on = ["cola", "colb", "colc"] \
, right_on = ["colA", "colB", "colC"] \
, suffixes = ("", "_right") \
, how = "left" \
, copy = True \
)
df.set_index("col") #
The column is removed
df.set_index(["col1", "col2"]) #
The columns are removed
df.set_index("col", drop=False) #
The column is kept as a column too
df.set_index(pd.Index([..list of values..])) #
Create the index from a list of values (needs intermediate step of making it an Index object)
Keep the old index, and re-apply when needed (to be confirmed):
original_indx = df.index
df.reset_index(inplace = True)
df.set_index("original_indx")
df.reset_index(inplace=True)
For example, after concatenating dataframes, the original indexes are kept.
Otherwise the indices from each original df will show 0 multiple times, 1 multiple times, ...
Do reset_index after append so as to have a unique index.
And, as a reminder because it is not intuitive, the indexes identify the rows in Python.
df.reset_index(drop=False)  # The old index becomes a column
df[df["col"]=="aa"] #
returns rows with "aa" in col
df[len(df[a col])<4]
df[df["col"]=='a_value']
df[df["adate"]<date_limit] #
I formatted as a string yyyy-mm-dd
SQL-like queries of the dataframe
df = df.query("col < 4.5 & col > 4")
df.mask is similar to df.where, but careful: where replaces the elements for which the condition is False, while mask replaces those for which it is True.
df.isin(("val1", "val2", "val3", ...))  # ==> true where the element is in the values
df.drop(some labels)
df.drop(df[<some boolean condition>].index)
df.drop(df[df["col"]=="aa"].index) #
drops rows with aa in col
df.drop(df[..see examples under 'filter' above..].index)
df.sort_values(by=["cold"], ascending=True).groupby(by=["cola", "colb", "colc"])[["meas1", "meas2"]].sum()
.groupby sorts by default. Therefore sorting before grouping does not make sense.
Group by does three things: split a table into groups, apply some operations to each of those smaller tables (i.e. the aggregation function), and combine.
In "for g, f in df.groupby(...):", each iteration has two parts: the group key and the corresponding frame.
df.sort_values(by=["cold"], ascending=True).groupby(by="col")["meas"].sum()
.sum()
.rolling(12).sum()
.min()
.mean()        # not 'avg'
.agg('mean')   # not 'avg'
.size()        # includes NaN
.count()       # excludes NaN
.sum().nlargest(10)  # returns the 10 largest (aggregation on top of aggregation)
.nunique()           # number of unique values
.str.contains("a-str")        # Returns a true or false for each row
.str.contains("a-str").sum()  # Returns the count of rows containing "a-str"
It may be necessary to add .reset_index() because the columns of the groupby become the index.
reset_index()  # makes them columns again
df.groupby(by=["a", "b"])[["m1", "m2"]].sum().reset_index()
Same, but without group by, in other words across all rows:
df[["col1", "col2"]].agg('max')
df["col"].max()
df["col1"].unique()
Get the unique values of one column
Split a dataframe into multiple dataframes based on the value of a column:
for g, d in df.groupby('col1'):
    print(g, ":\n", d)  # g has one of the group-by values, d has the corresponding values
    # note that the original index is kept
df=dtframe.set_index(['col1', 'col2'])
df.unstack(level=n)  # where n corresponds to the index elements above (0 to index length - 1). Default -1.
df.sort_values(by="the_index", axis="index")
df.sort_values(by=["a", "b"], axis="index")
col.value_counts()  # Returns a Series, sorted by count descending
df["str_col"].str.lower()
Note that .str is a method for series: works when the data type of a column is string
df["col"].str.title() #
title case
df["col"].str.lower()
.str.contains("a-str") #
Returns a true or false for each row
import numpy as np
np.logical_and(arr1, arr2)
np.logical_or(arr1, arr2)
df.pivot cannot handle aggregation
df.pivot_table has to have numerics
df.unstack is similar
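A minimal pivot_table sketch on a made-up dataframe (the column names are just for illustration):
import pandas as pd
df = pd.DataFrame({"a": ["boat", "boat", "plane", "plane"],
                   "b": ["red", "blue", "red", "blue"],
                   "x": [1, 2, 3, 4]})
print(df.pivot_table(index="a", columns="b", values="x", aggfunc="sum", fill_value=0))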
import datetime
print(datetime.datetime.now().isoformat(sep='T', timespec='microseconds'))
import time
def show_progress(last_ts=None, label=None):
    ts = time.perf_counter()
    if last_ts:
        print("Elapsed seconds:", ts - last_ts, " Last time: ", str(last_ts), " Current time: ", str(ts), " - ", label)
    else:
        print("Current time: ", str(ts), " - ", label)
    return ts
import pyodbc
cn = pyodbc.connect('DRIVER={ODBC Driver 17 for SQL Server};SERVER='+sqls_svr+';DATABASE='+sqls_db+';Trusted_Connection=yes')
csr=cn.cursor()
csr.execute("SELECT @@version as version;")
cn.close()
df = pd.read_parquet("file_or_directory")
df.to_csv("output_file", index=False, sep="\t", compression="gzip") # index=False suppresses the added index column
# compression to .gz
df['new_date'] = df.year.astype(str) + '-' + df.month.astype(str).str.zfill(2) + '-' + df.day.astype(str).str.zfill(2)
df.dtypes : returns the types
Doc:
Unless shown otherwise, all expressions shown below return a dataframe
df_new = df. . . .
df = df. . . .
df.show()
Dataset: distributed collection of items
Collection of rows
Called DataFrame, but it is not a Pandas DataFrame
A DataFrame is a Dataset organized into named columns. Conceptually equivalent to a table or a data frame in Python
Data frames are implemented on top of RDDs. RDDs are immutable and their operations are lazy.
Transformations on RDDs return new RDDs. However, nothing is calculated.
Actions, such as collect(), trigger the actual computation. Otherwise, the computation is not done.
Shared variables:
Clusters:
High concurrency
Standard (recommended for single users)
single node, just for explore
Cluster mgr: connects to all nodes
Driver: has the SparkContext
Worker Node: has an executor. The executor performs the tasks
pool: allows clusters just released to be reused
driver node can be of a different type than the worker nodes. Generally, keep the same
select the latest runtime possible (LTS)
high concurrency for shared clusters, and standard for single users
enable auto-termination
use AWS spot instances if possible
Generally, prefer fewer but larger nodes, because many operations cannot be run in parallel, instead of many smaller nodes
DBU=processing capability per hour
jobs compute: lowest rate, for automated jobs
sql compute: idem, but not always available
all-purpose: higher rate, high concurrency
spot: lower costs , but can be terminated when prices go up
Compare Spark to MapReduce (not to HDFS=Hadoop Distributed File System)
Spark can also use HDFS
Most of the improvement of Spark over MapReduce is the fact that Spark does not have to write to disk at the end of every operation
pipenv install pyspark
sudo apt install default-jre
(sudo apt install scala)
(sudo apt install py4j)
Note:
Set SPARK_LOCAL_IP if you need to bind to another address
Set export SPARK_LOCAL_IP=127.0.0.1
127.0.0.1 did not seem to work
hostname resolves to a loopback address: 127.0.1.1; used 192.168.0.100 instead (IP in local network)
Initialize with SparkSession
from pyspark.sql import SparkSession
spark = SparkSession.builder.getOrCreate()
spark = SparkSession.builder.appName('a-name').getOrCreate()
In the pyspark shell, this is not necessary as the session is automatically stored
in the spark
variable
Both lines above initialize the spark session. The second line associates it with an application.
Only one of the two lines is necessary.
Get functions:
import pyspark.sql.functions as F
As with anywhere else in Python, do dir(obj) to see the methods and help(obj) to get more details (if available).
In a jupyter notebook, do object? (the object followed by a question mark).
Get documentation on parameters:
print(an_object.explainParams().replace("\n", "\n\n"))
For individual parameters, the following gives documentation. The object can be the instance or the model.
an_object.param
Generally, set the value with set...() and get the value with get...().
df2= spark.createDataFrame([ [1, 2., 'abcd']
, [2, 3., 'efgh']
, [3, 5., 'ij']
]
, schema='a long, b double, c string'
)
from pyspark.sql import Row
df = spark.createDataFrame([ Row(a=1, b=2., c='abcd')
, Row(a=2, b=3., c='efgh')
, Row(a=3, b=5., c='ij')
]
, schema='a long, b double, c string'
)
df = spark.createDataFrame([
['red', 'banana', 1, 10], ['blue', 'banana', 2, 20], ['red', 'carrot', 3, 30],
['blue', 'grape', 4, 40], ['red', 'carrot', 5, 50], ['black', 'carrot', 6, 60],
['red', 'banana', 7, 70], ['red', 'grape', 8, 80]], schema=['color', 'fruit', 'v1', 'v2'])
df.show()
When all else fails, take apart a data frame:
for one_row in tokenized.select(["words",]).collect():
    print(one_row["a_column_name"])
Extract column values:
for one_col in one_row:
    print("Column in one row:", type(one_col), one_col)
df.count()
df.show(truncate=False)
df.value.contains("Sp"))
(verify syntax; is "contains" only for pandas?)
df.first()
.cache()  # cache the data for each re-use
See the structure with: df.printSchema()
from pyspark.sql.types import StructField, StructType, StringType, TimestampType, IntegerType, BooleanType, DoubleType
data_schema = [StructField('age', IntegerType(),True) # name, type, boolean as nullable (True means nullable)
,StructField('name', StringType(), True)
]
final_struct = StructType(fields=data_schema)
df = spark.read.json('....', schema=final_struct)
# summary
df = spark.read.json('....', schema=StructType(fields=[StructField('age', IntegerType(),True) # name, type, boolean as nullable
,StructField('name', StringType(), True)
]))
Note that letting pyspark infer the schema usually works better.
df.printSchema() #
also df.schema, but this is for passing as an argument
df.dtypes
gives a list of tuples (column_name, type)
df.columns #
show columns (this is an attribute, not a method). See also under "Columns"
df.describe().show() #
See also below
df.head() #
See also under "Get Rows"
df.show(truncate=False)
df.select(df.columns).describe().show()
or just:
df.describe().show()
(this may only show numeric columns)
Format the numbers and the headers:
dscr = df.describe()
dscr.select( dscr["summary"]
, f.format_number(dscr["Open"].cast("float"), 2).alias("open")
, f.format_number(dscr["Close"].cast("float"), 2).alias("close")
, f.format_number(dscr["vol"].cast("int"), 2).alias("vol")
).show()
Count values (profiling)
for c in df.columns:
dfc = df.groupBy(c).count()
# print(c, ": ", df.select(c).distinct().count(), sep="") this is the same as next
print(c, ": ", dfc.count(), sep="")
dfc.orderBy(dfc["COUNT"].desc()).show(5)
Count distinct values for all columns:
for c in df.columns:
di = df.select(c).distinct()
if di.count() < 6:
# if not too many distinct values, then show them
print(c, ":") #, str([v for v in di.values()]))
df.groupBy(c).count().show()
else:
# otherwise, show just the count
print(c, ": ", df.select(c).distinct().count(), sep="")
Count all rows:
print(df.count())
Equivalent to SELECT DISTINCT
df.select(["CNTRY_NM", "CNTRY_CD"]).distinct()
Equivalent to SELECT COUNT(DISTINCT ...)
df.select(["CNTRY_NM", "CNTRY_CD"]).distinct().count() # distinct().count() returns a number
df.show()
df.show(1) #
top 1 row
df.show(1, vertical=True) #
Display vertically. Useful for long values
df.head(n) #
list of row objects (number optional)
df.take(n) #
list of row objects (same as head)
df.tail(1) #
list of row objects
df.head(n) -->
list of row objects (number optional)
df.head(n)[0] -->
row object
df.head(n)[0][0] -->
first value
for row in df.head(n):.... #
in case a loop is really necessary...
result = df.filter(...).collect() #
use this to keep the data for future use. List of row objects.
result[0] #
First row as row object
result[0].asDict()
-> dictionary for one row
(where result = ....collect()
as shown above)
Most useful:
df.head().col --> value in column 'col' on the first row
df.head(2)[1].col --> value in column 'col' on the 2nd row
for a in df.head(n)
a.col --> value in column 'col' in the first n rows (also tail)
df['col'] --> column object
df.col --> column object (dot notation will not work for field names with spaces, or names like reserved words)
df.select('col') --> dataframe with a single column (and I can do .show())
df.select(['col','col2']) --> dataframe, here with two columns
(df.select(df.col) --> dataframe with column selected via its column object. Better to use .select([...]))
(df.select(df.col1, df.col2) --> dataframe with two column objects. Better to use .select([...]))
Column objects don't really give you data. If you want to see the data for a column object "col", put it back into the dataframe like this: "df.select(col).show()"
Add columns:
df2 = df.withColumn('cc', upper(df.c))
# 'cc' is new col name, second argument is a column object (not a dataframe object)
df2 = df.withColumn("Ratio", df["A"]/df["B"]).select(f.format_number("Ratio",2)) #
Add column and format
Rename columns:
df2 = df.withColumnRenamed('old_name', 'new_name') #
just rename
Alias:
df.select(avg('col').alias('alias-to-display'))
Drop column:
df.drop('col')
import pyspark.sql.types as T
df.a.cast(T.StringType())
Unless shown otherwise, all expressions shown below return a dataframe
import pyspark.sql.functions as F
F.upper(...) or str.upper()
upper, lower, length, ascii, base64, unbase64, trim, ltrim, rtrim, instr(str,substr), substring(str,pos,len), split(str,pattern,limit=-1)
concat, concat_ws(sep,*cols)
format_number(col,d) #
where d is the number of decimals
format_string(format,*cols)
F.lit(n)
or F.lit("a string")
: spreads the literal down the columns
df.select(F.cos(df["col"])+F.lit(2))
: calculation on a column
df.select(F.to_date(F.concat_ws("", F.lit("2024/01/"), F.to_char(df["v1"], F.lit("09"))), "yyyy/MM/dd"))
: calculation on a column. Notice lit(format) for to_char, but not for to_date.
from pyspark.sql.functions import dayofmonth,month,year,weekofyear,dayofweek
from pyspark.sql.functions import hour,dayofyear
from pyspark.sql.functions import format_number,date_format
df.select(['Date', dayofmonth(df['Date'])]).show()
Date stored as : datetime.datetime(2022,4,25,0,0)
df.withColumn("Year",year(df['aDate'])).show() #
add a column
size
split
Keep as nulls
Or drop the rows
df.na.drop() #
drop any row with any null
df.na.drop(thresh=2) #
drop rows that have fewer than 2 non-null values (keep rows with at least 2)
df.na.drop(how='all') #
drop if all are null
df.na.drop(subset=['col']) #
drop if values in col are null
Or fill in with values
df.na.fill(0) #
replace null with 0 in numeric columns
df.na.fill('') #
replace null with '' in string columns
df.na.fill('n/a', subset=['col'])
Example of filling the missing values with the average value of the column:
import pyspark.sql.functions as f
avg_value = df.select(f.mean(df['the_column'])).collect()[0][0]
df.na.fill(avg_value, subset=['the_column'])
Equivalent to NVL
(See "CASE WHEN ELSE"
for replacing with a value from another column)
df_countries = df.select(["CNTRY_NM", "CNTRY_CD", "CRCD"]).distinct().na.fill("NULL", subset=["CNTRY_CD"])
one_df.join(other_df, on=None, how=None)
on: str, list, or Column
a string for the join column name, a list of column names, a join expression (Column), or a list of Columns
how (string): inner (default), cross, outer, full, fullouter, full_outer,
left, leftouter, left_outer, right, rightouter, right_outer, semi,
leftsemi, left_semi, anti, leftanti and left_anti
examples for "on":
name #
column must be on both sides
one_df.name == other_df.name
[one_df.name == other_df.name, one_df.dob == other_df.dob]
Equivalent to JOIN
df_crcd.join(df_avg_funding_rcvd, on=["CRCD", "YEAR"], how="left").join(df_avg_tgt_peo, on=["CRCD", "YEAR"], how="left")
Concatenate two dataframes (vertically, with same structure):
df_new = df1.union(df2)
Unless shown otherwise, all expressions shown below return a dataframe
df.filter(df.a == 1).show() #
column object in the filter
df.filter("a = 1").show() #
condition in sql syntax
When combining conditions, use "&" "|" "~"
for "and", "or", and "not"
and surround each individual condition with "(...)"
These two are equivalent:
df.filter("Close < 60").count()
df.filter(df["Close"] < 60).count()
And these two:
100.0 * df.filter("High > 80").count()/df.count()
100.0 * df.filter(df["High"] > 80).count()/df.count()
Equivalent to CASE WHEN ELSE. Use multiple .when if needed (see the sketch below).
from pyspark.sql.functions import when
df = df.withColumn("CRCD", when(df["CD"].isNull(), df["NM"]).otherwise(df["CD"]))
Another option, not tried:
from pyspark.sql.functions import coalesce
df.withColumn("B",coalesce(df.B,df.A))
Equivalent to WHERE
df.filter(df["CNTRY_NM"] == "Mali").show()
Remove rows where value is 0
df.filter(df["AVG_FUNDING"] != 0).filter(df["AVG_PEOPLE"] != 0)
df.orderBy('col').show() #
ascending
df.orderBy(df['col'].desc()).show() #
descending
df.orderBy('col', ascending=True).show() #
ascending
df.orderBy('col', ascending=False).show() #
descending
df.orderBy(df["High"].desc()).head(1)[0][0]
df.groupby('col_dim').avg('meas1','meas2').show()
The .groupby()
returns a GroupedData object, and the aggregation methods
return dataframes:
i.e. a.groupby("col").mean()
returns a dataframe.
Other aggregates: avg(), max(), count(), sum()
(all with ())
df.groupBy("col").count().orderBy("count")
# show counts by col, and sort
Aggregate without group by:
The argument is a dictionary, of which the
keys are the column names and values are the desired aggregation
df.agg({'col': 'sum'}).show()
Note that the doc says df.agg(...)
"shorthand for"
df.groupby().agg(...)
i.e. using an empty groupby method
Combine the two methods described above (groupby and agg):
df.groupby('col_dim').agg({'col': 'sum'}).show()
Another option:
from pyspark.sql.functions import countDistinct,avg,stddev
df.select(avg('Sales').alias('Avg Sales')).show()
Pyspark.sql.GroupedData functions: agg, apply, applyInPandas, count, pivot. And avg, mean, max, min, sum
df.select(f.max("Volume"), f.min("Volume")).show()
# this does aggregation with the functions max and min
Get the max (or other aggregate value):
df.agg({"col_name": "max"}).head(1)[0][0]
Pivot:
df.groupBy(...).pivot("pivot-column", [list of values that will become columns]).aggr_fctn
If the list of values is not provided, the process first determines the list.
It is therefore more efficient to provide the list if it is known
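For example, with the color/fruit dataframe created earlier, this pivots the fruit values into columns (sketch):
df.groupBy("color").pivot("fruit", ["banana", "carrot", "grape"]).sum("v1").show()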
Unpivot:
from pyspark.sql import functions as F
stack_expr = "stack(number-of-columns, 'col1', col1, 'col2', col2, ... , 'coln', coln) as (col-where-cols-will-go, col-where-values-will-go)"
un_pivot_df = df.select("non-pivoted-col", F.expr(stack_expr)).where("col-where-values-will-go is not null")
The ".where(...)
" removes the rows with null and can be ommitted
df.filter(...).select(['c1', 'c2']).groupby('col_dim').avg('meas1','meas2').show()
???: df.orderBy(...).filter(...).select(['c1', 'c2']).groupby('col_dim').avg('meas1','meas2').show()
df.select(['c1', 'c2']).where("`c1` like 'abc%'")
(notice the backticks around the column name in the where)
df.filter(df["High"] == df.agg({"High": "max"}).head(1)[0][0]).select("Date").head(1)[0][0]
df.withColumn("Year", f.year(df["Date"])).groupby("Year").max("High").orderBy("Year").show()
df.withColumn("Month", f.month(df["Date"])).groupby("Month").avg("Close").orderBy("Month").show()
Select the date where a column has the maximum value:
df.filter(df["High"] == df.agg({"High": "max"}).head(1)[0][0]).select("Date").show()
Equivalent to WHERE, GROUP BY, AVG, AS
(renamed columns, alias)
Notice parentheses and operands of or ("|"), and ("&"), and not ("~")
df_avg_tgt_peo = df.filter((df["METRIC"] == "People in need") | (df["METRIC"] == "People targeted")).groupBy(["YEAR", "CRCD"]).avg("VALUE").withColumnRenamed("AVG(VALUE)", "AVG_PEOPLE")
Make the data frame look like a table:
df.createOrReplaceTempView("tablea")
("registers" the dataframe as a table)
Now, leverage the SQL syntax to perform queries.
Remember that, as with other spark operations, the data is not evaluated until a .show()
or
other action is performed.
spark.sql("SELECT count(*) from tablea").show()
dfnew = spark.sql("SELECT count(*) from tablea")
Some Equivalents
PySpark | spark.sql(...)
---|---
.groupBy("prediction").count().show() | select count(*), prediction from tb group by prediction
Generally, no additional imports needed
Text file:
textFile = spark.read.text("zen.txt")
df.withColumn('o', F.concat_ws('|', df.aa, df.b)).select('o').write.text('abc.txt')
CSV file:
df.write.csv('ff.csv', header=True)
spark.read.csv('ff.csv', inferSchema=True, header=True).show()
Parquet file:
df.write.parquet('ff.parquet')
spark.read.parquet('ff.parquet').show()
Patterns:
Pattern with just transform:
from pyspark.sql.... import TheClass
the_class_inst = TheClass(...)
the_class_out = the_class_inst.transform(input_data)
Pattern with fit and transform:
from pyspark.sql.... import TheClass
the_class_inst = TheClass(...)
the_class_model = the_class_inst.fit(input_data)
the_class_out = the_class_model.transform(input_or_test_data)
Pattern with fit and transform on same line:
from pyspark.sql.... import TheClass
the_class_inst = TheClass(...)
the_class_out = the_class_inst.fit(input_data).transform(input_data)
Split between training and test data
trn_dta, tst_dta = dta.randomSplit([0.7, 0.3])
VectorAssembler: assemble a vector column from multiple columns
from pyspark.ml.feature import VectorAssembler
inst = VectorAssembler(inputCols=["...", "...", ], outputCol = "features")
# notice plural in inputCols; the parameter is a list
data_out = inst.transform(input_data)
Min-max Scaler: normalize the features so that one is not weighted more than others just because of the scale
from pyspark.ml.feature import MinMaxScaler
from pyspark.ml.linalg import Vectors
df = spark.createDataFrame(
[(1, Vectors.dense([0.2, 2043, 30])),
(2, Vectors.dense([0.3, 2187, 33])),
(3, Vectors.dense([0.1, 2209, 29])),
],
["id", "input_data"]
)
scaler_inst = MinMaxScaler(inputCol="input_data", outputCol="features")
scaler_model = scaler_inst.fit(df)
scaler_out = scaler_model.transform(df)
Standardizer: standardize, so that data has mean 0 and variance 1.
Note that a few values may be less than -1 or greater than 1.
from pyspark.ml.feature import StandardScaler
from pyspark.ml.linalg import Vectors
df = spark.createDataFrame(
[(1, Vectors.dense([0.2, 2043, 30])),
(2, Vectors.dense([0.3, 2187, 33])),
(3, Vectors.dense([0.1, 2209, 29])),
],
["id", "input_data"]
)
scaler_inst = StandardScaler(inputCol="input_data", outputCol="features", withMean=True, withStd=True)
scaler_model = scaler_inst.fit(df)
scaler_out = scaler_model.transform(df)
Bucketizer: group data into buckets, based on splits
from pyspark.ml.feature import Bucketizer
splits = [
-float("inf"),
-5.0,
0.0,
5.0,
float("inf"),
]
df = spark.createDataFrame(
    [(-6.0,),
     (-3.1,),
     (-2.4,),
     ( 0.0,),
     ( 1.5,),
     ( 2.4,),
     ( 7.0,),
    ],
    ["input_data"]
)
bucktz_inst = Bucketizer(splits=splits, inputCol="input_data", outputCol="features")
bucktz_out = bucktz_inst.transform(df)
StringIndexer: transform strings into indexes, because pyspark ML cannot take strings
from pyspark.ml.feature import StringIndexer
indexer_inst = StringIndexer(inputCol="nm", outputCol="nm")
indexer_model = indexer_inst.fit(input_data)
indexer_out = indexer_model.transform(input_data)
StandardScaler: scale data so that large values do not skew
from pyspark.ml.feature import StandardScaler
scaler_inst = StandardScaler(inputCol="nm", outputCol="nm", withMean=True, withStd=True)
scaler_model = scaler_inst.fit(input_data)
scaler_out = scaler_model.transform(input_data) # notice that the same input data is used for fitting and transforming
Tokenizer: separate a text into a list of words. Note that punctuation has to be removed in a separate operation
from pyspark.ml.feature import Tokenizer
df = spark.createDataFrame( # "Tale of Two Cities" by Charles Dickens
[(1, "It was the best of times, it was the worst of times, "),
(2, "it was the age of wisdom, it was the age of foolishness, "),
(3, "it was the epoch of belief, it was the epoch of incredulity, "),
(4, "it was the season of Light, it was the season of Darkness, "),
],
["id", "sentence"]
)
t = Tokenizer(inputCol="sentence", outputCol="words")
output_data = t.transform(df)
RegexTokenizer: Use regex to separate into words
from pyspark.ml.feature import RegexTokenizer
t = RegexTokenizer(inputCol="nm", outputCol="nm", pattern="\\W")
output_data = t.transform(input_data)
StopWordsRemover: remove stop words
from pyspark.ml.feature import StopWordsRemover
r = StopWordsRemover(inputCol="nm", outputCol="nm")
output_data = r.transform(input_data)
N-Grams
from pyspark.ml.feature import NGram
n = NGram(n=2, inputCol="nm", outputCol="nm")
output_data = n.transform(input_data)
TF IDF: term frequency, inverse document frequency. The CountVectorizer can also be used. Requires tokenized data.
from pyspark.ml.feature import HashingTF, IDF
tf = HashingTF(inputCol="tokenized_words", outputCol="nm_out1", numFeatures=20) # numFeatures is the number of features, not mandatory. Usually much larger
output1_data = tf.transform(input_data)
idf = IDF(inputCol="nm_out1", outputCol="nm")
idf_model = idf.fit(output1_data)
output2_data = idf_model.transform(output1_data)
Count Vectorizer
from pyspark.ml.feature import CountVectorizer
cv = CountVectorizer(inputCol="nm", outputCol="nm", vocabSize=20, minDF=2)
# vocabSize is vocabulary size, approx 20 (I did not count)
# minDF is the minimum frequency for the word to be taken into account. Here, with 2, each word has to be in at least 2 documents(???)
model = cv.fit(input_data)
output_data = model.transform(input_data)
Pipeline
from pyspark.ml import Pipeline
pipeline_inst = Pipeline(stages=[inst1, inst2, ])
pipeline_model = pipeline_inst.fit(input_data)
output_data = pipeline_model.transform(input_data)
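A minimal pipeline sketch combining some of the feature stages above, assuming a dataframe with a 'sentence' column like the Tokenizer example:
from pyspark.ml import Pipeline
from pyspark.ml.feature import Tokenizer, StopWordsRemover, HashingTF
tok = Tokenizer(inputCol="sentence", outputCol="words")
swr = StopWordsRemover(inputCol="words", outputCol="clean_words")
tf = HashingTF(inputCol="clean_words", outputCol="features", numFeatures=1000)
pipeline_model = Pipeline(stages=[tok, swr, tf]).fit(df)
pipeline_model.transform(df).select("features").show(truncate=False)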
Linear Regression Usually the best starting model for predicting numerical data
from pyspark.ml.regression import LinearRegression
lr_inst = LinearRegression(maxIter=10, featuresCol="nm", labelCol="nm", predictionCol="nm", regParam=0.3, elasticNetParam=0.8)
lr_model = lr_inst.fit(trn_dta)
lr_model.coefficients # if close to zero (and values not too large), then the corresponding variable does not contribute much
lr_model.intercept
lr_pred_dta = lr_model.transform(dta)
Decision Tree Regression
from pyspark.ml.regression import DecisionTreeRegressor
dtr_inst = DecisionTreeRegressor(featuresCol="nm", labelCol="nm")
dtr_model = dtr_inst.fit(trn_dta)
dtr_pred_dta = dtr_model.transform(dta)
Gradient Boosted Tree Regression
from pyspark.ml.regression import GBTRegressor
gbtr_inst = GBTRegressor(featuresCol="nm", labelCol="nm")
gbtr_model = gbtr_inst.fit(trn_dta)
gbtr_pred_dta = gbtr_model.transform(dta)
tst_rslt = lr_model.evaluate(tst_dta)
tst_rslt.r2
tst_rslt.meanAbsoluteError
tst_rslt.rootMeanSquaredError
tst_rslt.meanSquaredError
lr_model.summary.r2
lr_model.summary.meanAbsoluteError
lr_model.summary.rootMeanSquaredError
lr_model.summary.meanSquaredError
lr_model.summary.rootMeanSquaredError / lr_model.intercept # error relative to scale
Regression Evaluator
from pyspark.ml.evaluation import RegressionEvaluator
rmse_eval = RegressionEvaluator(labelCol="nm", predictionCol="nm", metricName="rmse")
rmse = rmse_eval.evaluate(pred_dta)
lr_inst = LinearRegression(maxIter=10, featuresCol="nm", labelCol="nm", predictionCol="nm", regParam=0.3, elasticNetParam=0.8)
lr_model = lr_inst.fit(trn_dta)
lr_model.coefficients # if close to zero (and values not too large), then the corresponding variable does not contribute much
lr_model.intercept
pred_dta = lr_model.transform(tst_dta)
Logistic Regression
from pyspark.ml.classification import LogisticRegression
lr_inst = LogisticRegression(featuresCol="nm", labelCol="nm", predictionCol="nm")
lr_model = lr_inst.fit(trn_dta)
lr_model.coefficients
tst_rslt = lr_model.transform(tst_dta)
pred = lr_model.transform(dta)
Naive Bayes
from pyspark.ml.classification import NaiveBayes
nb_inst = NaiveBayes(featuresCol="feature", labelCol="label", predictionCol="prediction", modelType="multinomial")
# same pattern with model = inst.fit(); rslt = model.transform()
# Multinomial model type when there are multiple categories
Decision Trees
from pyspark.ml.classification import DecisionTreeClassifier
dtc = DecisionTreeClassifier(featuresCol="nm", labelCol="nm", predictionCol="nm", maxDepth=5, maxBins=32)
# same pattern with model = inst.fit(); rslt = model.transform()
dtc_model.numNodes
dtc_model.featureImportances
Random Forest
from pyspark.ml.classification import RandomForestClassifier
rtc = RandomForestClassifier(featuresCol="nm", labelCol="nm", predictionCol="nm", numTrees=20)
# same pattern with model = inst.fit(); rslt = model.transform()
GBT Classifier
from pyspark.ml.classification import GBTClassifier
gbt = GBTClassifier(featuresCol="nm", labelCol="nm", predictionCol="nm")
# same pattern with model = inst.fit(); rslt = model.transform()
Perceptron Type of neural network
from pyspark.ml.classification import MultilayerPerceptronClassifier
layers = [n_inputs, 5, 5, n_categories]
# layers is list of number of neurons per layer.
# First layer: same as the number of inputs
# Last layer: same as the number of categories
mlp_inst = MultilayerPerceptronClassifier(layers=layers)
# same pattern with model = inst.fit(); rslt = model.transform()
Binary Classifier
from pyspark.ml.evaluation import BinaryClassificationEvaluator
bin_clsf_inst = BinaryClassificationEvaluator(rawPredictionCol="nm", labelCol = "nm")
bin_clsf_inst.evaluate(tst_rslt)
Multi-class classifier (offers more metrics)
from pyspark.ml.evaluation import MulticlassClassificationEvaluator
acc_eval = MulticlassClassificationEvaluator(labelCol="label", predictionCol="prediction", metricName="accuracy")
precision_eval = MulticlassClassificationEvaluator(labelCol="label", predictionCol="prediction", metricName="precisionByLabel")
recall_eval = MulticlassClassificationEvaluator(labelCol="label", predictionCol="prediction", metricName="recallByLabel")
true_pos_eval = MulticlassClassificationEvaluator(labelCol="label", predictionCol="prediction", metricName="truePositiveRateByLabel")
false_pos_eval = MulticlassClassificationEvaluator(labelCol="label", predictionCol="prediction", metricName="falsePositiveRateByLabel")
acc = acc_eval.evaluate(test_results)
precision = precision_eval.evaluate(test_results)
recall = recall_eval.evaluate(test_results)
true_pos_rate = true_pos_eval.evaluate(test_results)
false_pos_rate = false_pos_eval.evaluate(test_results)
Code for Confusion Matrix
See https://en.wikipedia.org/wiki/Confusion_matrix
test_results.createOrReplaceTempView("test_result")
the_sql = """
select count(*)
, label
, prediction
from test_result
group by label, prediction
order by label, prediction
"""
pred_val = spark.sql(the_sql)
pred_val.show()
for one_row in pred_val.collect():
if one_row["label"] == 0 and one_row["prediction"] == 0:
true_neg = one_row["count(1)"]
if one_row["label"] == 0 and one_row["prediction"] == 1:
false_pos = one_row["count(1)"]
if one_row["label"] == 1 and one_row["prediction"] == 0:
false_neg = one_row["count(1)"]
if one_row["label"] == 1 and one_row["prediction"] == 1:
true_pos = one_row["count(1)"]
print(f"|{true_neg+true_pos+false_neg+false_pos:8d}|pred pos|pred neg|")
print(f"|actu pos|{true_pos:8d}|{false_neg:8d}| (true pos |false neg)")
print(f"|actu neg|{false_pos:8d}|{true_neg:8d}| (false pos|true neg )")
print(f" ")
print(f"True positive rate, recall, sensitivity: {true_pos/(true_pos+false_neg)}")
print(f"False negative rate: {false_neg/(true_pos+false_neg)}")
print(f"False positive rate, probability of false alarm, fall-out: {false_pos/(false_pos+true_neg)}")
print(f"True negative rate, specificity, selectivity: {true_neg/(false_pos+true_neg)}")
print(f" ")
print(f"Positive predictive value, precision: {true_pos/(true_pos+false_pos)}")
print(f"False omission rate: {false_neg/(false_neg+true_neg)}")
print(f"False discovery rate: {false_pos/(true_pos+false_pos)}")
print(f"Negative predictive value: {true_neg/(false_neg+true_neg)}")
print(f" ")
print(f"Accuracy: {(true_pos + true_neg)/(true_neg+true_pos+false_neg+false_pos)}")
K Means Find clusters
from pyspark.ml.clustering import KMeans
kmeans_inst = KMeans(featuresCol="nm", k=2, seed=1) # seed is optional
kmeans_model = kmeans_inst.fit(input_dta)
centers = kmeans_model.clusterCenters() # results in list of arrays
pred = kmeans_model.transform(input_dta) # no labels, so no training vs test
Bisecting K Means Find clusters, with better performance for large data sets
from pyspark.ml.clustering import BisectingKMeans
bkmeans_inst = BisectingKMeans(featuresCol="nm", k=2, seed=1) # seed is optional
bkmeans_model = bkmeans_inst.fit(input_dta)
centers = bkmeans_model.clusterCenters() # results in list of arrays
pred = bkmeans_model.transform(input_dta) # no labels, so no training vs test
abc_model.save("filename.mdl") # no standard extension
abc_model = AbcClassName.load("filename.mdl")
df.toPandas() #
this can cause an out-of-memory error
Note that Pandas DataFrames are "eagerly evaluated", which means that all the data has to fit in memory
import pyspark.pandas as ps
psdf = ps.DataFrame({'id': range(10)}).sort_values(by="id")
psdf.spark.explain()
This is like an explain plan
"Exchange" means that the nodes swap data. Ideally, we do not want this.
"Exchange SinglePartition" means it is using only one partition. Ideally, we want to use all.
def plus_mean(pandas_df):
return pandas_df.assign(v1=pandas_df.v1 - pandas_df.v1.mean())
df.groupby('color').applyInPandas(plus_mean, schema=df.schema).show()
https://www.udemy.com/course/spark-and-python-for-big-data-with-pyspark/learn/lecture/5856256?src=sac&kw=spar#overview
Spark and Python for Big Data with PySpark
https://databricks.com/try-databricks
Training: https://academy.databricks.com/
user guide: https://docs.databricks.com/user-guide/index.html
Community edition, personal
Create a cluster, then create a notebook. Start with "import pyspark"
upload a file
df = sqlContext.sql("select * from the_table")
New workspace
Basic Notebook:
import pyspark
df = sqlContext.sql("select * from . . .;")
# Databricks creates the context automatically
Create EC2 instance, all traffic (restricted to my IP)
Run these on the instance:
sudo apt update
sudo apt install python3-pip
sudo apt install default-jre
java -version
sudo apt install scala
scala -version
pip install pyspark
pip install jupyter
Run these on the instance:
If the pip install pyspark does not work, try these:
pip install py4j
go to spark.apache.org then downloads.
wget https://www.apache.org/dyn/closer.lua/spark/spark-3.3.0/spark-3.3.0-bin-hadoop3.tgz
on EC2 instance:
jupyter notebook --generate-config
sudo openssl req -x509 -nodes -days 365 -newkey rsa:1024 -keyout ~/.ssh/forjupytercert.pem -out ~/.ssh/forjupyterkey.pem
vi ~/.jupyter/. . .. .py # the config file
# add the following:
c = get_config()
c.NotebookApp.certfile = u'/home/ubuntu/.ssh/forjupytercert.pem'
c.NotebookApp.ip = '*'
c.NotebookApp.open_browser = False
c.NotebookApp.port = 8888
# start the jupyter notebook
# and modify the link to put the IP address of the EC2 instance
I encountered the following issues:
Minimal lambda function:
import json

def lambda_handler(event, context):
print("event::", event)
return {
'statusCode': 200,
'body': json.dumps('Hello from Lambda!')
}
PATH variable includes specific folders in the /opt directory.
Layer paths for each Lambda runtime
Python:
Zip into a zip file:
python.zip
lambda_function.py
other_module.py
python/abc.py
python/def.py
The zip file can have any name,
but it must have a python
sub-directory.
The main executable is lambda_function.py
. Other files can exist at the same level.
Import any additional module as import another_module
Zip into a zip file:
python.zip
python/abc.py
python/def.py
python/requests # all files for "requests" package
python/another_package
The zip file can have any name,
but it must have a python
sub-directory.
Upload to a custom layer.
Then, in lambda function, do: import abc, def
Note: abc.py
can call other libraries, such as pandas, as long as pandas is in a layer
Create a layer for packages
Remember to update the version in the lambda functions using the layer.
The packages have to be downloaded in the same environment as AWS Linux. I successfully did this by spinning up an AWS Linux instance.
pip install virtualenv
virtualenv -p /usr/bin/python3.8 py38something
cd py38something
source bin/activate
Install with the --target option (see note below): pip install -t ./python library_name
zip -r a_file_name.zip ./python
The zip must contain the sub-directory "python"
deactivate
Use the "--target"
option so as to isolate the desired package and its dependencies.
Installing in the default location (lib/python3.8/site-packages/
and lib64/python3.8/site-packages/
)
will include the setuptools
and the wheel
and possibly other packages, and it will split
the dependencies between lib
and lib64
.
If I am creating the main deployment package, then add the lambda_function.py
function to the root of the zip.
The zip files should be less than 50MB in size. Otherwise, upload to the function from an S3 bucket.
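A sketch of publishing the layer zip with boto3 (the layer name is made up; for larger zips point Content at an S3 bucket/key instead of ZipFile):
import boto3
lmb = boto3.client("lambda")
with open("a_file_name.zip", "rb") as f:
    rsp = lmb.publish_layer_version(
        LayerName="my_layer",                 # hypothetical layer name
        Content={"ZipFile": f.read()},        # or {"S3Bucket": "...", "S3Key": "..."} for large zips
        CompatibleRuntimes=["python3.8"],
    )
print(rsp["LayerVersionArn"])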
s3 = boto3.resource("s3")
bucket = s3.Bucket(bucket_name)
rsp = bucket.objects.all()
# or
s3 = boto3.client("s3")
rsp = s3.list_objects(Bucket=bucket_name)
s3 = boto3.resource("s3")
bucket = s3.Bucket(bucket_name)
object = bucket.put_object(
Body=data_string_containing_the_data.encode("utf-8"),
Key=the_key_meaning_the_file_name
)
s3 = boto3.resource("s3")
bucket = s3.Bucket(bucket_name)
rsp = bucket.upload_file(file_name, object_name)
# or
s3 = boto3.client("s3")
rsp = s3.upload_file(file_name, bucket_name, object_name)
s3 = boto3.resource("s3")
bucket = s3.Bucket(bucket_name)
returned_data = io.BytesIO()
bucket.download_fileobj(Key=the_key_meaning_the_file_name, Fileobj=returned_data)
print("returned data:\n", returned_data.getvalue())
returned_data.close()
{
"Version": "2012-10-17",
"Statement": [
{
"Effect": "Allow",
"Action": [
"s3:ListBucket"
],
"Resource": [
"arn:aws:s3:::the_bucket_name"
]
},
{
"Effect": "Allow",
"Action": [
"s3:PutObject",
"s3:GetObject"
],
"Resource": [
"arn:aws:s3::: the_bucket_name/*"
]
}
]
}
Notice: ListBucket
on bucket, PutObject
and GetObject
on objects, meaning bucket followed by asterisk.
import sys
from awsglue.transforms import *
from awsglue.utils import getResolvedOptions
from pyspark.context import SparkContext
from awsglue.context import GlueContext
from awsglue.job import Job
from awsglue.dynamicframe import DynamicFrame # needed for fromDF()
Show status: put in cell
%status
glueContext = GlueContext(SparkContext.getOrCreate())
spark = glueContext.spark_session
sc = SparkContext()
glueContext = GlueContext(sc)
Read via catalog:
glue_dyfr = glueContext.create_dynamic_frame.from_catalog(database="..", table_name="..")
glue_dyfr.printSchema()
Count rows:
glue_dyfr.count()
Drop a field and rename another:
glue_dyfr = glue_dyfr.drop_fields(['col2', 'col3']).rename_field('col0', 'new_name')
Map columns:
col_mappings=[
('a_col', "string", 'a_col', "string"),
("col1", "string", "Country Code", "string"),
("col2", "string", "Indicator Name", "string"),
]
glue_dyfr = ApplyMapping.apply(frame = glue_dyfr, mappings = col_mappings)
Write to csv file:
glueContext.write_dynamic_frame.from_options(frame = glue_dyfr,
connection_type = "s3",
connection_options = {"path": "s3://chrisbucket41/gluej-exercise/tgt/"},
format = "csv")
Select a subset of columns and show data:
glue_dyfr.select_fields(['a_col']).toDF().distinct().show()
Convert to PySpark DataFrame:
pyspark_df = glue_dyfr.toDF()
Convert back to Glue DynamicFrame:
from awsglue.dynamicframe import DynamicFrame
glue_dyfr = DynamicFrame.fromDF(pyspark_df, glueContext, "a-name")
See also PySpark section for more details
Filter with SQL-like syntax:
pyspark_df = pyspark_df.where("`a_col` != 'Not classified'") # note the backticks around the column name
pyspark_df2 = pyspark_df2.filter((pyspark_df2['a_col'] != 'Country Name') & (pyspark_df2['a_col'] != 'Not classified'))
Unpivot:
from pyspark.sql import functions as F
unpiv_df = pyspark_df.select('a_col', F.expr(" num, 'col', col, ..."))
Aggregation:
c = unpiv_df.groupby('a_col').avg("pop")
Rename a column:
c = c.withColumnRenamed('avg(pop)', 'avg_pop')
Join:
pyspark_df = pyspark_df.join(c,'a_col',"left")
Count rows with SQL syntax:
pyspark_df.createOrReplaceTempView("countries")
spark.sql("select count(*) from countries").show()
Convert to a list of lists:
pyspark_df.select(yr_cols).collect()[0]
job_name = "job_from_workbook"
import boto3
glue = boto3.client(service_name='glue', region_name='us-east-1', endpoint_url='https://glue.us-east-1.amazonaws.com')
myNewJobRun = glue.start_job_run(JobName=job_name) # This starts the job
print(glue.get_job_run(JobName=job_name, RunId=myNewJobRun['JobRunId'])['JobRun']['JobRunState'])
Upload script file with UI or with:
aws s3 cp job.py s3://bucket/folder/
Start job in UI or with:
aws glue start-job-run --job-name "job_name"
Get job progress in UI or with:
aws glue get-job-run --job-name "job_name" --run-id "j..."
Get the run-id from the return when the job was started
Put this in the first cell:
%region us-east-1
%iam_role arn:aws:iam::aws-account-id:role/AWSGlueServiceRoleDev
%worker_type G.1X
%number_of_workers 2
%idle_timeout 30
Anaconda is natively available inside Snowflake.
https://developers.snowflake.com
test query:
select current_version()
The account is the first part of the URL provided at sign-up (the part before ".snowflakecomputing.com"
)
select current_version() as v;
CREATE WAREHOUSE IF NOT EXISTS whname; -- X-Small by default (this is the smallest)
USE WAREHOUSE whname;
CREATE DATABASE IF NOT EXISTS dbname;
USE DATABASE dbname;
CREATE SCHEMA IF NOT EXISTS schname;
CREATE OR REPLACE TABLE schname.tbname(. . .);
CREATE TABLE IF NOT EXISTS schname.tbname(. . .);
ALTER WAREHOUSE IF EXISTS whname RESUME IF SUSPENDED; -- To start up the warehouse
ALTER WAREHOUSE whname SUSPEND; -- To suspend the warehouse
CREATE OR REPLACE FILE FORMAT a_format_name
TYPE = 'CSV'
FIELD_DELIMITER = ';'
SKIP_HEADER = 1
-- or
TYPE = 'JSON'
STRIP_OUTER_ARRAY = TRUE
;
-- temporary json table
CREATE OR REPLACE TEMPORARY TABLE a_json_table (json_data VARIANT);
-- Create stage (recommended if loading often from the same location)
-- It is probably best to put the schema name too
CREATE OR REPLACE STAGE sch_nm.stage_name FILE_FORMAT = a_format_name;
For PUT
and LIST
, see Snowpark below.
installed python version 3.9
installed virtualenv
ran python -m pip install -r requirements_39.reqs
requirements taken from page:
https://github.com/snowflakedb/snowflake-connector-python/tree/main/tested_requirements
Snowflake Snippet
import snowflake.connector as sc
with sc.connect(
account = "abc"
, user = un
, password = pw
, warehouse = "WH"
, database = "DB"
) as cn:
sql = "select current_version()"
with cn.cursor() as cr:
cr.execute(sql)
r = cr.fetchone()
print(r[0])
Alternate
import snowflake.connector
params = {"account": . . ., "user": . . ., "password": . . .,
"warehouse": . . ., "database": . . .}
with snowflake.connector.connect(**params) as cn:
with cn.cursor() as cr:
cr.execute("select * from . . .")
for r in cr.fetchall():
print(r)
Write to a table:
df.write.mode("overwrite").save_as_table("table1")
Create a view:
df.create_or_replace_view(f"{database}.{schema}.{view_name}")
Select json data elements:
select col_name:json_base_element.elmt.elmt :: float, . . . . from . . .;
Types float, int, . . .
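A sketch using the cursor from the connector snippet above and the temporary table a_json_table created earlier; the json element names are made up:
sql = "select json_data:company.name::string as company_name from a_json_table"
cr.execute(sql)
print(cr.fetchall())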
Pandas:
cr.execute(sql)
df = cr.fetch_pandas_all()
Use version 3.8 of Python (as of Nov 2022)
This page said 3.9 was OK: https://docs.snowflake.com/en/user-guide/python-connector-install.html
I installed, but then the pip install "snowflake-snowpark-python[pandas]"
command said that only 3.8 was possible.
In Linux, downloaded gz from python.org and extracted to /usr/local/lib
sudo apt install libssl-dev libffi-dev
(If needed, do sudo dpkg-reconfigure pkg-name
????? and maybe sudo apt install libdvd-pkg
)
pip install "snowflake-snowpark-python[pandas]"
See more details in "Virtual Environments", under Installation.
from snowflake.snowpark import Session
from snowflake.snowpark.functions import avg # or other function
from snowflake.snowpark.functions import when
from snowflake.snowpark.functions import col
# alternate:
import snowflake.snowpark.functions as f
params = { "account": sf_account
, "user": sf_user
, "password": sf_pw
, "warehouse": "wsmall"
, "database": "first"
}
s = Session.builder.configs(params).create()
df = s.table("tb_nm") # returns a Snowpark DataFrame (API similar to PySpark)
df.write.mode("overwrite").save_as_table("new tb nm")
df = s.sql(sql) # no ";" !
df.show()
s.close()
Upload from local (two forward slashes after "file:"
). Returns a dataframe:
s.sql("PUT file://C:\. . .\basefilename*.csv @sch_nm.stage_name AUTO_COMPRESS=TRUE;").show() -- windows
s.sql("PUT file:///. . ./basefilename*.csv @sch_nm.stage_name AUTO_COMPRESS=TRUE;").show() -- linux
These return a dataframe with the list of files.
Look at the "status" column. "UPLOADED" means successful.
You can put again (overwrite), in which case the status shows "SKIPPED"
Show the files in the stage (returns a dataframe)
s.sql("list @sch_nm.stage_name ;").show()
Remove all the files in the stage. Notice the trailing slash (returns a dataframe).
s.sql("remove @sch_nm.stage_name/ ;").show()
Copy single file from stage to table
COPY INTO a_table
FROM @stage_name/file_name.ext.gz -- put file name here for single file
-- get the file name from the list
FILE_FORMAT = (FORMAT_NAME = a_format_name)
ON_ERROR = 'skip_file' -- or leave out if I want to fail on error
;
Copy multiple files from stage to a single table
COPY INTO a_table
FROM @stage_name -- put just stage name for multiple files
FILE_FORMAT = (FORMAT_NAME = a_format_name)
PATTERN='.*basefilename[1-5].csv.gz' -- put a pattern for the files
-- pattern appears to always start with "."
ON_ERROR = 'skip_file' -- or leave out if I want to fail on error
;
Write dataframe to a table
df.write.mode("overwrite").save_as_table("schema_name.table_name") #
returns None
Snowflake does not support indexes
Options for parameter if_exists: 'fail', 'replace', 'append'
See snippet python_snippets.html#snowpark1
Snowpark:
Look at later:
continue https://python-course.eu/python-tutorial/packages.php and https://docs.python.org/3/tutorial/modules.html
look at this: https://towardsdatascience.com/6-new-features-in-python-3-8-for-python-newbies-dc2e7b804acc
do this: https://docs.snowflake.com/en/user-guide/data-load-internal-tutorial.html
Other packages described below:
airflow
boto3 (AWS)
configparser
diagrams
django
faker
flask
graph-tool
itertools
json
jupyter
logging
networkx and pyvis
NLTK
parquet
pylint
pytest
random
re (regular expressions)
requests
smtp
yaml
zeep
Zen of Python: import this
imageMagick:
pip3 install Wand
for python3, use pip3
pipenv install pyarrow
import antigravity
Levity in Python
to format:
pipenv install black
# if needed: pipenv install black --pre
Web development:
Data science:
ML / AI:
other:
import random
random.random() # Generate a random number between 0 and 1
[random.random() for _ in range(4)] # Four random numbers between 0 and 1
random.seed(n) # n is a number
random.randrange(m) # Randomly choose between 0 and m-1
random.randrange(m, n) # Randomly choose between m and n-1
random.shuffle(a_list) # Shuffle the list in place
random.choice(a_list) # Choose one
random.sample(a_list, n) # Choose n in the list, without replacement
[random.choice(a_list) for _ in range(n)] # Choose n in the list, allowing duplicates (replacement)
import re
re.match("s", "long string") # match at the beginning of the string
re.search("s", "long string") # match anywhere in the string
re.split("[ ;.,]", "long string") # splits based on the separators [ ;,.]
re.sub("[1-9]", "0", "long string") # Substitute the digits 1-9 with zeros
Shell script:
export AIRFLOW_HOME=~/code/py/airflow
alias python='python3'
AIRFLOW_VERSION=2.2.3
PYTHON_VERSION="$(python --version | cut -d " " -f 2 | cut -d "." -f 1-2)"
CONSTRAINT_URL="https://raw.githubusercontent.com/apache/airflow/constraints-${AIRFLOW_VERSION}/constraints-${PYTHON_VERSION}.txt"
pip install "apache-airflow==${AIRFLOW_VERSION}" --constraint "${CONSTRAINT_URL}"
airflow standalone
python -m airflow standalone
# Visit localhost:8080 in the browser and use the admin account details
# shown on the terminal to login.
# Enable the example_bash_operator dag in the home page
See code snippets in separate file
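A minimal DAG sketch (assuming Airflow 2.x; the file goes in the dags folder under AIRFLOW_HOME; dag and task names are made up):
from datetime import datetime
from airflow import DAG
from airflow.operators.bash import BashOperator

with DAG(dag_id="minimal_dag",
         start_date=datetime(2022, 1, 1),
         schedule_interval="@daily",
         catchup=False) as dag:
    t1 = BashOperator(task_id="say_hello", bash_command="echo hello")
    t2 = BashOperator(task_id="say_done", bash_command="echo done")
    t1 >> t2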
pipenv install awscli
pipenv install boto3 # use this to connect from py to AWS
s3_rec = boto3.resource('s3')
bucket_obj = s3_rec.Bucket(bucket_name)
csv_obj = bucket_obj.Object(key=o.key).get().get('Body').read().decode('utf-8')
df = pd.read_csv(StringIO(csv_obj), delimiter=',')
# write
out_buffer = StringIO()
df.to_csv(out_buffer, index=False)
bucket_obj.put_object(Body=out_buffer.getvalue(), Key=filename)
Ready-to-use sample:
########
# sample file:
[a_group]
param1=a_value
db_name=the_db.{env.id.upper}
########
import configparser
config_file = "/home/. . ./.aws/credentials"
config = configparser.ConfigParser()
config.read(config_file)
database_name=config.get("a_group", "db_name") # see sample above
param_one=config.get("a_group", "param1") # see sample above
aws_access_key=config.get("default","aws_access_key_id") # based on standard .aws
aws_secret_key=config.get("default","aws_secret_access_key") # based on standard .aws
See under snippets python_snippets.html for a more detailed example
Note that spaces before and after the equal sign are ignored.
Spaces after the value are ignored.
Spaces in the value are kept.
A colon ":" can be used instead of an equal sign "="
The following are equivalent:
item=a value with spaces[EOL]
item: a value with spaces[EOL]
item = a value with spaces [EOL]
Hard-code in code:
config = configparser.ConfigParser()
# config.read(cfg_fn)
config.read_string("""
# put the contents of the config file here
[a_group]
param1=a_value
db_name=the_db.abc
""")
See snippets:
Diagrams needs graphviz
# pipenv install diagrams
# pipenv install graphviz
# Also download graphviz separately
from diagrams import Diagram, Cluster
import os
os.environ["PATH"] += os.pathsep + r"C:\progfile\Graphviz\bin"
import graphviz
######################################
from diagrams.aws.compute import EC2
from diagrams.aws.network import ELB
from diagrams.aws.network import Route53
with Diagram("descr", show=True, direction="TB") as diag:
# TB = towards bottom
dns = Route53("dns")
load_balancer = ELB("Load Balancer")
with Cluster("Webservers"):
svc_group = [EC2("wb 1"),
EC2("wb 2"),
EC2("wb 3")]
dns >> load_balancer >> svc_group >> dns
diag
######################################
from diagrams.generic.database import SQL
from diagrams.generic.storage import Storage
from diagrams.programming.flowchart import Document
from diagrams.programming.flowchart import Database
from diagrams.programming.flowchart import StoredData
with Diagram("stored proc") as diag:
src = Storage("the table")
sp = SQL("stored proc")
tgt = Storage("the tgt table")
st = StoredData("sd")
src >> sp >> tgt
diag
https://diagrams.mingrammer.com/
json.dumps(something_complex) #
serializes the object
json.dump(x, f) #
serializes and writes to file f
x = json.load(f) #
reads it back
import json
json.dump(obj, fp, ensure_ascii=True, # fp is a file pointer
indent=None, # None gives compact, 4 gives a prettier output
separators=None, #(item_separator, key_separator), such as (',', ':')
)
json.dumps(obj, ensure_ascii=True) # dumps to string
print(json.dumps(obj))
a_dict=json.load(fp,
parse_float=None,
parse_int=None,
parse_constant=None,
)
a_dict=json.loads(a_string)
json to python conversion (json --> python)
Dates as "YYYY-MM-DDTHH:MM:SS.sss"
If I get the error 'Object of type datetime is not JSON serializable', then provide a default function for serializing: json.dumps(the_object, default=str).
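A small sketch of the default=str workaround (the dict is made up):
import json
from datetime import datetime
obj = {"run_ts": datetime(2022, 4, 25, 10, 30), "status": "ok"}
print(json.dumps(obj, default=str))   # {"run_ts": "2022-04-25 10:30:00", "status": "ok"}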
See also Flask
Doc: https://2.python-requests.org//en/latest/user/quickstart/
#pip install requests
import requests
import json
response = requests.get("http://api.open-notify.org/astros.json")
print("Status=",response.status_code)
print("Response=",response.json())
# the response has three parts: message, request, response
import json
json.dumps(obj, sort_keys=True, indent=4)
See also zeep
Import:
import nltk
Run nltk.download() to open a popup for managing the downloads
Stopwords:
from nltk.corpus import stopwords
stopwords.words("english")
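For example, removing the stop words from a list of tokens (the word list is made up; requires nltk.download("stopwords") first):
from nltk.corpus import stopwords
sw = set(stopwords.words("english"))
words = ["it", "was", "the", "best", "of", "times"]
print([w for w in words if w not in sw])   # ['best', 'times']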
Doc:
import pandas as pd
import pyarrow # this is a parquet engine Alternative: fastparquet
pd.read_parquet(parquet_file, engine='auto')
See also requests
Shell Script:
#!/bin/bash
export FLASK_APP=min_flask
python3 -m flask run
File min_flask.py
from flask import Flask, redirect, url_for
app = Flask(__name__)
@app.route("/")
def hello_world():
return "<p>Namou</p>"
# export FLASK_APP=min_flask
# python3 -m flask run
@app.route("/h/")
def help_r():
return """<p>/h help<br />
/hello/a-name<br />
/param/help --> see /h<br />
/param/helloxyz -->> see hello<br />
/param/something<br />
/id/an-integer</p>"""
# end with / to be canonical
@app.route("/hello/<the_name>/")
def show_name(the_name): # parameter has to be what is between < and >
if type(the_name) is str:
return "<h1>Hello "+the_name+"! str</h1>"
else:
return "<h1>Hello "+the_name+"!xnot a strx</h1>"
@app.route("/id/<int:the_id>/")
def get_id(the_id):
return "<h1>The ID = "+str(the_id)+"</h1>"
@app.route("/param/<the_param>/")
def get_param(the_param):
if the_param=="help":
return redirect(url_for('help_r'))
elif the_param.startswith("hello"):
return redirect(url_for('show_name', the_name=the_param[5:]))
else:
return "<p>Unknown parameter: '"+the_param+"'</p>"
# needs: from flask import redirect, url_for
Another example:
pipenv shell
pipenv install flask
file:
from flask import Flask, request, jsonify
app = Flask(__name__)
@app.route('/')
def test():
return "Hello World"
@app.route('/pt', methods=['GET'])
# http://127.0.0.1:8080/p?pm=a-value
def test_param():
param = request.args.get("pm")
return jsonify({"found": param})
if __name__ == '__main__':
app.run(port=8080)
https://flask.palletsprojects.com/en/2.0.x/tutorial/layout/
/yourapplication
/yourapplication
__init__.py
/static
style.css
/templates
layout.html
index.html
login.html
...
pipenv install jupyter
Type: jupyter notebook
In the upper right, click on "new".
Shortcuts:
Clear output: Cell > all output > clear
Markdown: tables
|Header|Cells|
|---|---|
|Body of|the table|
Installation:
pip3 install networkx
pip3 install pyvis # For visualization
import networkx as nx # skip if just visualizing
from pyvis.network import Network
G = nx.Graph()
# G = nx.DiGraph() # directed graph
G.add_nodes_from([
(4, {"color": "red", "label": "4"}),
(5, {"color": "green", "label": "5"}),
(6, {"color": "purple", "label": "6"}),
(7, {"intensity": "strong", "label": "7"}),
])
# An edge with a node not entered earlier is added automatically (here 1)
G.add_edges_from([
(1,4,{'weight': 3, 'label': "1-4", "color": "yellow"}),
(4,5),
(6,4,{'weight': 2}),
(7,4,{'hidden': True})
])
G.add_edges_from([(5,5)])
G.remove_node(103) # removes its edges too
# add labels to all nodes
for n in G.nodes:
G.nodes[n]['label'] = "n" + str(n)
G.number_of_nodes()
G.number_of_edges()
G.nodes
G.edges
for n in G.nodes:
print("node: ",n)
print("adj:",G.adj[n])
print("deg: ",G.degree[n])
print(G)
g = Network(directed=True) # directed=True to see the arrows
g = Network(height="1500px", width="100%", bgcolor="#222222", font_color="white", directed=True)
g.from_nx(G) # if the graph was built with networkx
g.width="75%" # best for viewing with buttons, otherwise 100%
g.show_buttons(filter_=['physics']) # remember the underscore
g.set_edge_smooth("dynamic") # show multiple arrows
# select one of the following:
g.barnes_hut() # the default
g.force_atlas_2based()
g.hrepulsion()
g.show("give_a_file_name.html")
g.add_node("...", color="#...", shape="box")
g.add_edge("from...","to...", color="#...")
Node shapes:
Text has no shape, just the text
https://networkx.org/documentation/stable/tutorial.html
graph types:
Graph, DiGraph, MultiGraph, and MultiDiGraph
For edges, if there is only one numeric attribute, then use the 'weight' keyword for the attribute
list(nx.connected_components(G))
nx.clustering(G)
sp = dict(nx.all_pairs_shortest_path(G))
for e in list(G.edges)
dag
list(nx.topological_sort(graph)) # => ['root', 'a', 'b', 'd', 'e', 'c']
nx.is_directed_acyclic_graph(graph) # => True
https://networkx.org/documentation/stable/reference/algorithms/dag.html#
from matplotlib import pyplot as plt
g1 = nx.DiGraph()
g1.add_edges_from([("root", "a"), ("a", "b"), ("a", "e"), ("b", "c"), ("b", "d"), ("d", "e")])
plt.tight_layout()
nx.draw_networkx(g1, arrows=True)
plt.savefig("g1.png", format="PNG")
# tell matplotlib you're done with the plot: https://stackoverflow.com/questions/741877/how-do-i-tell-matplotlib-that-i-am-done-with-a-plot
plt.clf()
g2 = nx.DiGraph()
g2.add_edges_from([(1, 2), (2, 3), (3, 4), (4, 1)])
plt.tight_layout()
nx.draw_networkx(g2, arrows=True)
plt.savefig("g2.png", format="PNG")
plt.clf()
try this
https://towardsdatascience.com/graph-visualisation-basics-with-python-part-ii-directed-graph-with-networkx-5c1cd5564daa
https://networkx.org/documentation/stable/auto_examples/drawing/plot_directed.html
Some notes about graphs:
See https://en.wikipedia.org/wiki/Flowchart
Some key words: IDEF1X, Data flow, Yourdon, DeMarco
import graph_tool.all
https://graph-tool.skewed.de/
Basic outputs to console:
import logging
logging.basicConfig(level=logging.DEBUG) # DEBUG, INFO, WARNING, ERROR, or CRITICAL
logging.info('something') # notice lower case here, and upper case above when selecting the level
levels, in decreasing level of verbosity:
DEBUG, INFO, WARNING, ERROR, CRITICAL
Each level displays messages of its level and those to the right
Formatted output to file
import logging
logging.basicConfig( filename=__name__ + ".log"
, level=logging.DEBUG
, format="%(asctime)s:%(levelname)s:%(name)s:%(filename)s:%(module)s:%(funcName)s:%(lineno)s:%(msg)s"
)
%(asctime)s
date and time
%(filename)s
Python script file name (lower case "n")
%(module)s
name of module (often filename without the ".py")
%(funcName)s
name of function (upper case "N")
%(levelname)s
shows DEBUG, INFO, . . (lower case "N")
%(msg)s
See more at https://docs.python.org/3/library/logging.html
Only one basicConfiguration is allowed per logger.
The default logger is the root logger.
To add loggers, do this:
import logging #
in every module
log_obj = logging.getLogger(__name__)
# __name__ by convention, can be anything
log_obj.setLevel(logging.DEBUG) #
I can set the level on the object (shown here) or on the handler (see below)
file_handler = logging.FileHandler("a file name")
formatter = logging.Formatter("%(asctime)s:%(name)s. . . .")
file_handler.setFormatter(formatter)
file_handler.setLevel(logging.DEBUG) #
I can set the level on the object (see above), or on the handler (shown here)
log_obj.addHandler(file_handler)
# then call the logging with the object, not logging:
log_obj.debug("the msg")
Add another handler:
stream_handler = logging.StreamHandler()
streaming_formatter = logging.Formatter("%(asctime)s:%(name)s. . . .")
stream_handler.setFormatter(streaming_formatter) #
I can set a different formatter if I need
stream_handler.setLevel(logging.DEBUG) #
I can set a different level, other than the logger level
log_obj.addHandler(stream_handler)
Question: if I do not set anything on the logger objects, do I get the default logger level configuration?
To show the traceback in case of an error, do:
logging.exception("the msg")
instead of
logging.error("the msg")
Or use the log_obj
The following two are equivalent:
logging.exception("") #
better because I can put a message
and
logging.error(traceback.format_exc())
This results in an error:
logging.error("Error:", str(e))
logging.exception("Error:", str(e))
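A small sketch of logging.exception inside an except block; the message is free text and the traceback is appended automatically:
import logging
logging.basicConfig(level=logging.DEBUG)
try:
    1 / 0
except ZeroDivisionError:
    logging.exception("Error while dividing")   # logs at ERROR level and appends the traceback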
Suppress excessive debug messages in other packages:
logging.getLogger("module").setLevel(logging.INFO)
The "module"
can be "package.sub.module"
https://docs.python.org/3/library/logging.html
See snippet: python_snippets.html#logging_and_config
linter: pylint
first-line begins with ---
End of the document with ...
each line of text contains key and value pairs like a map.
key and values are separated by a colon (:) and space
use spaces instead of the tab for indentation
# Comments , can be in middle of line
# list, with list (brackets) and with bullets. Both are equivalent
key1:
- value1
- value2
- value3
- value4
- value5
or
key1: [value1,value2,
value3,value4,value5]
# indent for nesting the arrays
# associative array (here list of two associative arrays with id and name)
- id: 234
name: abc
- id: 567
name: fgh
or
[{id: 234, name: abc}, {id: 567, name: fgh}]
Strings do not need quotes, but I guess it is better with quotes
With double quotes, use \ backslash to escape
With single quotes, the only escape is double single quotes
In multiline strings, | preserves the newlines, > folds the newlines
A question mark can be used in front of a key, in the form "?key: value" to allow the key to contain leading dashes, square brackets, etc., without quotes.
Separate documents in the same stream with ---
(triple dash). Optionally end a document with triple period
&anchor01 #
define anchor label "anchor01"
*anchor01 #
references the anchor01. It allows re-use of the data
Explicitly define a data type:
Key: !!str a string
Key2: !!float
Options: !!float, !!str, !!binary
parsers:
http://yaml-online-parser.appspot.com/
http://www.yamllint.com/
Good intro:
https://en.wikipedia.org/wiki/YAML
Official spec: https://yaml.org/spec/1.2.2/
import logging
import logging.config
import yaml
def main():
"""
entry point to run the job.
"""
# Parsing YAML file
config = '...config.yml'
config = yaml.safe_load(open(config))
# configure logging
log_config = config['logging']
logging.config.dictConfig(log_config)
logger = logging.getLogger(__name__)
logger.info("This is a test.")
if __name__ == '__main__':
main()
yaml config file (needs more work):
# Logging configuration
logging:
version: 1
formatters:
the_app:
format: "The Name- %(asctime)s - %(levelname)s - %(message)s"
handlers:
console:
class: logging.StreamHandler
formatter: the_app
level: DEBUG
root:
level: DEBUG
handlers: [ console ]
pytest
has more features than the unittest
package. The unittest
package comes with python.
Documentation in https://docs.pytest.org/en/stable/
A test has four steps: arrange, act, assert, clean up.
pipenv shell
pipenv install pytest
pytest . # run the tests
pytest code/asdf.py -v # increase verbosity
pytest code/asdf.py -s # show output of prints
pytest -k "something" # k is keyword flag. Executes only test functions with name "test_.....something...."
pytest -m a_marker # Run only the tests with marker a_marker
pytest -m "not a_marker" # Exclude tests with marker a_marker
pytest -v # Verbose output: show more information
pytest -s # Show printed output (the result of 'print(...)' statements
pytest --durations=0 # Track the time of execution
import pytest
@pytest.mark.skip(reason="optional reason") # marker for skipping a test:
def test_should_be_skipped() -> None:
assert 1==2
@pytest.mark.skipif(3>1, reason="...")
def test_should_be_skipped() -> None:
assert 1==2
@pytest.mark.xfail
def test_shows_as_xfail_and_not_fail() -> None:
    # shows xpass or xfail in the results
    assert 1 == 2
@pytest.mark.any_marker
def ...
# then call with
# pytest . -m slow
# only the marked functions are tested
@pytest.mark.django_db
def ...
# Makes the following test spin up a database; a transaction is created just for that test, then rolled back when the test is completed
Fixtures are basically objects that appear in the code without me defining them.
Use for setup and teardown
The use of fixtures explicitly declares the dependencies (makes code more maintainable).
A fixture can use another fixture.
A fixture can have a function scope, class scope (run once per class), module scope (run once per module), or session scope (run once per session)
Fixture Without Arguments
@pytest.fixture
def the_fixture():
# do something
return ...
def test_that_uses_fixture(the_fixture): # the fixture is one of the parameters, without parentheses
# pytest looks at all fixtures and sees one called "the_fixture"
# it runs "the_fixture" and puts the result in the argument
print(f"Printing {the_fixture} from fixture")
Fixture With Arguments
@pytest.fixture
def the_fixture():
def _fixture_name(arg1, arg2):
...
return ...
    return _fixture_name # Here is the magic: return the inner function. No ()
def fctn_that_uses_fixture(the_fixture): # the fixture is one of the parameters, without parentheses
# here, the_fixture is a function that takes an argument
a = the_fixture(a1, a2)
You may want to put all fixtures in one file.
Place in the directory most appropriate for the scope of the fixtures.
Allows running a test multiple times with differing data
@pytest.mark.parametrize(
"the_input_param",
["A...", "B...", "C..."],
)
def test_paramtrzed(the_input_param: str) -> None:
print(f"\ntest with {the_input_param}")
# run multiple times with the different values
Indirect:
Look at the documentation.
Basically, when indirect=True, the parameters are passed to the "request" object from where they are extracted.
The parameters go to the fixture. Inside the fixture, the request object holds the parameters. Extract the parameters from the "request" object. The fixture object then returns the required data.
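A sketch of indirect parametrization; fixture and test names are made up:
import pytest

@pytest.fixture
def the_fixture(request):
    # request.param holds the value coming from the parametrize list
    return request.param.upper()

@pytest.mark.parametrize("the_fixture", ["abc", "xyz"], indirect=True)
def test_indirect(the_fixture):
    assert the_fixture.isupper()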
def fctn_that_raises_an_exception() -> None:
raise ValueError("an excep")
def test_raise_an_exception_should_pass() -> None:
with pytest.raises(ValueError):
fctn_that_raises_an_exception()
def test_raise_an_exception_should_pass_and_test_message() -> None:
with pytest.raises(ValueError) as e:
fctn_that_raises_an_exception()
# this tests the text of the message
assert "an excep" == str(e.value)
# Note that we need "str()" of the e.value
Test the test by doing a "pass" instead of "raise" in the called function fctn_that_raises_an_exception()
Add options in the pytest.ini file instead of typing pytest -v
each time:
[pytest]
addopts = -v -s
addopts = -v -s --durations=0
Markers have to be registered in pytest.ini
. The pytest.ini
file is at the root of the project.
[pytest]
markers =
this_is_a_marker: and this is the comment. Use this in test...py file: @pytest.mark.this_is_a_marker
Initialize the logger (as always).
Pass a parameter called "caplog" to the test function.
In the test function, simply assert that a given string is in caplog.text.
Note that by default only warning, error, or critical logs can be tested.
To test the "info" level, put the testing inside "with caplog.at_level(logging.INFO):" (a context manager).
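A minimal sketch, assuming a made-up do_work() function that logs at two levels:
import logging

logger = logging.getLogger(__name__)

def do_work():
    logger.warning("disk almost full")
    logger.info("details only at INFO level")

def test_warning_is_captured(caplog):
    do_work()
    assert "disk almost full" in caplog.text

def test_info_is_captured(caplog):
    with caplog.at_level(logging.INFO):   # temporarily capture INFO and above
        do_work()
    assert "details only at INFO level" in caplog.text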
Runs tests in parallel in multiple worker processes:
pipenv install pytest-xdist
pytest -n NUMCPUS
pytest -n auto # uses the max number of CPUs
pip install allure-pytest
pytest --alluredir=/tmp/test_rslts
allure serve /tmp/test_rslts
pip install pytest-sugar
Gives a nicer display of test results
Goal: concentrate on one set of functions without getting exceptions from other functions that are out of scope for my development and testing.
This means we do not send requests to the third party; instead we build responses that look like (mock) the real responses. However, remember to also test that the mocked responses match the real responses.
Patching replaces an object with another. If there is a function I do not want to run, "I can patch it out".
The patch function mocks the functionality of a given function; its parameter is the function that we want to mock. Mocking is creating an object with the same behavior, but with simpler dependencies.
from unittest.mock import Mock, MagicMock
a_mock_obj = MagicMock()
object_that_returns_a_value = MagicMock(return_value="my value")
object_that_raises_an_error = MagicMock(side_effect=ValueError("aaa"))
file mma.py
def f2mock():
print("in f2mock")
return 1234
file mmm.py
import mma
def f():
print("f start")
x = mma.f2mock()
return x
from unittest.mock import patch, MagicMock
import mmm # the module that has f()
# patching: first param is a string with target, second param is object to use instead
# patch out "mma.f2mock" because that is what shows in the code
# patch where the function is used, not where it is defined
@patch("mma.f2mock", MagicMock(return_value=13))
def test_mock_out_db_write():
assert mmm.f() == 13
# alternate: use a context manager:
def test_mock_out_db_write_with_context_manager():
assert mmm.f() == 1234
with patch("mma.f2mock", MagicMock(return_value=13)) as mock_dbwri:
assert mmm.f() == 13
Option #1 for designating the function to be mocked:
file mmm.py
import mma
def f():
return mma.f2mock()
file test_mmm.py
from unittest.mock import patch, MagicMock
import mmm
def test_mock_out_f2mock():
assert mmm.f() == 1234
with patch("mma.f2mock", MagicMock(return_value=13)) as mock_dbwri:
assert mmm.f() == 13
Option #2 for designating the function to be mocked:
file mmm.py
import mma as abc # as abc here
def f():
return abc.f2mock() # call abc.f2mock() not mma.f2mock()
file test_mmm.py
from unittest.mock import patch, MagicMock
import mmm
def test_mock_out_f2mock():
assert mmm.f() == 1234
with patch("mma.f2mock", MagicMock(return_value=13)) as mock_dbwri: # notice mma here, and not abc
assert mmm.f() == 13
Option #3 for designating the function to be mocked:
file mmm.py
from mma import f2mock as ddd # import just the function, and rename
def f():
return ddd() # calling with just ddd()
file test_mmm.py
from unittest.mock import patch, MagicMock
import mmm
def test_mock_out_f2mock():
assert mmm.f() == 1234
with patch("mmm.ddd", MagicMock(return_value=13)) as mock_dbwri: # notice mmm.ddd here
assert mmm.f() == 13
pipenv install responses
import requests
import responses

@responses.activate
def test_where_i_want_to_mock():
responses.add(method=responses.GET, url=anyurl, json={what I want to simulate}, status=200) # can be any status
rsp = requests.get(anyurl) # note: same url as above
assert rsp.json() == {what I want to simulate}
See https://www.djangoproject.com/
pipenv install djangorestframework
django-admin startproject the_name # this creates the service; the name must be a valid Python identifier (no hyphens)
cd the_name
python manage.py runserver # .../first-sub-dir-where-manage.py-is-located
python manage.py migrate
python manage.py createsuperuser
#go to http://127.0.0.1:8000/admin
python manage.py startapp application_name # create a new application inside the service; again a valid identifier, no hyphens
python manage.py makemigrations application_name # then migrate after each change
You may have to add:
export PYTHONPATH=/..../root_directory_of_project
In the models.py file: create a class that inherits from models.Model and add the attributes.
In the admin.py file, register the application's models.
Create a serializer that inherits from serializers.ModelSerializer (from rest_framework).
Create view sets in the views.py file (see the sketch after the router example below).
Put URLs in urls.py in the app directory. This file maps the routes (urls) to the functions.
from rest_framework import routers
from .views import CompanyViewSet
app_router = routers.DefaultRouter()
app_router.register("app_prefix", viewset=CompanyViewSet, basename="companies") # CompanyViewSet is defined in the views.py file
In the urls.py for the whole server, add:
from api.service_name.app_name.urls import app_router
add this line:
path("", include(app_router.urls))
Send email automatically when posting to ..../send_email
For testing, import the Django TestCase class from django.test.
Django's test runner switches to the "in-memory" (locmem) email backend, so no real email is sent.
Test by looking at the outbox: assert that it holds nothing before sending, and 1 email after sending.
We do NOT want it to fail silently (fail_silently=False).
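A minimal sketch, assuming the code under test simply calls django.core.mail.send_mail (the addresses are placeholders):
from django.core import mail
from django.core.mail import send_mail
from django.test import TestCase

class SendEmailTest(TestCase):
    # With TestCase, messages go to django.core.mail.outbox instead of being sent.
    def test_one_email_is_sent(self):
        assert len(mail.outbox) == 0
        send_mail(
            "subject", "body",
            "from@example.com", ["to@example.com"],
            fail_silently=False,   # we do NOT want silent failures
        )
        assert len(mail.outbox) == 1
        assert mail.outbox[0].subject == "subject"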
https://pypi.org/project/Faker/
from faker import Faker
fake = Faker() # default "en_US"
# other options: 'it_IT', 'fr_FR', ['en_US', 'ja_JP']
fake.name()
fake.address()
fake.text()
Also, command line:
faker --version
import itertools
(standard library)
Something is iterable if it has a method called "__iter__"; calling it returns an iterator, which remembers its state between calls.
Take any iterable object (list, tuple, . . .) and get its iterator:
it = obj.__iter__()
it = iter(obj) #
alternative syntax that does the same thing
Then, do next(it)
as needed to get the successive values, or of course do for a in it:
print(next(a_counter))
print(next(a_counter))
print(next(a_counter))
Note: a counter (itertools.count below) has no end. For a finite iterator, going past the end raises StopIteration.
Iterators can only go forward.
a_counter = itertools.count() #
optional parameters start
and step
(decimal possible)
c = itertools.cycle(a_iterator) #
cycle thru forever
c = itertools.repeat(a_value) #
repeat with an optional parameter times=
itertools.combinations(lst, n) #
(order does not matter)
itertools.permutations(lst, n) #
(order matters: (1,2) and (2,1) both listed)
itertools.combinations_with_replacement(lst, n) #
combinations that allow repeats
itertools.product(lst, repeat=n) #
cartesian product
itertools.product([0, 1], repeat=4) #
all possible values of 4 bits
Get part of a generator, and make a new generator (remember, nothing is yet calculated):
itertools.islice(gn, 7) #
The first 7
itertools.islice(gn, 2, 7) #
Skips 2, then returns the next 7-2=5
itertools.islice(gn, 2, 7, 2) #
start at index 2, stop before index 7, step by 2 (yields items 2, 4, 6)
Note that these three arguments are the same as for range().
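A few quick examples of the functions above (nothing here is specific to my code):
import itertools

evens = itertools.count(start=0, step=2)            # infinite iterator: 0, 2, 4, ...
first_five = list(itertools.islice(evens, 5))       # take only the first 5 values
print(first_five)                                   # [0, 2, 4, 6, 8]

print(list(itertools.combinations([1, 2, 3], 2)))   # [(1, 2), (1, 3), (2, 3)]
print(list(itertools.product([0, 1], repeat=2)))    # [(0, 0), (0, 1), (1, 0), (1, 1)]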
import smtplib
(standard library)
I have not tried this yet.
with smtplib.SMTP('server.com', 587) as ms:
ms.ehlo()
ms.starttls()
ms.ehlo()
ms.login(u, pw)
msg = "subject: . . .\n\n. . . ."
ms.sendmail(my_email, rcver_email, msg)
Run a local debugging mail server. Note that it has no TLS and no login (the smtpd module was removed in Python 3.12; aiosmtpd is a common replacement):
python -m smtpd -c DebuggingServer -n localhost:a_port_num
Alternative:
with smtplib.SMTP_SSL('server.com', 465) as ms:
ms.login(u, pw)
msg = "subject: . . .\n\n. . . ."
ms.sendmail(my_email, rcver_email, msg)
Make handling of message part easier:
from email.message import EmailMessage
msg = EmailMessage()
msg["Subject"] = ". . ."
msg["From"] = ". . ."
msg["To"] = ". . ." # for multiple, do a list, or a string with comma separated emails
msg.set_content(". . .")
msg.add_alternative(html_str, subtype="html") # optional: add an HTML version (method on the message, not on the SMTP object)
ms.send_message(msg) # send through the SMTP connection
Attach file
with open("file name", "rb") as f:
d = f.read()
img_type = "jpeg" # or use package imghdr: imghdr.what("file name")
fn = f.name
msg.add_attachment(d, maintype="image", subtype=img_type, filename=fn) # subtype from imghdr.what() or hard-coded "jpeg"
Generic: maintype="application", subtype="octet-stream"
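Putting the pieces together (untried, like the rest of this section; the server, port, and credentials are placeholders):
import smtplib
from email.message import EmailMessage

msg = EmailMessage()
msg["Subject"] = "Report"
msg["From"] = "me@example.com"
msg["To"] = "you@example.com"
msg.set_content("Plain-text body")
msg.add_alternative("<h1>HTML body</h1>", subtype="html")   # optional HTML part

with smtplib.SMTP_SSL("smtp.example.com", 465) as ms:       # placeholder server
    ms.login("user", "password")                            # placeholder credentials
    ms.send_message(msg)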
See also requests
http://www.python-course.eu/python3_deep_copy.php
continue with http://www.python-course.eu/python3_recursive_functions.php
http://www.sthurlow.com/python/lesson01/
http://wiki.python.org/moin/BeginnersGuide/Programmers
Brent Welch's "Practical Programming in Tcl and Tk"
When opening files, use a context manager. This forces the file to close if the code throws an error.
Instead of:
f = open(...)
f.close()
Do this:
with open(...) as f:
With a bare "except" clause, a ctrl-C triggers an exception, yet I want to stop the execution.
Instead, do "except ValueError:" or whatever error type. Otherwise, do "except Exception e:"
It is better to not handle the exception than to do a "pass". Remove useless exception clauses.
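A small illustration (the parsing code is just an example):
try:
    value = int("not a number")
except ValueError:          # specific: Ctrl-C still stops the program
    value = 0
except Exception as e:      # broad fallback; still lets KeyboardInterrupt/SystemExit through
    print(f"unexpected error: {e}")
    raise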
Instead of checking for a type this way:
if type(a) == a_type:
Do
if isinstance(a, a_type):
is keyword:
if x is None:
if x is True: # or just: if x
if x is False: # or just: if not x
Do NOT do:
x == None
x == True
x == False
Note: the result has the length of the shortest input: list(zip([1,2],[])) gives []
a = ["a", "b", "c", "d"]
b = [1,2,3]
c = ["α", "β", "γ"]
z = zip(a, b, c)
[zz for zz in z]
# returns: [('a', 1, 'α'), ('b', 2, 'β'), ('c', 3, 'γ')]
To loop over the keys of a dictionary, do for k in d, not for k in d.keys(), as .keys() is not necessary here.
Note: if I am going to modify the keys, then iterate over a copy: for k in list(d)
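For example, with a toy dictionary:
d = {"a": 1, "b": 2}
for k in d:           # same as iterating over d.keys(), just shorter
    print(k, d[k])
for k in list(d):     # iterate over a copy when the loop changes the keys
    if d[k] == 1:
        del d[k]      # deleting while iterating over d itself would raise RuntimeError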
Instead of
a = mytuple[0]
b = mytuple[1]
do
a, b = mytuple
If I do not know the full length of the tuple:
a variable preceded by an asterisk takes on the list of all remaining values, and is empty if there are none.
a, b, *c, d = (1,2,3,4,5,6)
Or, if I do not intend to use the remaining values, use *_ (a convention, not a requirement):
a, b, *_ = (1,2,3,4)
For inputing password on the screen:
import getpass
u = input("username: ")
p = getpass.getpass("password: ")
Instead of for i in range(len(lst))
do
lst = [3,4,5]
for i,a in enumerate(lst, start=1):
or, when there are two lists:
a = [3,4,5]
b = [5,6,7]
for aa,bb in zip(a,b):
When counting in the two lists:
a = [3,4,5]
b = [5,6,7]
for i, (aa,bb) in enumerate(zip(a,b)):
Note, enumerate
starts at 0 by default; add start=...
if needed.
See also "enumerate".
Instead of
i=0
for a in lst:
...
i += 1
Do
for i,a in enumerate(lst):
...
At the end, i+1 is the number of loops that were done (or use enumerate(lst, start=1) so that i is the count).
Of course, this does not work if I have a conditional count.
PEP 8 is a style guide. See https://peps.python.org/pep-0008/
Never pass a mutable object as a default value for parameter. Instead of
def asdf(lst=[]):
# lst is an empty list at first call, but this picks up the previous value in the subsequent calls
Do this:
def asdf(lst=None):
# instead, set the default to "lst=None"
This is because Python evaluates default values once, when the function is defined. Set the default to None instead and create the list inside the function:
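The usual idiom:
def asdf(lst=None):
    if lst is None:        # a fresh list is created on every call
        lst = []
    lst.append("item")     # example use
    return lst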
Watch this unexpected result: (sneaky behavior to be aware of):
import time
import datetime
def show_curr_time(t=datetime.datetime.now()): # the default value is the current time when the function is defined!
print(t)
show_curr_time()
time.sleep(1)
show_curr_time() # same value as above
The last element of a list or string is at the position len(...)-1.
The following code will throw an error: a_string[len(a_string)].
Instead, do this: a_string[len(a_string)-1], or better still a_string[-1].
Related to this, the following will throw an error when the_pos == len(the_string):
the_string[the_pos] if len(the_string) >= the_pos else ''
Use the following code instead (notice ">" not ">="):
the_string[the_pos] if len(the_string) > the_pos else ''
Display large numbers with separators "_" (underscore):
x = 32_439_318
Print with a separator: print(f"{x:,}") for commas, or print(f"{x:_}") for underscores
https://docs.python.org/2/howto/webservers.html
http://fragments.turtlemeat.com/pythonwebserver.php
http://www.linuxjournal.com/content/tech-tip-really-simple-http-server-python
https://wiki.python.org/moin/WebProgramming
http://docs.python-guide.org/en/latest/scenarios/web/
mod_wsgi (Apache) (Embedding Python)
mod_wsgi allows you to run Python WSGI applications on Apache HTTP Server.
https://pypi.python.org/pypi/mod_wsgi
http://modpython.org/
http://www.onlamp.com/pub/a/python/2003/10/02/mod_python.html
See details on package installation: https://packaging.python.org/tutorials/installing-packages/
The Classical Language Toolkit https://github.com/cltk/cltk
natural language pipeline that supports massive multilingual applications https://pypi.python.org/pypi/polyglot/
Text-Fabric, includes a graph-like approach
https://pypi.python.org/pypi/text-fabric/
See also (collection of richly annotated data sources, including HB and GNT)
https://github.com/ETCBC/text-fabric-data
Library for working with neo4j and graphs https://github.com/bruth/graphlib/
Another library https://github.com/svasilev94/GraphLibrary
Graph visualization https://pypi.python.org/pypi/graphistry/0.9.51
High performance graph data structures and algorithms
https://pypi.python.org/pypi/python-igraph/0.7.1.post6
See also
http://igraph.org/python/doc/tutorial/tutorial.html
Graphyne is a smart graph: a property graph capable of actively reacting to changes and incorporating decision-making logic, written in Python https://github.com/davidhstocker/Graphyne