pandarallel allows any Pandas user to take advantage of their multi-core computer, whereas Pandas alone uses only one core. pandarallel also offers progress bars (available in notebooks and on the terminal) that give a rough idea of how much computation remains.
The Levenshtein Python C extension module contains functions for fast computation of
Levenshtein (edit) distance, and edit operations
string similarity
approximate median strings, and generally string averaging
string sequence and set similarity
It supports both normal and Unicode strings.
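As an illustration of what the edit distance measures, here is a minimal pure-Python sketch of the classic dynamic-programming algorithm. It is not the C extension's implementation or API; the function name `edit_distance` is hypothetical, and the extension module exists precisely because a loop like this is slow in pure Python.

```python
def edit_distance(a: str, b: str) -> int:
    """Return the Levenshtein (edit) distance between two strings."""
    # Keep the shorter string in the inner loop so each row stays small.
    if len(a) < len(b):
        a, b = b, a
    # previous[j] holds the distance between the processed prefix of a and b[:j].
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute (free when characters match)
            ))
        previous = current
    return previous[-1]
```

Because Python strings are sequences of Unicode code points, the sketch treats "naïve" and "naive" as differing by a single substitution.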
Hypothesis is a library for testing your Python code against a much larger range of examples than you would ever want to write by hand. It’s based on the Haskell library QuickCheck and is designed to integrate seamlessly into your existing Python unit testing workflow.
fastparquet is a Python implementation of the Parquet file format. fastparquet is used implicitly by dask, pandas and intake-parquet. It supports the following compression algorithms:
Gzip
Snappy
Brotli
LZ4
Zstd
LZO (optionally)
python-lazy-loader makes it easy to load subpackages and functions on demand. Its main features are:
Allow subpackages to be made visible to users without incurring import costs.
Allow external libraries to be imported only when used, improving import times.
smartypants can perform the following transformations:
Straight quotes ( " and ' ) into "curly" quote HTML entities
Backtick-style quotes (``like this'') into "curly" quote HTML entities
Dashes (-- and ---) into en- and em-dash entities
Three consecutive dots (... or . . .) into an ellipsis entity
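The dash and ellipsis transformations listed above can be sketched in a few lines. This is a hand-written illustration, not the package's API: the function name `educate_dashes_and_ellipses` is hypothetical, the choice of numeric HTML entities is an assumption, and quote handling is omitted because it needs context-sensitive rules.

```python
import re


def educate_dashes_and_ellipses(text: str) -> str:
    """Replace ASCII dashes and dot runs with HTML entities (illustrative only)."""
    # Replace the longer run first so "---" is not consumed as "--" plus "-".
    text = text.replace("---", "&#8212;")  # em-dash entity
    text = text.replace("--", "&#8211;")   # en-dash entity
    # Accept both "..." and the spaced form ". . .".
    return re.sub(r"\.\.\.|\. \. \.", "&#8230;", text)


if __name__ == "__main__":
    print(educate_dashes_and_ellipses("pages 3--5 --- or so..."))
```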
Entry points are a way for Python packages to advertise objects with some common interface. The most common examples are console_scripts entry points, which define shell commands by identifying a Python function to run. The entrypoints module contains functions to find and load entry points.
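An entry point identifies its target with an object reference of the form "package.module:attribute". The sketch below shows how such a reference can be resolved using only the standard library. It does not use the entrypoints module's own API, and the helper name `load_object` is hypothetical.

```python
import importlib


def load_object(reference: str):
    """Resolve a 'package.module:attr.subattr' reference to a Python object."""
    module_name, _, attr_path = reference.partition(":")
    obj = importlib.import_module(module_name)
    # Walk the dotted attribute path; an empty path returns the module itself.
    for attr in filter(None, attr_path.split(".")):
        obj = getattr(obj, attr)
    return obj


if __name__ == "__main__":
    join = load_object("os.path:join")
    print(join("usr", "bin"))
```

A console_scripts entry point pairs a command name with a reference like this one, and the generated script calls the resolved function.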
Kiwi is an efficient C++ implementation of the Cassowary constraint solving algorithm. Kiwi has been designed from the ground up to be lightweight and fast. Kiwi ranges from 10x to 500x faster than the original Cassowary solver with typical use cases gaining a 40x improvement. Memory savings are consistently > 5x.
This package provides a version helper that lets you automatically use the latest vX.X.X Git tag as the version in your Python package. It also supports getting the version from Python source distributions or, once your package is installed, via pkg_resources (part of setuptools).
This is a library that enables automated deprecations. It offers the deprecated() decorator to wrap functions, providing proper warnings both in documentation and via Python’s warnings system, as well as the deprecation.fail_if_not_removed() decorator for test methods to ensure that deprecated code is eventually removed.
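To show the kind of behaviour such a decorator provides, here is a minimal stand-in built on the standard warnings module. It is not the library's implementation: the decorator below and its `details` parameter are written for illustration only, and it does not update docstrings or check version numbers.

```python
import functools
import warnings


def deprecated(details: str = ""):
    """Hypothetical stand-in: emit a DeprecationWarning on every call."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            warnings.warn(
                f"{func.__name__} is deprecated. {details}".strip(),
                category=DeprecationWarning,
                stacklevel=2,  # attribute the warning to the caller's line
            )
            return func(*args, **kwargs)
        return wrapper
    return decorator


@deprecated(details="Use new_sum() instead.")
def old_sum(a, b):
    return a + b
```

Calling `old_sum(1, 2)` still returns the sum, but the caller sees a DeprecationWarning when warnings of that category are enabled.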
ScientificPython is a collection of Python modules that are useful for scientific computing. Most modules are rather general (Geometry, physical units, automatic derivatives, ...) whereas others are more domain-specific (e.g. netCDF and PDB support). The library is currently not actively maintained and works only with Python 2 and NumPy < 1.9.
The shellescape Python module defines the shellescape.quote() function that returns a shell-escaped version of a Python string. This is a backport of the shlex.quote() function from Python 3.8 that makes it accessible to users of Python 3 versions < 3.3 and all Python 2.x versions.
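On Python versions that already ship shlex.quote(), the standard-library function can be used directly. The short example below uses it to show what shell escaping protects against; the file name is made up for illustration.

```python
import shlex

# A hostile or merely awkward file name containing shell metacharacters.
filename = "my file; rm -rf ~"

# Quoting turns the whole string into a single, inert shell word.
command = f"ls -l {shlex.quote(filename)}"

if __name__ == "__main__":
    print(command)
```

Splitting the resulting command line again with shlex.split() yields the original file name as one argument, which is a convenient way to check that the quoting round-trips.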
This plugin installs a mocker fixture which is a thin wrapper around the patching API provided by the mock package, but with the benefit of not having to worry about undoing patches at the end of a test. The mocker fixture has the same API as mock.patch, supporting the same arguments.
The asyncstdlib library re-implements functions and classes of the Python standard library to make them compatible with async callables, iterables and context managers. It is fully agnostic to async event loops and seamlessly works with asyncio, third-party libraries such as trio, as well as any custom async event loop.
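To make "compatible with async iterables" concrete, the sketch below hand-writes an async counterpart of the built-in enumerate(). It illustrates the idea only and is not taken from the library; the names `aenumerate`, `ticker` and `collect` are hypothetical, and asyncio is used here merely as one possible event loop.

```python
import asyncio


async def aenumerate(async_iterable, start=0):
    """Async counterpart of enumerate(): yield (index, item) pairs."""
    index = start
    async for item in async_iterable:
        yield index, item
        index += 1


async def ticker(n):
    """Toy async generator that yields n labelled ticks."""
    for i in range(n):
        await asyncio.sleep(0)  # hand control back to the event loop
        yield f"tick-{i}"


async def collect():
    return [pair async for pair in aenumerate(ticker(3), start=1)]


if __name__ == "__main__":
    print(asyncio.run(collect()))
```

The helper itself never touches the event loop, which is what allows code of this shape to stay loop-agnostic.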
Flask-RESTX is an extension for Flask that adds support for quickly building REST APIs. Flask-RESTX encourages best practices with minimal setup. If you are familiar with Flask, Flask-RESTX should be easy to pick up. It provides a coherent collection of decorators and tools to describe your API and expose its documentation properly using Swagger.
The LAZR config system is typically used to manage process configuration. Process configuration describes how things change when systems run on different machines or under different circumstances. This system uses an ini-like file format of sections, keys, and values. The config file supports inheritance to minimize duplication of information across files. The format supports schema validation.
ISO 8601 is most commonly known as a way to exchange datetimes in textual format. A lesser-known aspect of the standard is the representation of durations. The state of the art of ISO 8601 duration handling in Python is more or less limited to what's offered by isodate. This package attempts to address the shortcomings of isodate.
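For readers unfamiliar with the duration notation, the sketch below parses a small subset of it (days, hours, minutes and seconds) into a standard-library timedelta. It is neither this package's API nor isodate's; the name `parse_duration` is hypothetical, and years and months are deliberately rejected because they have no fixed length that a timedelta could represent.

```python
import re
from datetime import timedelta

# Subset of ISO 8601 durations: PnDTnHnMnS, every component optional.
_DURATION = re.compile(
    r"^P(?:(?P<days>\d+)D)?"
    r"(?:T(?:(?P<hours>\d+)H)?"
    r"(?:(?P<minutes>\d+)M)?"
    r"(?:(?P<seconds>\d+(?:\.\d+)?)S)?)?$"
)


def parse_duration(text: str) -> timedelta:
    """Parse strings such as 'P1DT2H30M' or 'PT0.5S' into a timedelta."""
    match = _DURATION.match(text)
    parts = {}
    if match is not None:
        parts = {k: float(v) for k, v in match.groupdict().items() if v is not None}
    # Reject non-matching input, a bare 'P', and a dangling 'T' designator.
    if not parts or text.endswith("T"):
        raise ValueError(f"unsupported or invalid duration: {text!r}")
    return timedelta(**parts)
```

For example, `parse_duration("P1DT2H30M")` gives one day, two hours and thirty minutes, while `parse_duration("P1Y")` raises ValueError.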
This package provides a YAML parser/emitter that supports round-trip preservation of comments, seq/map flow style, and map key order. It is a derivative of Kirill Simonov’s PyYAML 3.11. It supports YAML 1.2 and has round-trip loaders and dumpers. Block style and key ordering are kept, so you can diff the source.
Telomerecat is a tool for estimating the average telomere length (TL) for a paired-end, whole-genome sequencing (WGS) sample.
Telomerecat is adaptable, accurate and fast. The algorithm accounts for sequencing amplification artifacts, aneuploidy (common in cancer samples) and noise generated by WGS. For a high-coverage WGS BAM file of around 100 GB, Telomerecat can produce an estimate in ~1 hour.
PyTorch is a Python package that provides two high-level features:
tensor computation (like NumPy) with strong GPU acceleration;
deep neural networks (DNNs) built on a tape-based autograd system.
You can reuse Python packages such as NumPy, SciPy, and Cython to extend PyTorch when needed.
Note: currently this package does not provide GPU support.
This package supports the creation of a combined header for a FITS file based on the contents of the headers of a set of input FITS images. A rules file defines what keywords will be present in the combined output header as well as how the output value will be determined from the set of values from all the input image headers.
PINT (PINT Is Not TEMPO3) is a package for pulsar timing, written in Python from scratch. Features:
a robust system to produce high-precision timing results that is completely independent of TEMPO and Tempo2
a system that is easy to extend and modify due to a good design and the use of a modern programming language, techniques, and libraries