Software

Python packages

ER-Evaluation

An End-to-End Evaluation Framework for Entity Resolution Systems.

pip install er-evaluation

PatentsView-Evaluation

pv_evaluation is a Python package for the evaluation and benchmarking of PatentsView disambiguation algorithms. It provides summary statistics, performance evaluation metrics, and representative performance estimators. See the project website for the full documentation.

pip install pv-evaluation

StringCompare

StringCompare is a Python package (implemented in C++ through pybind11) for efficient string similarity computation and approximate string matching.

pip install git+https://github.com/OlivierBinette/stringcompare.git

đźš§ GroupByRule

Deduplicate data using fuzzy and deterministic matching rules. This is under construction.

pip install git+https://github.com/OlivierBinette/groupbyrule.git

R packages

FingermatchR

Fingerprint matching tools based on NIST’s Biometric Image Software, on FingerJet minutiae extraction tool, and on the libfmr library.

devtools::install_github("forensic-science/fingermatchR")

MSETools

Code and analyses for the paper titled “On the Reliability of Multiple Systems Estimation for the Quantification of Modern Slavery” (Binette and Steorts, 2021).

devtools::install_github("OlivierBinette/MSETools")

cache

CRAN_Status_Badge

Simple interface to caching which works across interactive R sessions, R scripts and Rmarkdown documents.

install.packages("cache")

assert

CRAN_Status_Badge

Lightweight validation tool for checking function arguments and data analysis scripts.

install.packages("assert")

dgaFast

Multiple Systems Estimation Using Decomposable Graphical Models. This is an efficient re-implementation and extension of the dga R package (it is now part of dga).

devtools::install_github("OlivierBinette/dgaFast")

TessTools

Tools for the use of Tesseract OCR in R and for the analysis of historical newspaper archives.

devtools::install_github("OlivierBinette/TessTools")

splinit

Periodic spline regression and closed curve reconstruction.

devtools::install_github("OlivierBinette/splinit")

pretty

Better baser plots in R.

devtools::install_github("OlivierBinette/pretty")

Javascript apps

Fractals

High resolution visualization for the Mandelbrot set. A Java version with more features is also available.

Earthquakes

Visualize earthquakes on the globe.

Other software