This website lists some of what I do. See my publications, teaching, and software.
I am a data scientist at American Institutes for Research. Previously, I was a data science consultant while doing my PhD in the Statistical Science Department at Duke University. I specialize in applied data science R&D, in the statistical evaluation of AI/machine learning systems, and in entity resolution for data linkage, cleaning, and enrichment. My work combines engineering, machine learning, and statistics to address applied problems in these areas.
I contribute to open-source projects related to my work, and I maintain various R and Python packages. My open-source work has previously been supported by G-Research and by individual contributors through GitHub Sponsors.