Publications

Benjamin Charles Germain Lee

Journal Publications

LIMEADE: From AI Explanations to Advice Taking
Benjamin C.G. Lee, Doug Downey, Kyle Lo & Daniel S. Weld
ACM Transactions on Interactive Intelligent Systems (TiiS), Volume 13, Issue 4 (2023)
Special Issue: Human-Centered Explainable AI
DOI, ArXiv

The "Collections as ML Data" Checklist for Machine Learning and Cultural Heritage
Benjamin C.G. Lee
Journal of the Association for Information Science and Technology (JASIST) (2023)
Special Issue: Conceptual Models of the Sociotechnical
DOI, ArXiv

Towards a Speculative Bibliography of Hemispheric Reconstruction Newspapers
Joshua Ortiz Baco*, Benjamin C.G. Lee*, Sarah Salter* & Jim Casey* (equal contribution)
Criticism: A Quarterly for Literature and the Arts, Volume 64, Numbers 3–4 (2022)
Special Issue: New Approaches to Critical Bibliography and the Material Text
DOI

Grappling with the Scale of Born Digital Government Publications: Toward Pipelines for Processing and Searching Millions of PDFs
Benjamin C.G. Lee & Trevor Owens
International Journal of Digital Humanities, Volume 3, 2022
DOI, ArXiv

Compounded Mediation: A Data Archaeology of the Newspaper Navigator Dataset
Benjamin C.G. Lee
Digital Humanities Quarterly, Volume 15, Issue 4, 2021
DOI, Humanities Commons

Machine Learning and the Social Studies
Benjamin C.G. Lee, Ilene R. Berson & Michael J. Berson
Social Education, Volume 85, Issue 2, 2021
DOI

Machine Learning, Template Matching, and the International Tracing Service Archive:
Automating the Retrieval of Death Certificate Reference Cards from 40 Million Document Scans
Benjamin C.G. Lee
Digital Scholarship in the Humanities, Volume 4, Issue 3, 2019
DOI

Improved Point-source Detection in Crowded Fields Using Probabilistic Cataloging
Stephen K.N. Portillo, Benjamin C.G. Lee, Tansu Daylan & Douglas Finkbeiner
The Astronomical Journal, Volume 154, Number 4, 2017
DOI, ArXiv

Galaxy Redshifts from Discrete Optimization of Correlation Functions
Benjamin C.G. Lee, Tamás Budavári, Amitabh Basu & Mubdi Rahman
The Astronomical Journal, Volume 152, Number 6, 2016
DOI, ArXiv

Conference Publications

Navigating the Mise-en-Page: Interpretive Machine Learning Approaches to the Visual Layouts of Multi-Ethnic Periodicals
Benjamin C.G. Lee*, Joshua Ortiz Baco*, Sarah Salter* & Jim Casey* (equal contribution)
Computational Humanities Research (CHR) 2021
DOI, ArXiv

LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis
Zejiang Shen, Ruochen Zhang, Melissa Dell, Benjamin C.G. Lee, Jacob Carlson & Weining Li
ICDAR 2021
DOI, ArXiv, Preview Video, Presentation Video

The Newspaper Navigator Dataset: Extracting Headlines and Visual Content from 16 Million Historic Newspaper Pages in Chronicling America
Benjamin C.G. Lee, Jaime Mears, Eileen Jakeway, Meghan Ferriter, Chris Adams, Nathan Yarasavage, Deborah Thomas, Kate Zwaard & Daniel S. Weld
CIKM 2020
DOI, ArXiv, GitHub, Dataset Website
*Best Resource Paper Runner-up (92 submissions)*
*Best Digital Humanities Dataset, 2020 DH Awards*

Book Chapters

The Digital Humanities and the Ladino Press:
Using Machine Learning to Extract and Analyze Visual Content in Historic Ladino Newspapers
Benjamin C.G. Lee
Jewish Studies in the Digital Age
Studies in Digital History and Hermeneutics Series, Chapter 10
De Gruyter Press, 2022
DOI

Identity, Personhood, and Material Culture:
Personal Effects Confiscated from Prisoners Upon Arrival at Dachau Concentration Camp
Gabriel Pizzorno* & Benjamin C.G. Lee* (equal contribution)
The Material Culture of Difficult Histories, Chapter 10
Cornell University Press (forthcoming)

Computer Science Research and Digital Humanities Questions
Benjamin C.G. Lee
The Digital Futures of Graduate Study in the Humanities, Chapter 30
Debates in the Digital Humanities Series
University of Minnesota Press (forthcoming)

Access
Benjamin C.G. Lee
Digital Preservation: A Critical Vocabulary
MIT Press (forthcoming)

Fundamentals and Ethics of AI & Machine Learning
Benjamin C.G. Lee
Engaging with Big and Small Historical Data
Engaging with... Series
Routledge Press (forthcoming)

Edited Volumes

Cultures of Scale: Disciplines, Data, and Labor
Editors: Joshua Ortiz Baco*, Jim Casey*, Benjamin C.G. Lee*, and Sarah H. Salter* (equal contribution)
Debates in the Digital Humanities Series
University of Minnesota Press (forthcoming)
CfP

In Preparation

Powell.pps: Close & Distant Reading of Primary Sources in Web Archives
Trevor Owens & Benjamin C.G. Lee

Integrating Visual and Textual Inputs for Enhanced Map Retrieval of the Library of Congress's Geography and Maps Digital Collections
James Mahowald & Benjamin C.G. Lee

The European Clergy in Dachau: A Digital Humanities Research Approach to a Concentration Camp Prisoner Population
Benjamin C.G. Lee* & Andrew Kloes* (equal contribution)

Commissioned Reports

A Landscape of Data Sources: Findings & Recommendations
A Report Commissioned by the Library of Congress
Benjamin C.G. Lee
In partnership with the Digital Strategy Directorate, Strategic Planning & Performance Management, and the Financial Services Directorate
February 1, 2021
Delivered to the Deputy Librarian of Congress (available as internal report only)

Workshop Publications

Past Meets Future: Human-AI Interaction for Digital History and Cultural Heritage
Kurt Luther, Vikram Mohanty, Benjamin C.G. Lee & Ioanna Lykourentzou
29th International Conference on Intelligent User Interfaces (IUI) 2024
DOI

Line Detection in Binary Document Scans:
A Case Study with the International Tracing Service Archives
Benjamin C.G. Lee
2nd Computational Archival Science Workshop
2017 IEEE International Conference on Big Data
DOI

Demos

Newspaper Navigator: Open Faceted Search for 1.5 Million Images
Benjamin C.G. Lee & Daniel S. Weld
UIST 2020
Paper DOI, Preview Video, Short Talk Video

Other

Newspaper Navigator: Putting Machine Learning in the Hands of Library Users
Benjamin C.G. Lee, Jaime Mears, Eileen Jakeway, Meghan Ferriter & Abigail Potter
October 16, 2020
EuropeanaTech Insight (Issue 16)
DOI

Dissertation

Human-AI Interaction for Exploratory Search & Recommender Systems With Application to Cultural Heritage
Benjamin C.G. Lee
2023
DOI

Undergraduate Thesis

Probabilistic Cataloging of the Globular Cluster Messier 2:
Improved PSF Photometry of Crowded Stellar Fields
Benjamin C.G. Lee
April 7, 2017
DOI

Public Code Repositories

Newspaper Navigator
Benjamin C.G. Lee (2020)
GitHub (215 stars)

Paired Sequence File Comparison:
Fast Validation of FASTQ Files Containing Paired-end Reads
Benjamin C.G. Lee (2012)
SourceForge