Davide M. Coluccia

Postdoctoral Researcher, Northwestern University

Digitized Universe of British Patents during the Second Industrial Revolution, 1853-1900

with Gaia Dossi

We digitize the universe of original patent documents issued in England, Wales, Scotland, and Ireland between 1853 and 1900. Through state-of-the-art Optical Character Recognition techniques, data analysis using large language models, and manual verification, we construct a machine-readable database covering over 300,000 patent documents. The data includes information on the patent title, technological classification, issue and filing dates, the name of the inventor, their profession, and their address of residence, which we geocode to precise coordinates. This dataset represents a novel and unique resource to study the Second Industrial Revolution in the United Kingdom.
 

Number of Patent Grants, 1853-1900

A Nation of Emigrants: The Italian Mass Migration to the US, 1892-1924. Individual-Level Data from Ellis Island

with Lorenzo Spadavecchia

Between 1890 and 1924 more than four million Italians emigrated to the United States. During this period, Italy was the single largest supplier of emigrants in the world. This dataset tracks Italians who arrived in the Ellis Island immigration station between 1892 and 1924. We collect individual-level information on 3.6 million immigrants, including immigration year, literacy status, municipality of origin, and age. In ongoing work, we are leveraging census linking algorithms to merge the resulting dataset with individual-level historical (US) census data. This will yield a novel tool to study a broad set of questions concerning one of the defining immigrant group in (historical) American society.
 

Italian Emigrants to the US, 1892-1924

© 2024 Davide M. Coluccia. Codes on Github.
##