Seminar by Roberto Di Cosmo on Software Heritage, Campinas, São Paulo, Brazil, 12 April 2024, on line

Software Heritage: a key infrastructure for Open Science and Reproducibility

Roberto Di Cosmo

Director, Software Heritage

Inria, Paris

Link –

12/04/2024, 14h (BR time), 9h (FR time)


In today’s digital research landscape, software permeates every discipline, from the humanities to the hard sciences. To maintain the fabric of knowledge, address the mandate of Open Science, and foster reproducibility of research, it is imperative to have dependable preservation and identification of all software artifacts relevant for research.

This talk will present in detail Software Heritage, a groundbreaking initiative committed to collecting, preserving, and sharing all publicly available software in source code form that has already collected more than 17 billion files from more than 270 million software origins.

Hands on demos will showcase Software Heritage’s unmatched role in addressing the essential needs of software artifact preservation and identification in all research realms. Attendees will discover the ease of archiving and referencing in just a few clicks, courtesy of dedicated APIs and tools, and see in action the SWHID (Software Hash Identifier) that Software Heritage provides for more than 35 billion artifacts today.


An alumnus of the Scuola Normale Superiore di Pisa, with a PhD in Computer Science from the University of Pisa, Roberto Di Cosmowas associate professor for almost a decade at Ecole Normale Supérieure in Paris. In 1999, he became a Computer Science full professor at University Paris Diderot, where he was head of doctoral studies for Computer Science from 2004 to 2009. President of the board of trustees and scientific advisory board of the IMDEA Software institute and chair of the Software chapter of the National Committee for Open Science in France, he is currently on leave at Inria.

His research activity spans theoretical computing, functional programming, parallel and distributed programming, the semantics of programming languages, type systems, rewriting and linear logic, and, more recently, the new scientific problems posed by the general adoption of Free Software, with a particular focus on static analysis of large software collections. He has published over 20 international journals articles and 50 international conference articles.

In 2008, he has created and coordinated the european research project Mancoosi, that had a budget of 4.4Me and brought together 10 partners to improve the quality of package-based open source software systems.

Following the evolution of our society under the impact of IT with great interest, he is a long term Free Software advocate, contributing to its adoption since 1998 with the best-seller Hijacking the world, seminars, articles and software. He created in October 2007 the Free Software thematic group of Systematic, that helped fund over 50 Open Source research and development collaborative projects for a consolidated budget of over 200Me. From 2010 to 2018, he was director of IRILL, a research structure dedicated to Free and Open Source Software quality.

He created in 2015, and now directs Software Heritage, an initiative to build the universal archive of all the source code publicly available, in partnership with UNESCO.

Comments are closed.