The French Open Science Monitor
Steering the science based on open bibliographic databases
FrenchOpenScienceMonitor.esr.gouv.fr

February 3, 2024

FOSDEM

Anne L'Hôte

Data engineer & software craftswoman

French Open Science Monitor : Measure the evolution of open science in France licence

Objectives

National Plan for Open Science in 2018, the monitor has been designed as:

  • 👑 a sovereign and evolving tool for assessing the impacts of the open science public policy

  • 🔧 a strategic tool to refine and adjust open science public policies

  • 👩‍🔬 a lever for improving knowledge of French scientific production, beyond the Open Science aspects
French nation plan for Open Science

The monitor is a command for


  • monitoring and steering public policy
  • by taking into account bibliodiversity
  • sharing and openness to encourage transparency and reproducibility

  • 🧗 These constraints led us to choose the difficult path
    • at the methodological level, in particular for detection of affiliations and disciplinary fields
    • at the operational level, with a IT infrastructure adapted

Publications

Current situation

#1 Extract data

  • PubMed, Crossref, HAL
  • 🏗️ Automatic country detection (affiliation-matcher)

#2 Detect the country of affiliations

  • 🏗️ Automatic country detection (affiliation-matcher)

#3 Consolidate the opening status

  • Open access detection: Unpaywall
  • 🏗️ Classification of open access types

#3 Consolidate the disciplinary classification

  • Training data : Pascal and Francis, Field of Research (FoR)
  • 🏗️ Automatic classification models (fastText)

#4 Share the results

#4 Share the results

  • 200 local variations

Indicators

Indicators

Data, code and software

#1 Collect data

#2 Download PDFs, the open ones

#2 Download PDFs, the closed ones

#3 Consolidate metadata - Grobid

#3 Consolidate metadata - DataStet & Softcite

#4 Indicators

Indicators

Indicators

Indicators

... capitalizing on and complementing existing open sources

🏗️ Built at MESR within the FOSM framework
  • 🏛️ Affiliations metadata
    • PubMed, Crossref, HAL
    • 🏗️ Crawling web pages
    • 🏗️ Automatic country detection (affiliation-matcher)
  • 🔍 Characterization of open access
    • Open access detection: Unpaywall
    • 🏗️ Classification of open access types
  • 🤖 Thematic classification
    • Training data : Pascal and Francis, Field of Research (FoR)
    • 🏗️ Automatic classification models (fastText)

A modular approach ...


    dataesr github

Comparison with major international databases

Lauranne Chaignon, Daniel Egret; Identifying scientific publications countrywide and measuring their open access: The case of the French Open Science Monitor (FOSM). Quantitative Science Studies 2022; 3 (1): 18-36. doi: https://doi.org/10.1162/qss_a_00179
FOSM benchmark
  • "The open-source strategy used by the FOSM effectively identifies the vast majority of publications with a persistent identifier (DOI) for Open Science monitoring."

Sensitivity of open access rate measurement (1/3)

Lauranne Chaignon, Daniel Egret; Identifying scientific publications countrywide and measuring their open access: The case of the French Open Science Monitor (FOSM). Quantitative Science Studies 2022; 3 (1): 18-36. doi: https://doi.org/10.1162/qss_a_00179
FOSM Open Access rate
The OA rate varies according to the source, but the more diversified the source, the lower the sensitivity.

Sensitivity of open access rate measurement (2/3)

The OA rate varies according to the date of observation (which is rarely specified). FOSM observation dates

Sensitivity of open access rate measurement (3/3)

Numerous dimensions analyzed in the FOSM: discplines, publication type, languages, distribution platform ... FOSM scientific fields

Information-sharing services for a variety of uses

New for 2021: new website

bso web 1

New for 2021: analysis by observation date

bso web 2

New for 2021: health section

bso web 3

Next milestones

  • 🗄️ Integration of indicators about ORCID integration in France
  • 🧑‍🎓 Thesis
  • 📍 New functions for local variations of the FOSM (funders, HAL identifiers)

The Open outputs

What's new since 2018 ?

❔ Questions ?
📨 anne.lhote@recherche.gouv.fr
📨 bso@recherche.gouv.fr
🦣 https://mas.to/@annelhote