Science is built on transparency, and reproducibility. Yet for much of its history, data and methods were locked away in lab notebooks or hidden behind paywalls. Open science asks: what if we did things differently? What if we shared data, code, and results as widely as possible?
In the last decade, open science has grown from a niche idea to a global movement. This lecture explores why open science matters and how it’s reshaping the way we do research.
This rapid sharing of data enabled the fastest vaccine development in history. Without open science, the timeline could have been years longer.
Open science is an umbrella term covering practices that make research more accessible and reusable:

These practices align with the FAIR principles: Findable, Accessible, Interoperable, Reusable.
Open science benefits different stakeholders in different ways:
1. The scientific community:
Reduces duplication of effort, increases reproducibility, fosters new collaborations.
Example: The Human Genome Project (1990–2003) released human DNA sequences freely online. Helped launch new industries in genomics and biotechnology, as researchers everywhere could access.
2. Society at large:
Public access to publicly-funded research, removes ‘paywalls’, builds public trust.
Example: West African Ebola outbreak (2014–2016), rapid sharing of viral genomes facilitated epidemiology and health authorities respond quickly.
3. You as a researcher:
Increases visibility and impact, establish priority on findings, demonstrates credibility.
Example: The R package ggplot2 (Hadley Wickham, 2007) is now used by tens of thousands of papers.
Case in point: In 2020, a company called Surgisphere claimed to hold vast COVID‑19 clinical datasets. Based on their private data, major studies on COVID treatments were published in The Lancet and NEJM. When no one could independently verify the data, the studies were retracted. The scandal undermined public trust at a critical moment. Open principles (data sharing, transparent review) could have flagged problems earlier.
Good open science begins with a plan.
Before collecting any data, researchers can preregister their ideas — writing down what questions they will ask and how they will test them.
This helps show which results were predicted in advance, and which were discovered along the way.
It’s also important to make a data plan: decide how data will be collected, stored safely, and shared later.
Planning early makes it easier to keep the project organised and trustworthy.
The data lifecycle
While the research is happening, the key is good record-keeping and transparency.
Scientists keep lab books (on paper or online) so that every step can be traced.
They often use open tools, like R or Python, so that others can see exactly how results were analysed.
By carefully recording methods and sharing clear protocols, researchers make it possible for others to repeat and check their work — the heart of reproducibility.
When the research is finished, open science means making the outputs available to everyone.
Data can be uploaded to public repositories (like Zenodo or Dryad).
Code can be shared on platforms like GitHub.
And results can be posted as preprints or in open-access journals, so anyone can read them without paywalls.
This final step ensures that the knowledge created is not locked away, but can be built on by other scientists—and by society.
As students who will be doing research projects, it’s also important that your supervisors have your data in a usable format, as they might use your project as the basis for a grant application or as part of publication—which benefits you, too.
Imagine a scientific world where:
This is the vision of open science. It requires a cultural shift toward collaboration, equity, and accelerated discovery.