Turing Data Stories

An open community creating and curating data stories

Project status

Ongoing

Introduction

A Turing Data Story is an interactive mix of narrative, code, and visuals that derives insight from real world open data. They are written as pedagogic Jupyter notebooks that aim to spark curiosity and motivate more people to play with data.

Our project lives in Github in this repo.

The stories are published online and you can check them out here.
 

Project aims

Our aim is to help people understand the data driven world around us. We want to inspire an open community around a central platform. One that encourages people to harness the potential of open data by creating 'data stories'.

These 'data stories' mix computer code, narrative, visuals and real world data to document an insightful result. They should be educational and relate to society in a way that people care about. They must maintain a high standard of openness and reproducibility and be approved by the community in a peer review process. The stories will develop data literacy and critical thinking in the general readership.

Recent updates

  

Esport predictions: Overwatch League

We study a single season of games in the professional esports league for Overwatch and try to predict game outcomes based on past performance. Published Oct 11, 2023

Unveiling London’s mobility patterns with Boris Bikes

Using Transport for London’s data on the Santander Bike Scheme to understand how Londoners move around the city. Published Jul 7, 2023.

Machine Learning for the 20th century - Artifact Classification From an Aviation Mystery

Can a United Kingdom database from 1987 classify a glass cosmetics jar that might have belonged to Amelia Earhart? Published Oct 14, 2022.

Desert Island Discs - Famous people and their musical tastes

As the iconic BBC radio programme turns 80, we explore notable people and the music that tells the stories of their lives. Published Mar 9, 2022.

Modelling COVID-19 Hospitalizations in England

Investigating the link between infections and hospitalizations using COVID-19 data in England for 2020-2021 and Bayesian inference. Published Nov 3, 2021.

You can find all other stories here

How to get involved

This project is ongoing and everyone is encouraged to help us build something that will be useful to many. There are several ways you can help:

Story ideas: Have an idea for an interesting story that could be told if you had the data, or knew how to analyse it? We can help.

Data: Stumbled across an interesting dataset, or perhaps mashed together several sources of data yourself? We want to hear about it.

Code: Are you an expert in Bayesian analysis? Do you have good data visualisation skills? Put that knowledge to work!

Peer review: Know a bit about data analysis? Good at communicating that knowledge? Interested in learning about how it can be applied to understanding society? We need reviews to make sure our stories are the best they can be.

Communication: Are you an amazing writer? Help us tell better stories.

Community: Don't fit in any of the above categories, but still want to hang out and be involved? We've got you, drop us a line.

Organisers

Researchers and collaborators

Dr Eirini Zormpa

Research Community Manager, AIM RSF Open Collaboration | Tools, Practices and Systems

Contact info

Camila Rangel Smith
[email protected]