Data analytics is the process of transforming a raw dataset into useful knowledge. By drawing on new advances in artificial intelligence and machine learning, this project is aiming to develop systems that help to automate each stage of the data analytics process.
Explaining the science
Data analytics comprises many different stages and phases. While some elements of the data analytics process have benefited from considerable development through software or tools, there has been little methodological research into so-called data ‘wrangling’, even though this is often laborious and time-consuming, and accounts for up to 80% of a typical data science project.
Data wrangling includes understanding what data is available, integrating data from multiple sources, identifying missing, messy or anomalous data, and extracting features in order to prepare data for computer modelling.
Drawing on new advances in artificial intelligence and machine learning to produce technology that will help automate each stage of the data analytics process. This technology will revolutionise the speed and efficiency with which data can be transformed into useful knowledge.
The project has the potential to dramatically improve the productivity of working data scientists and benefit researchers, industry, and government.