Course on good principles & practices in Data Management (https://tinyurl.com/cruk-myrd)
It has been said that 80% of the time spent on data analysis goes into cleaning and preparing the data. Not only does this represent a significant time investment for the data analyst, but it is also often a hurdle for non-specialists trying to get to grips with analysing their own data after attending an R or Python course. Despite the best intentions, a spreadsheet that is intuitive and easily understandable to human eyes can lead to disaster when it is processed computationally.
This workshop will go through the basic principles that we can all adopt in order to work with data more effectively and “think like a computer”. Moreover, we will discuss best practices for data management and organisation so that our research is auditable and reproducible by ourselves, and others, in the future. Part of the journey will be a critical evaluation of example Data Management Plans (often a condition of a grant).
As a researcher, you will encounter research data in many forms, ranging from measurements, numbers and images to documents and publications. Whether you create, receive or collect data, you will certainly need to organise it at some stage of your project. This workshop will provide an overview of some basic principles on how we can work with data more effectively.
Trainers.
Abigail Edwards (CRUK Cambridge Institute).
Ashley Sawle (CRUK Cambridge Institute).
Timetable

Time | Session |
---|---|
10:00 - 10:20 | Introduction, Data Management Plans (Ash) |
10:20 - 11:00 | Data formatting (Ash) |
11:00 - 11:10 | Break |
11:10 - 12:00 | OpenRefine practical (Abbi) |
12:00 - 12:15 | Spreadsheet validation practical (Abbi) |
12:15 - 13:00 | File management (Ash) |
13:00 - 13:45 | Lunch break |
13:45 - 14:15 | File management in DMP practical (Ash) |
14:15 - 15:00 | Data Sharing & Backup (Abbi) |
15:00 - 15:10 | Break |
15:10 - 15:45 | Data Sharing & Backup in DMP practical (Abbi) |
15:45 - 16:00 | Wrap-up & close |
Please fill in the feedback survey at the end of the course.
Drosophila BBSRC project.
Signalling pathways MRC project.
Bioinformatics software BBSRC project.
Pathways to violence & crime ESRC project.
scRNAseq analysis of neurons.
Useful checklist: A Data management plan checklist.