Data cleaning

Target audience
All employees and students at UiT who work on projects that include working with research data, and who have a particular responsibility for the continuous management of data files.

45-minute presentation, including time for questions and comments.

Course calendar and sign-up
See event calendar at the UiT Research Data Portal.

Course material
PowerPoint presentation, updated April 2022.

About the course

Data cleaning, or data preparation, is an essential and time-consuming part of statistical analysis. Before you can analyse your data, you must make sure your spreadsheets are well organised, with valid entries and manageable variables. This lecture covers best practices for organising spreadsheets and the basics of cleaning data to make it “tidy”. Tidy data speeds up downstream analysis tasks, makes the data machine-readable, and helps make the research reproducible.
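
As a rough illustration of what "tidy" can mean in practice, the sketch below reshapes a small spreadsheet export so that each row holds one observation. It is not taken from the course material: the sample data, the column names, the list of missing-value codes, and the function name tidy are all assumptions made up for this example.

```python
import csv
import io

# Hypothetical messy spreadsheet export (invented for illustration):
# one column per year, stray whitespace, and ad hoc codes for missing values.
RAW = """site, 2020, 2021
 Narvik ,12.5,n/a
Alta,,14
"""

# Assumed set of strings that should be treated as "no value recorded".
MISSING = {"", "na", "n/a", "-"}


def tidy(raw_csv):
    """Return one dict per (site, year) observation with cleaned values."""
    reader = csv.reader(io.StringIO(raw_csv))
    # Strip whitespace from the header so " 2020" becomes "2020".
    header = [name.strip() for name in next(reader)]
    rows = []
    for record in reader:
        if not record:
            continue  # skip blank lines
        cells = [cell.strip() for cell in record]
        site = cells[0]
        # Turn each year column into its own row (wide layout to long layout).
        for year, value in zip(header[1:], cells[1:]):
            measurement = None if value.lower() in MISSING else float(value)
            rows.append({"site": site, "year": int(year), "value": measurement})
    return rows


tidy_rows = tidy(RAW)
for row in tidy_rows:
    print(row)
```

Each resulting row has the same three variables (site, year, value), and missing entries are represented in one consistent way, which is the kind of structure most analysis tools expect.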

We will evaluate and discuss example data with typical mistakes and demonstrate best practices. There will be time for questions and discussion.

The webinar builds on the module “How to structure and document research data”.