Skip to Main Content
HSRC

Work with data

Research data as an intellectual output

Data sets are collections of records or measurements used by researchers to undertake their research or provide an evidential record of their research (Based on http://www.beagrie.com/KRDS2_selectioncriteria.pdf)

What is data? What is it not?

Raw / microdata vs aggregated / summarised / macrodata

Microdata is data in which every record is at the unit of analysis level and all records must be added up to get the totals for each data item.

Aggregated data is a summary format of the the raw data that was gathered and expressed for statistical analysis.

Although data is an intellectual output of research, it is not the same as a research output (journal articles, books, client reports, etc.)    
Primary data (created for the first time and there is no previous source available, did not exist before) vs secondary data (readily available, previously collected data)

 What are its attributes?

  • Digital encoded
  • Heterogeneous
  • Contextual

A data set can have many lives

A data set can have different versions and live in various places

A data set has a unique identity and must be cited

A data set must be managed throughout the research process

The life of a data set

Guidelines to best practice