Health Research Analysis Data

The UK Health Research Analyses Datasets

Executive Summary

The UKCRC promotes the use of the data collected for the UK Health Research Analyses. The processes by which data is collected and how the data is published are detailed in the 'Data Collection' sections for each report. The datasets used in the analyses are also made freely available and can be found at the bottom of this page.

Please note that we ask any re-use of the data to be fully acknowledged with formal citations in any publications that use this information. This helps us to better understand the value of these data to the research community and ensure that the UK Health Research Analyses continue to provide supportive information for research in the future


UK Health Research Analyses Datasets

The 2004/05 HRCS dataset (CSV format)

The 2009/10 HRCS dataset (CSV format)

Both 04/05 and 09/10 datasets (Excel spreadsheet)

The 2014 HRCS public dataset (Excel spreadsheet)


Use of the HRCS 2014 dataset

Principles of data use

The HRCS Report 2014 and associated datasets will use the principles defined in the 2009/10 report regarding data ownership and management. These are:

  • Data collected in the course of this work is owned by the organisations funding the research and are held in confidence by the MRC.
  • Further details of individual awards will not be circulated or published without agreement from the participating organisations.
  • Datasets are made available online to facilitate further independent analysis.
  • Any further analyses must be fully acknowledged and cited to the UKCRC.
  • Any further analyses abide by the HRCS conditions of use.

Acknowledging use of this data

As with all previous HRCS reports, these datasets are made available on the condition that the UKCRC is acknowledged in any publication using the citation “UK Health Research Analysis 2014 (UK Clinical Research Collaboration, 2015) ISBN 978-0-903730-20-4 http://www.hrcsonline.net/pages/uk-health-research-analysis-2014”.

The UK charity and public sector funding organisations that provided this data are interested in any work which makes use of the information. This feedback will be helpful in making the case to continue to compile data. You can let the UKCRC know that you are using the data by email via info@hrcsonline.net.

Dataset Criteria

There are a number of elements of funding that are essential to support research activity. We have made a distinction between grants focused on directly supporting specific research programmes and projects and funding that support more ‘indirect’ aspects such as infrastructure (which may include administration, building maintenance or support for national facilities). The 2014 analysis therefore focused on the directly funded peer reviewed UK research of the participating funders. The inclusion criteria for expenditure data match those of previous analyses:

  • Research is funded by a participating organisation
  • Research must take place within the UK
  • Research is of a health or biomedical relevance
  • The award must be active in the calendar year 2014
  • Research where funding can be directly attributed to a set of clearly defined research objectives and therefore can be classified by type of research activity and area of health or disease i.e. directly funded research, training awards and projects, plus clearly defined programme and unit awards.

Full details of the criteria for inclusion/exclusion, definitions of direct/indirect support can be found in the main 2014 report, pages 18-19.

Data submitted for the 2014 analysis was provided by 64 funding organisations, each with their own criteria for inclusion/exclusion and specific caveats relevant to each organisation. For full details of this process, including a list of participating organisations, please see the main 2014 report (pages 16-17 and Appendix 1, pages 64-71) and supplementary information #1.

Dataset Caveats

"This dataset contains the UKCRC Health Research Analysis 2014 data in two formats; single line and pivot-friendly multiple line.

The former is the original format for submission, with a single line for each individual award. The pivot-friendly version has been generated using the MS Access conversion software developed for the UKCRC report series, which splits awards according to the number of HRCS codes attributed to it.

Explanatory text for each data column for both datasets is provided on the next tab. Further details of this process can be found in the 2014 report and on the HRCS website (see links below)"

Data collection processes and the subsequent cleaning, de-duplication and quality control procedures are explained in detail in the main 2014 report (page 26 and Appendix 10, pages 96-99).

The public dataset contains the same direct award data used in the main analysis. However to allow full publication certain amendments and redactions have been required, primarily where sensitive or privileged information has been edited/removed. These have been applied as minimally as possible, but some grant codes, award titles and award abstracts have been changed or removed.

Links

UKCRC Health Research Analysis 2014

UK Health Research Analysis 2014 - Supplementary Information #1

Any questions about this data, its re-use and publication should be directed to:

info@hrcsonline.net


Use of the HRCS 2004/05 and 2009/10 datasets

The data from both 2004/05 and 2009/10 analyses are freely available for public use, and the UKCRC encourages further analysis of the HRCS datasets. If you use this data, please ensure that you reference the UKCRC appropriately see 'Data Collection Guidelines' sections for further details.

The 04/05 and 09/10 datasets contain information in the following columns:

HRCS ID
a unique identifier for each award assigned by the UKCRC
Funder
the name of the funding organisation
Research Activity Group
top level HRCS research activity code (one of 8 categories)
Research Activity Code
detailed HRCS research activity code (one of 48 sub-categories)
Health category
HRCS health category code (one of 21 categories)
N
proportion of the award categorised (awards can be split across 4-5 categories)
Amount_total
annualised spend allocated to the portion of an award
Institution
organisation administering the award
Town/Region
geographical information relating to the institution administering the award


Overall description of the datasets:

YearNumber of awardsTotal expenditure (£m)
2004/059901965
2009/10114751636

If you require further information on the 2009/10 data, we have a full summary of the inclusion and exclusion criteria available in MSWord document format.