ResearchFlow Dataset - EKAW 2020
datasetposted on 14.08.2020 by Francesco Osborne, Angelo Salatino, Enrico Motta
Datasets usually provide raw data for analysis. This raw data often comes in spreadsheet form, but can be any collection of data, on which analysis can be performed.
This is the dataset associated with the paper:
Salatino et al. (2020) ResearchFlow: Understanding the Knowledge Flow between Academia and Industry. EKAW 2020.
The dataset contains two folders.
1- 'ResearchFlow_Dataset' contains the data generated from AIDA which describes the diachronic behaviour of 5K topics across 29 years (1990-2018). It includes multiple json files. Each file represents a topic and contains a json dictionary with four main keys: ‘papers-education’, ‘papers-company’, ‘patents-education’, ‘patents-company’. Each key is then associated to a list of 29 values corresponding to number of documents from 1990 to 2018.
2- 'ResearchFlow_Evaluation' contains the data produced for the evaluation of the paper.