EMIP: The Eye Movements in Programming Dataset
Bednarik, R.1, Busjahn, T.2, Gibaldi, A.3, Ahadi, A.4, Bielikova, M., Crosby, M.5, Essig, K.6, Fagerholm, F.7, Jbara, A.8, Lister, R.4, Orlov, P.9, Paterson, J.10, Sharif, B.11, Sirkia, T.7, Stelovsky, J.5, Tvarozek, J.12, Vrzakova, H.13, van der Linde, I.14
1 University of Eastern Finland, Finland
2 HTW Berlin, Germany
3 University of Genova, Italy
4 University of Technology Sydney, Australia
5 University of Hawai‘i at Manoa, USA
6 Rhine-Waal University of Applied Sciences, Germany
7 University of Helsinki, Finland
8 Augusta University, GA, USA
9 Imperial College London, UK
10 Glasgow Caledonian University, UK
11 University of Nebraska, Lincoln, USA
12 Slovak University of Technology in Bratislava, Slovakia
13 University of Colorado, Boulder, USA
14 Anglia Ruskin University, Cambridge, UK
We present a large dataset containing the eye movements of N=216 programmers of different experience levels, captured during two code comprehension tasks. Data are grouped in terms of programming expertise (from none to high) and other demographic descriptors. Data were collected through an international collaborative effort that involved eleven research teams across eight countries on four continents. The same eye tracking apparatus and software were used for the data collection. The Eye Movements in Programming (EMIP) dataset is freely available for download. The varied metadata in the EMIP dataset provides fertile ground for the analysis of gaze behavior and may be used to gain novel insights into code comprehension.
Cite: Bednarik, R., Busjahn, T., Gibaldi, A., Ahadi, A., Bielikova, M., Crosby, M., … & van der Linde, I. (2020). EMIP: The eye movements in programming dataset. Science of Computer Programming, 198, 102520. https://doi.org/10.1016/j.scico.2020.102520