Towards Gaze-contingent Head-mounted Displays

Authors: Niklas Hypki

In 1968, Ivan Sutherland and his students developed the first head-mounted display (HMD). This new see-through device made it possible to overlay rudimentary virtual objects such as triangles or cubes onto the user's surrounding environment (Sutherland 1968). The digital objects could change in perspective according to the user's head movements, which were tracked using a heavy mechanical system. It was the first prototype of an augmented reality display. Half a century later, commercially available virtual reality (VR) headsets render highly detailed virtual objects, and HMDs have become everyday devices for exploring virtual environments (VEs).

The key factors that have made this rapid development possible have been the astonishing technological advances in displays, sensors, motion detection algorithms, computer hardware and batteries, as well as an increasingly better understanding of our visual perception. By continuously adapting technological innovations to newly discovered perceptual principles, VEs can now be explored intuitively, using natural gestures and movements.

Today, we may have even reached a point where comparing the virtual with the real world can not only help us to create more realistic virtual experiences, but also expand our knowledge of how we perceive and behave in our environment in general.

The first part of this thesis evaluates the basic requirements that HMDs must meet to be used for stimulus presentation in psychophysical experiments. The introduction first presents the basic functions and technologies of current HMDs. It then provides a general overview of the anatomy of the eye and the different types of eye movements we use to perceive the world around us, which also shape our perception in VR. To accurately measure the effect of stimuli in psychophysical experiments, it is also important to know exactly which eye movements are performed at what point during the presentation. Therefore, a brief introduction to the various available eye tracking methods follows. Particular emphasis is placed on the latency requirements for the presentation of gaze-dependent stimuli, which are used in many paradigms of perception research. This is followed by study [I], in which we measured the latency of eye tracking in HMDs to assess whether gaze-contingent paradigms can be presented in VR. Next, an overview of how to determine three-dimensional fixation points and classify eye movements in VEs is provided.

In the following chapters, eye movements during various activities in VR are analysed. This initially serves to expand our knowledge of human perception, which could subsequently be used to improve VR applications. First, a scientific overview of typical eye movements during actions is provided. Here, particular emphasis is placed on eye movements that adapt to specific tasks, as well as eye movements during locomotion. Next is study [II], in which we recorded gaze, movement and orientation data in three typical locomotion tasks. We then used this data to predict future waypoints. Here, we were particularly interested in whether eye tracking data can improve prediction accuracy.

The following chapter deals with the extent to which our eye and head movements are guided by visual stimuli. It provides an introduction to the basic principles of visual search, the visual pathway and the visual cortex, and then explains how the shift of attention is possible at the neural level in the brain. In study [III], we recorded and analysed eye and head movements during visual search tasks in VR.

The final chapter concludes the thesis with a general discussion of our findings. It presents different approaches for extending our results and addresses the question of the extent to which our eye movements are typically controlled by bottom-up and top-down processes. Then, we discuss how the current limitations of VR as a method in psychophysics can be addressed in research. Finally, the main conclusions of the thesis are drawn.

Head-mounted Displays

To present virtual objects, an HMD obviously needs some sort of display. To achieve stereo vision, two separate two-dimensional projections of a simulated object or scene are usually displayed. All objects in both images are slightly offset based on the position of the eyes (Hibbard, Dam, and Scarfe 2020). The brain can then extract three-dimensional information from this offset, just as it does from the retinal projections of the real world (Scarfe and Glennerster 2019; Wheatstone 1838). The first HMD prototypes were equipped with two cathode ray tubes (CRTs) as screens that presented two different images, one for each eye.

Today, two types of screens are available in HMDs: liquid crystal displays (LCDs) and organic light-emitting diode (OLED) displays. Compared to their CRT predecessors, both technologies enable lighter and thinner VR headsets with screens that require less power and therefore also generate less heat. In both types of screen, the image is generated using RGB sub-pixels. In LCDs, a backlight source emits light, which is then polarised and passed through colour filters and a liquid crystal matrix (Chen et al. 2018). Depending on the voltage applied to the liquid crystal layer, the crystals align themselves and become more or less translucent. As a result, sub-pixels become darker or lighter. Due to the viscosity of the liquid crystals, it can take up to 10 ms for an LCD to change from black to white. In early LCDs, the backlight source was a cold cathode fluorescent lamp. In today's displays, an LED matrix is typically used, which consumes less power and enables splitting the display surface into multiple dimming zones. As a result, black areas of the image appear darker and the overall contrast of the image is increased.

OLED displays have self-emitting RGB sub-pixels arranged in an LED matrix and therefore require no backlight. This makes it possible to design even thinner and lighter displays, in which individual pixels are simply switched off to display black. It also lowers the overall power consumption and makes black-to-white response times of well under 1 ms possible. Yet current OLED displays have problems with brightness and stability (Liu et al. 2020): with continuous use, self-emitting LEDs become less efficient and their light output decreases. This can lead to burnt-in images, especially if monochrome red, green, or blue stimuli are displayed at the same position using the same pixels over a long period of time.

To delay LED wear, the maximum brightness of OLED displays is often reduced below the maximum brightness of the diodes using pulse-width modulation (PWM). During active PWM, the individual LEDs constantly switch between on- and off-states (Baker 2022). The fraction of each cycle spent in the on-state (the duty cycle) determines the image brightness: a 90% duty cycle creates a bright image, while a 10% duty cycle creates a dark image.
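The relationship between duty cycle and perceived brightness can be illustrated with a short calculation. The following sketch estimates the time-averaged luminance of a PWM-dimmed pixel from its peak luminance and duty cycle; the numerical values are hypothetical examples, not measurements of any specific display.

```python
# Minimal sketch: time-averaged luminance of a PWM-dimmed pixel.
# The values below are illustrative, not measurements of a specific HMD.

def average_luminance(peak_luminance_nits: float, duty_cycle: float) -> float:
    """Time-averaged luminance of a pixel driven with PWM.

    peak_luminance_nits: luminance during the on-state (cd/m^2)
    duty_cycle: fraction of each PWM period spent in the on-state (0..1)
    """
    if not 0.0 <= duty_cycle <= 1.0:
        raise ValueError("duty cycle must be between 0 and 1")
    return peak_luminance_nits * duty_cycle

peak = 200.0                      # hypothetical peak luminance in cd/m^2
for duty in (0.9, 0.5, 0.1):      # bright, medium and dark settings
    print(f"duty cycle {duty:.0%}: ~{average_luminance(peak, duty):.0f} cd/m^2")
```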

If the switching occurs at a frequency above the critical flicker fusion frequency of approximately 50 to 90 Hz, we generally do not perceive the image as flickering (Mankowska et al. 2021). Up to 500 Hz, however, we can still perceive flicker (with slight variations for different colours) when we are asked to detect it on modern displays (J. Davis, Hsieh, and Lee 2015). When PWM was first introduced in smartphone displays, frequencies of around 240 Hz were used, which some users found very unpleasant; it can even cause headaches, which might be related to migraineurs having slightly lower critical flicker fusion frequencies (Kowacs et al. 2005, 2004).

Interestingly, the off-states, which are also sometimes described as artificially added black frames, have another useful property for displaying moving images: when we track a moving rigid object, we follow it with a continuous eye movement. By briefly inserting a black frame between two images, each static image is visible for a shorter amount of time. Similar to the effect of a shorter exposure time in a camera, the perceived movement on a screen with inserted black frames appears sharper. This technique was also used in film projectors, which often used shutters with three blades at 24 frames per second to increase the flicker frequency to 72 Hz and achieve the same sharpening effect (Hammock 2015).

Unlike film projectors, LCDs and OLED displays do not automatically produce off-states between individual frames. Instead, the rendered content follows a ‘sample-and-hold’ characteristic (Marsh 2017). This means that a moving virtual object jumps forward frame by frame on the screen. Between jumps, the image remains static. This sample-and-hold representation does not reflect the continuous movement in the real world: each frame of the rendered content is only valid for a brief moment, and at any subsequent point in time, the objects in the scene have already moved to different locations. Sample-and-hold rendering presents another problem: when our eyes move across a static frame with a continuous eye movement, the content is perceived as blurry. Especially in VR, shifts of the whole field of view (FOV) caused by head movements would then result in significant motion blur. Therefore, inserting artificial black images instead of presenting one frame until the next update has become a frequently used technique in HMDs. In this context, the length of each individual on-state of a frame is also referred to as the persistence of the display. In LCDs, synchronised backlight flashing is used to insert black frames. In OLED displays, all LEDs can be turned on and off directly to achieve the same effect.
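How much a sample-and-hold display smears a moving image can be approximated from the retinal velocity and the persistence of each frame: the blur extent is roughly the angle the eye travels while a static frame stays lit. A rough sketch of this relationship, assuming a simple linear model and illustrative numbers:

```python
# Rough sketch: blur extent on a full-persistence vs. low-persistence display.
# Assumes the eye tracks the moving content perfectly, so each static frame
# smears across the retina for as long as it stays lit.

def blur_extent_deg(eye_velocity_deg_s: float, persistence_ms: float) -> float:
    """Approximate motion-blur width in degrees of visual angle."""
    return eye_velocity_deg_s * persistence_ms / 1000.0

refresh_hz = 90.0
full_persistence_ms = 1000.0 / refresh_hz   # frame held until the next update
low_persistence_ms = 2.0                    # hypothetical short on-state

for label, persistence in (("full persistence", full_persistence_ms),
                           ("low persistence", low_persistence_ms)):
    blur = blur_extent_deg(eye_velocity_deg_s=100.0, persistence_ms=persistence)
    print(f"{label}: ~{blur:.2f} deg of smear at 100 deg/s retinal motion")
```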

Lenses

To distribute the image to the eyes of the user, a lens collects the light from the display and maps it onto the FOV (Bang et al. 2021). For this purpose, three different types of lens designs are currently used in HMDs: Fresnel lenses, aspheric lenses and pancake lenses (see Figure 1). All three VR lens types have positive optical power and increase the angular size of the displayed image. They therefore make the close display look large and spread the image across the FOV.

Figure 1: HMD lens designs. All three lenses spread the image across the FOV. A Fresnel lens (a) consists of several rings with different optical powers. The transitions from one refractive step to the next result in a comparably small sweet spot. Aspherical lenses (b) are continuously ground and therefore thicker. In pancake lenses (c), the inner quarter-wave plate (orange) converts the linear polarisation of the light emerging from the display into circular polarisation so that it can first pass through the polarising beam splitter (black dotted line). The outer quarter-wave plate (dark green) reverses the polarisation upon entry and reverses it again upon exit (after reflection), ensuring that the light is correctly polarised to be reflected back and then transmitted to the eye. Overall, this makes the distance travelled by the light longer. This shortens the physical distance needed between eye and lens and enables more compact HMDs.

Fresnel lenses consist of many prism-like, ring-shaped sections. With each section and from the inside to the outside, the angle at which the emerging light is refracted increases (A. Davis and Kühnlenz 2007). Fresnel lenses can be made of plastic and are therefore comparatively light. However, with the designs currently in use, the edges of each lens section are clearly visible. In addition, a sharp image can only be seen when the user's eyes are directly opposite the flat surface in the centre of the lens. This makes the area through which an image can be perceived sharply (the sweet spot) comparably small. Moreover, light, especially from outer parts of the lens, can stray into other, darker areas of the image (Haitao et al. 2022). The resulting visual effect is sometimes referred to as a ghost image. Additionally, Fresnel lenses introduce distortion to the transmitted image, accompanied by chromatic aberrations, often resulting in blurring and colour fringes at high-contrast edges. To present an undistorted image, the displayed content is preprocessed by software (Anthes et al. 2016; Nikonorov et al. 2015).

Aspherical lenses have a continuously ground surface and can be made of plastic or glass, so the transmitted image shows no visible internal edges. Moreover, aspheric lenses have a larger sweet spot. On the one hand, this means the image does not become blurry when the HMD slips slightly on the user's head. On the other hand, distortions introduced by the lens still vary depending on the exact pupil position. Thus, such slippage and even the small positional shifts caused by an eye movement can lead to local distortions. These effects, also known as the pupil swim effect, sometimes cause people to feel uncomfortable. To compensate for them in software, very accurate and low-latency eye tracking is needed (Rui et al. 2023).

In pancake lenses, the incoming light is reflected back and forth once within the lens using a half-mirror effect. As the optical path is therefore longer than the physical distance, the lens can be positioned closer to the display, which enables overall smaller HMDs (Bang et al. 2021). A side effect of the half-mirror, however, is that part of the incoming light is reflected back towards the display, resulting in a darker outgoing image. Although this can be compensated for by increasing the brightness of the screen, this in turn has other side effects, such as greater wear and tear on the displays' light sources as well as higher power consumption and greater heat generation. Like aspherical lenses, pancake lenses have a large sweet spot and can also cause pupil swim effects (Jia et al. 2023). Stray light caused by multiple surface reflections and imperfect polarisation control within the lens can also create ghost images (Luo et al. 2024).

Movement Tracking in Virtual Reality

To enable a VE that can not only be displayed but also be explored using movements such as head turning and walking, a VR headset needs a range of sensors that measure the user's behaviour and transfer it into the virtual space. Modern HMDs are equipped with an inertial measurement unit (IMU) that usually includes an accelerometer, a gyroscope and a magnetometer to accurately track head rotation (Anthes et al. 2016).

The position can be tracked using external devices or cameras within the HMD, the latter being known as inside-out tracking (Niehorster, Li, and Lappe 2017; Anthes et al. 2016). Inside-out tracking is typically based on visual information from multiple front cameras on the outside of the headset (see Figure 2) that gather data on the current surroundings, and it can deliver even more accurate position tracking than common external tracking methods based on infrared lighthouses (Holzwarth et al. 2021). Both types of systems can also be used to track devices, such as controllers or body trackers that can be attached to joints such as knees or feet (Anthes et al. 2016). Using a multi-stage process to estimate hand pose and finger angles, inside-out tracking systems can also track the hands without additional devices in about 45 ms (Abdlkarim et al. 2024). This allows users to interact with virtual objects, for example by pointing and grasping to select and single out targets. Moreover, the camera-based tracking can be complemented with a depth sensor to achieve a more detailed map of the surrounding objects.

Figure 2: Components of an HMD. Cameras for see-through applications (red) and microphones (pink) are often mounted on the front of the enclosure (right). In addition, there are cameras on the bottom for hand and controller tracking (blue). Some models also have infrared LEDs for external tracking (green). Inside the headset are the main circuit board, an IMU for tracking head orientation (grey), a display and two lenses. Depending on the design, the display may have a backlight (as shown here) or not. A light sensor (purple) is often attached to the optical module to determine whether the HMD is currently in use. In addition, many models have two eye tracking cameras (orange) and infrared light-emitting diodes (brown) distributed around the lens, which emit infrared light towards the eyes and cause the reflections necessary for eye tracking. The head strap at the back of the HMD (left) often contains headphones (yellow) and batteries (cyan) and allows the display to be worn securely on the head.

In addition to a depth sensor, further sensors aimed at the user's face can be used to track subtle facial expressions. Together with live audio recordings from microphones, this information can be used to create ever more realistic virtual avatars that enable immersive conversations with other users. In addition, computer-generated three-dimensional avatars are now being developed that can even interact directly with the user via speech (Virk 2025).

In summary, today’s HMDs enable full body tracking while also being small and light. In addition to improved displays and lenses, all necessary sensors, computing and graphics processors can now be integrated into a single head-mounted device. As a result, VEs can now be rendered directly on the wearable device, enabling the exploration of VEs without restrictions.

Eye Movements

After an image is displayed on an HMD, it passes through the lens and reaches the user. When the eyelids are open, light enters through the cornea, the pupil and the lens of the eye (see Figure 3). The perceived brightness of the image is not determined by the display brightness alone: the diameter of the pupil also plays a role. When looking at bright light or increasingly brighter areas, the iris contracts, causing the pupil to narrow so that less light enters the eye (Ferree, Rand, and Harris 1933). In dim light, the pupil dilates to admit more light. Therefore, to a certain extent, our eyes adapt to the given input (which also means that a dark image can be perceived as brighter after an adaptation period).

To see objects at varying distances sharply, the cornea and lens focus the incoming light beam directly onto the retina through contraction of the ciliary eye muscles. With increasing age, the eye lens becomes less flexible (presbyopia), which makes it increasingly difficult to focus on near objects (Mordi and Ciuffreda 1998). In the case of myopia, distant objects always appear blurred because the light rays reflected from these objects are focused in front of the retina, even when the ciliary muscles are relaxed (Baird et al. 2020).


Figure 3: Anatomy of the eye. From right to left, light can travel through the tear fluid and the cornea into the anterior chamber of the eye. Through the pupil, whose size can be adjusted using the intraocular muscles, the light reaches the lens, which can change its focus using the ciliary muscles and the zonule. Ideally, it focuses the incoming light precisely on the retina at the back of the vitreous body. Light from the centre of our FOV falls on the fovea, which has the highest density of cone cells. The optic nerve, which transmits visual information to the brain, is located at the blind spot. This is why we do not perceive light that falls on this point. Extraocular muscles enable us to move our eyes, while the eyelid enables us to close them.

When the incoming photons reach the retina, around 92 million rod and approximately 4.6 million cone photoreceptors convert the incident light into neural signals, which are then propagated through the visual system (Curcio et al. 1990). Human eyes have three types of cones and one type of rod: the different cone photoreceptors each respond optimally to certain wavelengths of light (red, blue and green), whereas the rods cannot distinguish colours but are much more sensitive to light and are therefore more suitable for night vision (Prasad and Galetta 2011).

The density of these cells varies across the FOV: the area with the highest density of cone cells (15,000 cells/degree²), and thus the highest possible visual acuity of 150 cycles per degree, is the fovea centralis in the centre of the visual field (Tuten and Harmening 2021; Miller et al. 1996). This means that we can resolve up to 150 pairs of black and white lines spread over 1° of visual angle in the centre of our FOV. Therefore, a minimum resolution of 300 pixels per degree of visual angle is required to display objects with foveal acuity. Currently available HMDs offer resolutions below 50 pixels per degree. However, this is still sufficient to display many small details: acuity defined as standard vision (20/20) on a Snellen chart corresponds to 30 cycles per degree, or 60 pixels per degree (Caltrider, Gupta, and Tripathy 2023). Towards the periphery, cone density and visual acuity decrease rapidly (Hirsch and Curcio 1989). Thus, we tend to bring objects of interest to the centre of our FOV by moving our eyes.
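The resolution requirements quoted above follow directly from the Nyquist criterion: rendering a grating of a given spatial frequency needs at least two pixels per cycle. A small sketch of this arithmetic, using the figures cited in the text:

```python
# Nyquist arithmetic relating visual acuity (cycles/deg) to the display
# resolution (pixels/deg) needed to render it. Figures follow the text.

def required_pixels_per_degree(acuity_cycles_per_deg: float) -> float:
    """Minimum pixel density to render a grating at the given acuity:
    one light and one dark pixel per cycle (Nyquist criterion)."""
    return 2.0 * acuity_cycles_per_deg

print(required_pixels_per_degree(150))  # foveal acuity limit -> 300 px/deg
print(required_pixels_per_degree(30))   # 20/20 Snellen acuity -> 60 px/deg
```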

Fixating Objects

While we inspect an object with foveal acuity, our eyes are in a comparatively still fixation state. In general, we fixate for about 90% of the total viewing time (A. T. Duchowski 2007). In free viewing, typical fixations last between 0.1 and 0.4 s (Gajewski et al. 2005), although fixation durations can vary greatly, from a few hundredths of a second up to, in certain real-world situations, several seconds (Benjamin W. Tatler et al. 2011; Hayhoe et al. 2003; Michael F. Land, Mennie, and Rusted 1999). In experiments with memory tasks, fixation durations depended on the time required to obtain the visual information necessary for the current action (Benjamin W. Tatler et al. 2011; Hayhoe et al. 2003; Michael F. Land, Mennie, and Rusted 1999; Hayhoe, Bensinger, and Ballard 1998; Droll et al. 2005).

When we fixate on an object in the far distance, the lines of sight of the left and right eye are almost parallel. When the fixated object comes closer, both eyes rotate continuously towards the centre of the FOV, i.e. in opposite directions towards each other. This movement is called vergence and is usually accompanied by an adjustment of the eye's lens by the ciliary muscles to keep the now closer object in focus. Both movements happen automatically at the same time through the accommodation-convergence reflex. While the visual offset between both eyes can be simulated by presenting stereo images, the screens of current HMDs display images at a fixed focal distance. This means that regardless of how far away a virtual object appears (which determines the convergence of the eyes), our eyes must always accommodate to the fixed distance of the screen. As a result, unlike in reality, the entire field of view appears to have the same depth of field. In VR, the usual relationship between accommodation and convergence, to which we are accustomed, is therefore disrupted. This phenomenon is called the vergence-accommodation conflict (VAC).
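The mismatch can be made concrete by comparing the vergence angle demanded by a virtual object with the fixed focal distance of the screen. A minimal geometric sketch, assuming an interpupillary distance of 63 mm and a hypothetical focal distance of 1.5 m (both values are illustrative, not specifications of a particular HMD):

```python
import math

# Minimal geometric sketch of the vergence-accommodation conflict (VAC).
# IPD and focal distance are illustrative assumptions.

IPD_M = 0.063                   # interpupillary distance in metres (assumed)
SCREEN_FOCAL_DISTANCE_M = 1.5   # fixed optical focal distance of the HMD (assumed)

def vergence_angle_deg(distance_m: float, ipd_m: float = IPD_M) -> float:
    """Angle between the two lines of sight when fixating at a given distance."""
    return math.degrees(2.0 * math.atan((ipd_m / 2.0) / distance_m))

def accommodation_dioptres(distance_m: float) -> float:
    """Accommodation demand (in dioptres) for focusing at a given distance."""
    return 1.0 / distance_m

for virtual_distance in (0.3, 1.5, 10.0):
    # Vergence follows the simulated object distance (via stereo disparity) ...
    vergence = vergence_angle_deg(virtual_distance)
    # ... but accommodation stays locked to the screen's fixed focal distance.
    conflict = abs(accommodation_dioptres(virtual_distance)
                   - accommodation_dioptres(SCREEN_FOCAL_DISTANCE_M))
    print(f"object at {virtual_distance:4.1f} m: vergence {vergence:5.2f} deg, "
          f"accommodation conflict {conflict:.2f} D")
```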

Fixational Eye Movements

Our eyeballs can rotate about three axes using six different muscles: the medial and lateral recti for lateral movements about the yaw axis, the superior and inferior recti for upward and downward movements about the pitch axis, and the superior and inferior obliques for torsional movements about the roll axis (Davson 1990). In addition, the eyes can translate in the eye socket, which happens when closing the eyelid (Davson 1990; Kirchner, Watson, and Lappe 2022).

When examining fixations carefully and with very high spatial and temporal resolution, it becomes clear that the eyes are actually moving on a very small scale during fixations (Alexander and Martinez-Conde 2019). Fixational eye movements contribute to fixation stability and allow us to inspect different small details within an object with maximum foveal acuity (Poletti, Listorti, and Rucci 2013; Rolfs 2009). These movements are categorised into three types: tremor, ocular drift and microsaccades. Tremor is the smallest type of movement, with amplitudes of around 4/1000° and frequencies that normally range between 30 and 100 Hz (Alexander and Martinez-Conde 2019; Ditchburn and Ginsborg 1953; Martinez-Conde, Macknik, and Hubel 2004). Tremors are likely caused by the firing of motor neurons or the balancing process of the antagonistic eye muscles (Alexander and Martinez-Conde 2019; Spauschus et al. 1999; Ezenman, Hallett, and Frecker 1985). Ocular drift moves the eyes minimally and slowly in frequently changing directions (Rucci and Poletti 2015). Its amplitude is usually less than 45 arcmin and its speed is typically around 50 arcmin/s.

Microsaccades are involuntary, short, jagged eye movements that occur once or twice a second during fixation (Martinez-Conde, Otero-Millan, and Macknik 2013; Rolfs 2009). Some definitions of microsaccades include eye movements with an amplitude up to 1° or even 2° (Rolfs 2009). However, the term originally referred to movements with an amplitude of less than 30 arcmin that stabilise an image within the foveola, a tiny area of the retina where cones are the most densely packed (Rucci and Poletti 2015).

Saccades

To move our gaze from one fixation point to the next, we use saccades with much higher amplitudes. These saccades often reach peak speeds of over 300°/s, making them the fastest eye movements we can perform. Our most common eye movement pattern can be described as a saccade and fixation strategy, in which we regularly switch between looking at one object and making a quick, jerky eye movement to another location (Michael F. Land, Mennie, and Rusted 1999).

In general, saccade amplitude can vary widely depending on the ongoing situation (Michael F. Land and Tatler 2009): when reading, small saccades are sufficient, whereas when preparing a meal, most fixations fall on task-relevant objects. Thus, the amplitudes of saccades depend on the arrangement of the environment (Schütz, Braun, and Gegenfurtner 2011; Michael F. Land, Mennie, and Rusted 1999; Hayhoe et al. 2003). Even objects at an eccentricity of 50° can normally be reached within one eye movement (Michael F. Land 2009; Benjamin W. Tatler et al. 2011). Up to a certain amplitude range, the peak velocity increases linearly with increasing amplitude, a relationship also referred to as the saccadic main sequence (Smit, Van Gisbergen, and Cools 1987; Gibaldi and Sabatini 2021; Bahill, Clark, and Stark 1975). Beyond this range, peak velocity saturates.
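The amplitude-velocity relationship can be summarised with a commonly used exponential parameterisation of the main sequence, in which peak velocity rises roughly linearly for small amplitudes and saturates for large ones. A sketch with illustrative parameter values (the constants below are assumptions, not fitted data):

```python
import math

# Sketch of the saccadic main sequence using a common exponential
# parameterisation: peak velocity rises with amplitude and saturates for
# large saccades. V_MAX and C are illustrative values, not fitted constants.

V_MAX = 500.0  # asymptotic peak velocity in deg/s (assumed)
C = 14.0       # amplitude constant in deg (assumed)

def peak_velocity_deg_s(amplitude_deg: float) -> float:
    """Predicted peak velocity for a saccade of the given amplitude."""
    return V_MAX * (1.0 - math.exp(-amplitude_deg / C))

for amplitude in (1, 5, 10, 20, 40):
    print(f"{amplitude:3d} deg saccade -> ~{peak_velocity_deg_s(amplitude):5.0f} deg/s peak velocity")
```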

Saccadic targeting is not perfectly accurate. When looking closely at the landing points of saccades directed at small peripheral objects, it can often be observed that saccades slightly undershoot their target location. Typically, this is then corrected by a second short saccade, so that the fovea ends up focused precisely on the new fixation object (Benjamin W. Tatler et al. 2011).

Saccadic Suppression

During saccades, the retinal image is moving. Surprisingly, however, our visual perception is never interrupted by smeared phases during saccades (unlike the footage of a camera panned with a slow shutter speed). Instead, we only perceive a continuous, clear stream of the objects we fixate. While a saccade takes place, visual input is suppressed (Ditchburn 1955; Wallach and Lewis 1966). Thus, a light flash that is only presented during a saccade cannot be seen (Bridgeman, Hendry, and Stark 1975; Latour 1962; B. L. Zuber and Stark 1966). The pupil also does not react to intrasaccadic brightness changes (B. L. Zuber, Stark, and Lorber 1966). Interestingly, saccadic suppression usually begins several tens of milliseconds before saccade onset, reaches its maximum right at the start of the saccade and disappears gradually after the saccade ends (Volkmann 1986; Diamond, Ross, and Morrone 2000). This time course changes depending on various factors, such as the luminance characteristics of the background and of the presented stimuli, the current light adaptation of the eye, other characteristics of the stimulus, the position of the stimulus on the retina and the amplitude of the saccade (Volkmann 1986; Stevenson et al. 1986).

In voluntary saccades, saccadic suppression is stronger than in reactive saccades (Gremmler and Lappe 2017). Nevertheless, in most situations, we are able to recognise a previously perceived object after a voluntary saccade, even though it has then shifted to a different position in the FOV. This allows us, for example, to select one of many apples on a tree, move our gaze, and immediately recognise the same apple again, even though it is now in a completely different position relative to the retina.

Thus, our mental representation of our environment is not tied to the object’s absolute position in space or to the available visual information during the movement (Irwin 1991). Instead, it seems we build our mental representation on assumptions that are constructed from visual information after the saccade:

For example, we do not notice when an object is displaced during a saccade if the displacement is less than a third of the amplitude of the saccade (Bridgeman, Hendry, and Stark 1975). Interestingly, the same effect can be observed when the target is already displaced just before the saccade is initiated.

In other words, if a fitting postsaccadic target is found just after the saccade at a location not too far away from the object’s previous location, we assume that our environment is stationary (Deubel, Schneider, and Bridgeman 1996).

If no suitable target is found in the required spatio-temporal window, we seem to recalibrate the mental representation of our environment based on other information such as extraretinal signals (Deubel, Schneider, and Bridgeman 1996).

This recalibration leads to effects such as perceived spatial compression (Burr et al. 2010). This was the case in an experiment in which participants first saw a bar and a ruler. During a saccade, the bar was moved while the ruler disappeared. After the saccade, the ruler reappeared in the same place. The participants perceived the bar as immobile and assumed that the ruler had been moved after the saccade (Matsumiya and Uchikawa 2003).

In general, the compression effect occurs if visual references are available immediately after, rather than before or during, the saccade (Lappe, Awater, and Krekelberg 2000). It therefore seems to be mainly based on visual information that is available after the saccade.

A side effect of spatial compression can be that the number of elements perceived in the environment is reduced (Morrone, Ross, and Burr 2005; Ross, Morrone, and Burr 1997). This is because their perceived positions overlap due to the compression. Furthermore, the compression effect is not only limited to space, but can also influence time perception: During a saccade, the perception of time intervals can also be compressed. This can shorten the perceived duration by about half and even reverse the perceived temporal sequence of successive stimuli (Morrone, Ross, and Burr 2005; Burr et al. 2010).

Another obvious example of suppressed visual perception is blinking. During blinking, the eyelid closes, and the eyeballs rotate downwards and translate slightly towards the skull (Kirchner, Watson, and Lappe 2022). Although blinking, like saccades, occurs regularly and is easy to observe from the outside, we do not normally perceive the interruptions to our stream of visual perception. Instead, it again seems as if our visual system skips parts of our input. Suppression of visual input during blinking appears to be associated with an inhibitory signal from the brain triggered by efferent discharge during eyelid closure (Volkmann et al. 1982). After 100-150 ms, the eye usually returns to its previous position and the eyelid opens again (Volkmann 1986). The recovery from the suppression of visual input during the blink occurs gradually over a period of 100-200 ms after the onset of the blink and is therefore somewhat faster than for saccades (Volkmann 1986).

Interestingly, there does not appear to be a hard-wired physiological mechanism that ensures visual stimuli are suppressed while we keep our eyes closed, as we can still perceive visual stimuli even with our eyelids shut. Instead, suppression seems to be mainly related to the closing of the lid. Although most light is blocked when the eyes are closed, it is still possible to detect differences in brightness (Bierman, Figueiro, and Rea 2011; Sakai 2023). This works especially well for light sources with long wavelengths and can easily be tested outdoors by moving the head towards or away from the sun with the eyes closed.

In general, blinks can be clustered into different subtypes. During reflexive blinking, the eyelid is closed to protect the eye or to moisten it with tears (Evinger 1995). This blink reflex is, for example, triggered when air is blown into the eye or an object moves quickly towards the face. Typically, we make 10–15 spontaneous blinks per minute, which is more than necessary just to prevent a dry cornea. Thus, we seem to also blink in situations without the need to protect the eyes. Moreover, our blink rate varies with different tasks (Karson et al. 1981; Doughty 2001). A consciously performed closing of one or both eyelids is called a voluntary blink or wink. While we blink, we can also perform other eye movements. For example, it is possible to perform saccades during a blink (Kirchner et al. 2022; Rottach et al. 1998). Although the kinematics of these saccades are significantly altered, their parallel execution could be useful in reducing the time during which only very limited visual input is available.

Gaze

To look at objects more closely, we move not only our eyes but also our head. In this thesis, combined eye-head movements will be referred to as gaze movements.

Horizontally, eye movement amplitudes can reach values around 50°, whereas head rotation amplitudes can even reach 175° (Wu and Tsotsos 2025). Thus, gaze shifts are useful for reaching objects beyond our FOV (Thier and Ilg 2005; Foulsham 2015; Einhäuser et al. 2007).

In this case, a coordinated head movement extends the saccade amplitude. However, a large proportion of eye and head movements are performed in opposite directions (Einhäuser et al. 2007; Wu and Tsotsos 2025).

This is, for example, the case when we fixate a stationary object while moving our head. Vestibular signals that encode head movements are then transmitted to the eye movement control circuits (Cullen and Roy 2004).

This signal is the basis for the vestibulo-ocular reflex (VOR), through which our head movements are compensated by counter-rotating eye movements with low latency, so that a more stable retinal image is generated (Fetter 2007). However, the image cannot be perfectly stabilised across the entire FOV, because the eye position also shifts due to head movement, as the eyes are not positioned at the centre of the head's axis of rotation (Harris 1994). Therefore, this translational shift must either be ignored or stabilisation must be optimised for a specific object of interest.

To coordinate gaze movements accurately, the neural circuits for head and eye movement are linked closely: A case report on a patient with eye muscle paralysis even showed that when eye movements are not possible, saccade-like head movements are performed instead (Gilchrist, Brown, and Findlay 1997).

During head movements, the speed of eye drift increases threefold, suggesting that drift also helps to compensate for head movements (Rucci and Poletti 2015; Skavenski et al. 1979; Aytekin, Victor, and Rucci 2014). The properties of the microsaccades, however, do not change, so it appears that they capture fine spatial details of objects independently of head movements (Jolly et al. 2023).

Gaze Stabilisation

When we try to fixate objects that are moving through the environment, we move our eyes to compensate for their motion.

Such ocular following responses are evoked with very short latencies by sudden visible motion (Miladinović et al. 2024). The subsequent continuous slow eye movement that tracks a moving target is called smooth pursuit (Haarmeier and Thier 1999; Haarmeier et al. 1997; Drewes, Feder, and Einhäuser 2021). To reach a moving target, our eyes can first adjust their velocity to the object's speed and direction; a saccade can then bring the eyes to the target's position while the smooth pursuit continues (Lisberger 2015; Rashbass 1961).

During pursuit, the eyes can smoothly compensate for target speeds of up to about 30°/s. At higher speeds, smooth pursuit movements are usually accompanied by catch-up saccades in the direction of motion.
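For analysing recorded gaze data, the velocity ranges mentioned above are often used as a first heuristic to separate fixation, pursuit and saccade samples. Below is a simplified velocity-threshold sketch; the thresholds are illustrative assumptions, and real classification pipelines use more robust algorithms and tuned parameters.

```python
# Simplified velocity-threshold heuristic for labelling gaze samples.
# Thresholds are assumptions for illustration only.

SACCADE_THRESHOLD_DEG_S = 100.0   # well above typical pursuit velocities (assumed)
PURSUIT_THRESHOLD_DEG_S = 2.0     # above drift, below the pursuit range (assumed)

def label_sample(angular_velocity_deg_s: float) -> str:
    """Label a single gaze sample by its angular velocity."""
    if angular_velocity_deg_s >= SACCADE_THRESHOLD_DEG_S:
        return "saccade"
    if angular_velocity_deg_s >= PURSUIT_THRESHOLD_DEG_S:
        return "smooth pursuit"
    return "fixation"

velocities = [0.5, 12.0, 28.0, 350.0, 15.0]  # hypothetical samples in deg/s
print([label_sample(v) for v in velocities])
```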

Interestingly, tracking an object using pursuit eye movements requires a centrally derived motion percept of the target (Steinbach 1976). This percept can be based on a visual signal of the object's motion, such as the visible edge of the tracked object. We can also follow acoustic or other sensory signals, such as when our eyes follow our own hand movements or a sound in the dark (Hashiba et al. 1996; Berryhill, Chiu, and Hughes 2006; Rashbass 1961; Lisberger 2015). If perception of the object's movement fails, we can no longer follow it smoothly with our eyes. Instead, the object is tracked through a series of saccades and smooth pursuit phases in an attempt to follow it, while the target appears to jump in space (Koerfer, Watson, and Lappe 2024).

When we perceive motion anywhere across our FOV, for example when looking out of the window of a train, a similar reflex called optokinetic nystagmus is triggered. Initially, we again tend to initiate a smooth eye movement whose dynamics are similar to pursuit (Magnusson et al. 1986). Then, however, our eyes make a fast movement in the direction opposite to the visible motion, after which the cycle restarts and another smooth movement in the direction of motion begins (Drewes, Feder, and Einhäuser 2021). As a result, we perceive objects clearly, although everything in our FOV is in motion.

Eye Tracking

Eye movements are an essential part of our visual system. When we observe another person’s eyes, we can easily detect whether a saccade or blink is occurring. Although we intuitively have a rough idea of where the other person is looking, it is difficult to analyse the rapidly changing movements more precisely based on manual observation alone. Consequently, the first attempts at automated, objective measurement of eye movements were made over a century ago and have been supplemented over the decades by numerous further developments and new methods of eye and gaze tracking (Hutton 2019).

Initially, both Huey and Delabarre used a mechanical device consisting of a contact lens with rods attached to it, which marked the ongoing eye movements on a kymograph cylinder (Hutton 2019; Huey 1898; Delabarre 1898). With this method, slow eye movements could be recorded, although the weight of the measurement equipment probably added noticeable strain on the eye muscles and therefore influenced the measured eye movements to some extent. The weight could be reduced further when Robinson introduced contact lenses with wire coils inside (Robinson 1963). Their movement could be measured using an electromagnetic field surrounding the head of the subject. To this day, this method offers a comparatively high degree of precision, of approximately 5 arc seconds over a limited range of approximately 5° (Young and Sheena 1975a).

Although wearing comfort has improved significantly and today’s lenses no longer require anaesthetic drugs such as cocaine to be applied to the cornea before insertion, wearing a comparatively thick lens can still be unpleasant for the subject and insertion requires practice (A. T. Duchowski 2007).

A less invasive approach is electrooculography (EOG). This eye tracking method makes use of the standing electric potential between the anterior positive and posterior negative pole of the eye. Using electrodes placed around the eye, one can determine the eyeball orientation with respect to the head, although the method lacks some spatial accuracy, especially on the vertical axis (A. T. Duchowski 2007). The data can be collected at very high sampling rates. Unfortunately, the signal often drifts over short periods of time, since the electric potential of the pigmented epithelium also changes with the amount of light that enters the eye. Therefore, although accurately measuring gaze direction with EOG requires very frequent recalibration, the method is well suited to detecting saccades and blinks with low latency (Bolte and Lappe 2015; Kirchner et al. 2022).

The first predecessor of today's most common eye tracking method was the photochronograph by Dodge and Cline (Dodge and Cline 1901; Hutton 2019). Instead of attaching measuring equipment directly onto or close to the eye, this method measured movements remotely. First, a light source was directed at the user's eyes. Since eyes are not perfectly spherical, the reflection of light on the cornea changes depending on the direction of gaze. The reflected light was then recorded on a film plate that was later analysed. Therefore, this method was only suitable for visualising the accumulated eye movements over a certain period of time. In revised devices, horizontal and vertical components of the eye reflection could be separated and recorded with higher temporal resolution on photographic tape or film (Judd, McAllister, and Steele 1905; Buswell 1935; Macele and Mueggenburg 2024).

Later, Yarbus (1967) developed contact lenses that were attached to the eye with suction cups and to which either eye-fixed visual stimuli or tiny mirrors were attached, which also made it possible to record eye movements on film (Hutton 2019; Benjamin W. Tatler et al. 2010).

Eventually, the approach of Yarbus (1967) led to a number of different remote eye trackers, all of which used different types of light reflections from the eye to determine the direction of gaze. To allow the subject to still see a stimulus while using such an eye tracker, the remote light source was soon replaced by infrared light. Infrared limbal eye trackers exploit the fact that the white sclera of the eye reflects more light than the iris, using measurements from a high-frequency infrared sensor as a signal for the direction of gaze on the horizontal or vertical axis (Hutton 2019).

On closer inspection, not just one but four visible reflections of incoming light (also known as Purkinje reflections) can be observed in the eye: these come from the anterior and posterior surfaces of the cornea and of the lens of the eye. Dual Purkinje eye trackers measure the disparity between the first reflection from the cornea and the fourth reflection from the eye lens (Cornsweet and Crane 1973; Richardson and Spivey 2008; Hutton 2019). In the first Purkinje eye trackers, this measurement was done mechanically: adjustable mirrors were aligned in such a way that the two reflections overlapped. This calibration procedure allowed accurate (up to 1 min of arc) and fast eye direction estimation and was therefore suited to measuring microsaccades if the head was stabilised carefully.

As soon as camera equipment enabled real-time video recordings of eye movements, video-oculography (VOG) became the most popular method of estimating gaze direction based on light reflections of the eye (Richardson and Spivey 2008; Young and Sheena 1975b; Merchant, Morrissette, and Porterfield 1974). The most common technique in VOG is dark pupil tracking. As in limbal eye trackers, this approach makes use of the fact that different features of the eye reflect different amounts of light. In this case, since the pupil reflects even less light than the iris, the pupil can be detected as a small dark ellipse in each frame of the recording. This ellipse can then be compared with the first corneal reflection, which is induced using an infrared emitter. The distance between the two can be calibrated to estimate the gaze direction. In some devices, multiple infrared light sources and therefore multiple first-order Purkinje reflections are used (see Figure 4). Moreover, the dual Purkinje eye tracker has been further developed with the help of camera technology: instead of analogue mechanical parts, a digital dual Purkinje eye tracker uses an optical setup (with no moving components) and a digital imaging module (R.-J. Wu et al. 2023). Again, to track eye movements meticulously, the offset between the first and fourth Purkinje reflections is used, now obtained from a digital image.
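The calibration that maps the pupil-to-corneal-reflection vector to a gaze position is typically obtained by having the user fixate a set of known targets and fitting a regression to the collected samples. Below is a minimal sketch of one common approach, a second-order polynomial fit; the recorded vectors, target grid and values are made up for illustration.

```python
import numpy as np

# Minimal sketch of a polynomial pupil-to-corneal-reflection (P-CR) calibration,
# a common approach in video-based eye tracking. The "recorded" vectors and
# target positions below are made up for illustration.

def design_matrix(pcr: np.ndarray) -> np.ndarray:
    """Second-order polynomial features of the P-CR vector (x, y)."""
    x, y = pcr[:, 0], pcr[:, 1]
    return np.column_stack([np.ones_like(x), x, y, x * y, x**2, y**2])

# Hypothetical calibration data: P-CR vectors (pixels on the eye image)
# recorded while the user fixated nine known targets (degrees of the FOV).
pcr_vectors = np.array([[-8, -6], [0, -6], [8, -6],
                        [-8,  0], [0,  0], [8,  0],
                        [-8,  6], [0,  6], [8,  6]], dtype=float)
targets_deg = np.array([[-15, -10], [0, -10], [15, -10],
                        [-15,   0], [0,   0], [15,   0],
                        [-15,  10], [0,  10], [15,  10]], dtype=float)

# Fit one set of polynomial coefficients per gaze axis (least squares).
coeffs, *_ = np.linalg.lstsq(design_matrix(pcr_vectors), targets_deg, rcond=None)

def estimate_gaze_deg(pcr_vector) -> np.ndarray:
    """Map a single P-CR vector to an estimated gaze direction in degrees."""
    return design_matrix(np.atleast_2d(pcr_vector)) @ coeffs

print(estimate_gaze_deg([4.0, -3.0]))  # roughly [7.5, -5.0] deg for this toy data
```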

Figure 4: Video-based eye tracking. With every eye movement, the pupil changes its position slightly (left vs. right). By comparing the tracked pupil (red dotted circle) with the tracked corneal reflection of several infrared emitters (white dots), the pupil position can be mapped to positions throughout the FOV through calibration.

Thanks to technological advances in cameras, processing hardware and batteries, VOG devices can be so lightweight that they can be integrated into wearable devices to track eye movements in natural and virtual environments. However, apart from contact lenses with wire coils, the presented tracking methods only measure the eye movements relative to the head. To track the gaze relative to a person’s surroundings, the orientation and position of the head must also be taken into account. In the case of wearable eye trackers, this can be done using an outward-facing camera, from whose recordings of the FOV the tracked gaze direction can later be visualised. When researching human behaviour in VEs, the data from the position tracking system, the IMU and the eye tracker can be combined to obtain a gaze direction signal. This signal can be both recorded and also used to make the VE react to the user’s eye movements.
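Combining these signals amounts to expressing the eye-in-head direction in world coordinates using the tracked head pose: the gaze ray starts at the tracked head position and points along the eye direction rotated by the head orientation. A minimal sketch, assuming a quaternion in (w, x, y, z) order and illustrative values:

```python
import numpy as np

# Minimal sketch: gaze-in-world from the HMD's head pose and the eye tracker's
# eye-in-head direction. Quaternion convention is (w, x, y, z); all values are
# illustrative.

def rotate_vector(q: np.ndarray, v: np.ndarray) -> np.ndarray:
    """Rotate vector v by unit quaternion q = (w, x, y, z)."""
    w, xyz = q[0], q[1:]
    return v + 2.0 * np.cross(xyz, np.cross(xyz, v) + w * v)

# Head pose from the HMD tracking system (position in metres, unit quaternion).
head_position = np.array([0.0, 1.7, 0.0])          # standing user, eye height ~1.7 m
yaw = np.deg2rad(30.0)                              # head turned 30 deg about the vertical axis
head_orientation = np.array([np.cos(yaw / 2), 0.0, np.sin(yaw / 2), 0.0])

# Eye-in-head gaze direction from the eye tracker (unit vector, head coordinates).
gaze_in_head = np.array([0.0, -0.17, -0.98])        # looking slightly downwards
gaze_in_head /= np.linalg.norm(gaze_in_head)

# Gaze ray in world coordinates: origin at the head, direction rotated by head pose.
gaze_origin_world = head_position
gaze_direction_world = rotate_vector(head_orientation, gaze_in_head)
print("gaze origin:", gaze_origin_world, "direction:", np.round(gaze_direction_world, 3))
```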

Gaze-contingent Displays

Even before Sutherland built his first HMD prototype, he envisioned the capabilities of the ultimate display of the future (Sutherland 1965). Interestingly, despite the limited technology of the 1960s, his vision included a display that was capable of eye tracking. In his essay, Sutherland (1965) laid out an experiment in which the device would display a triangle whose edges would round off as soon as a user fixated them. He hoped that this approach would contribute to a better understanding of the mechanisms of human vision. To implement this idea, the eye tracking signal must be available in near real time so that visual stimuli can be updated in a timely manner. Such gaze-contingent displays (GCDs) were developed a few years later in the 1970s, initially for reading research (McConkie and Rayner 1975; Rayner 2014; Andrew T. Duchowski, Cournia, and Murphy 2004). However, these GCDs were not attached to the head, but worked with remote eye trackers and screens positioned on desks. Later, the same principle was applied again with the aim of improving system performance when displaying computationally intensive stimuli (Murphy and Duchowski 2002). In foveated rendering, for example, the level of detail or resolution of visual content in the periphery is reduced. It is therefore important that eye movements with a large amplitude in particular are detected quickly, so that the image can be updated accordingly, giving users a detailed, high-resolution image in the centre of their FOV (Patney et al. 2016; Andrew T. Duchowski, Cournia, and Murphy 2004; O’Sullivan, Dingliana, and Howlett 2003).
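The degradation in foveated rendering is typically a monotonic function of the angular distance between a pixel and the tracked gaze point. A minimal sketch of such an eccentricity-based level-of-detail rule; the zone boundaries and resolution factors are assumptions for illustration:

```python
import math

# Minimal sketch of an eccentricity-based degradation rule for foveated
# rendering. Zone boundaries and resolution scales are illustrative only.

def eccentricity_deg(pixel_dir, gaze_dir) -> float:
    """Angle between a pixel's viewing direction and the gaze direction (unit vectors)."""
    dot = sum(p * g for p, g in zip(pixel_dir, gaze_dir))
    return math.degrees(math.acos(max(-1.0, min(1.0, dot))))

def resolution_scale(ecc_deg: float) -> float:
    """Fraction of full resolution to render at the given eccentricity."""
    if ecc_deg < 5.0:     # foveal zone: full detail
        return 1.0
    if ecc_deg < 20.0:    # parafoveal zone: moderate reduction
        return 0.5
    return 0.25           # periphery: strongest reduction

gaze = (0.0, 0.0, -1.0)   # current gaze direction reported by the eye tracker
pixel = (math.sin(math.radians(25)), 0.0, -math.cos(math.radians(25)))  # pixel 25 deg off-gaze
print(resolution_scale(eccentricity_deg(pixel, gaze)))  # -> 0.25
```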

In addition, GCDs met the expectation that they could deepen our understanding of our visual system. They enabled experiments on the span of the perceptual window, which showed that we process up to 15 letters in reading and about 5° in visual search (Bertera and Rayner 2000; McConkie and Rayner 1975). With the help of GCDs, it became possible to measure how vision degrades from the fovea to the periphery. It was found that this degradation varies with image characteristics such as spatial and temporal resolution, colour, luminance and contrast (Andrew T. Duchowski, Cournia, and Murphy 2004; Loschky and Wolverton 2007). Further experiments with desktop-based GCDs showed that contrast sensitivity for natural stimuli is significantly reduced compared to static images (Dorr and Bex 2011), that gaze-contingent contrast influences colour perception (Mauderer, Flatla, and Nacenta 2016) and that high cognitive load leads to a loss of accuracy evenly distributed over the eccentricities of the retina (Gaspar et al. 2016).

GCDs were also used in simulations of retinal visual impairments, for example mimicking age-related macular degeneration (Geisler and Perry 2002; Vinnikov, Allison, and Swierad 2008). With the help of such applications, the compensation strategies of visually impaired people could be better understood and accessible environments could be evaluated more easily. In addition, personally experiencing impairment simulations could help to raise awareness and increase empathy among the public. Finally, gaze-contingent HMDs could compensate for the pupil swim effect and improve visible cues for the perception of depth in VR (Konrad, Angelopoulos, and Wetzstein 2020; Rui et al. 2023).

An ideal gaze-contingent HMD would have a refresh rate above the critical flicker fusion frequency and an eye tracking latency below the frame-to-frame interval. This would require very fast and expensive sensors that consume a lot of energy, heat up quickly and cannot be operated for long periods on battery power. However, because we use saccades to move our eyes from one fixation to another and vision is suppressed during and shortly after each saccade, it is possible to present gaze-contingent, world-fixed stimuli using hardware with higher end-to-end latencies.

The maximum possible latency of a GCD at which the user cannot see the adjustment of the stimuli also depends on the saccadic main sequence, because the phase of maximum suppression and the recovery of vision take longer for saccades with higher amplitudes (Volkmann 1986; Stevenson et al. 1986). Thus, a voluntary saccade with an amplitude of 8° results in suppression times of around 40 ms after saccade onset, whereas a voluntary saccade with an amplitude of 16° results in suppression times of around 80 ms after onset (Stevenson et al. 1986).

This, however, does not mean that larger saccades are always helpful when setting up a gaze-contingent experiment. For example, in foveated rendering, objects in the periphery are displayed at a lower resolution, which requires a degradation factor that adjusts the resolution of a stimulus based on its distance to the centre of the FOV. Since saccades with larger amplitudes land further away from the original fixation, gaze-dependent manipulations such as reduced resolution may become obvious to the user if the image is updated even shortly after the saccadic suppression period, whereas small saccades may still land in a higher-resolution area where the manipulation is less likely to be detected even if the visual content is updated too late (Loschky and Wolverton 2007). Therefore, the end-to-end latency from an eye movement to the change in display content, the expected saccade amplitude and the gradient of the manipulation must be carefully considered when setting up gaze-contingent manipulations that should not be perceived.
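Whether an update is likely to stay hidden can be estimated by comparing the end-to-end latency with the suppression window expected for the detected saccade's amplitude. A rough sketch, linearly interpolating between the two suppression durations cited above; this interpolation is an assumption for illustration, not a published model:

```python
# Rough sketch: does an end-to-end latency fall within the saccadic
# suppression window? The linear interpolation between the two data points
# cited in the text (8 deg -> ~40 ms, 16 deg -> ~80 ms) is an assumption.

def suppression_window_ms(amplitude_deg: float) -> float:
    """Approximate duration of reduced visibility after saccade onset."""
    return 40.0 + (amplitude_deg - 8.0) * (80.0 - 40.0) / (16.0 - 8.0)

def update_hidden(amplitude_deg: float, end_to_end_latency_ms: float) -> bool:
    """True if the display update would land within the suppression window."""
    return end_to_end_latency_ms <= suppression_window_ms(amplitude_deg)

for amplitude in (8.0, 12.0, 16.0):
    print(amplitude, "deg saccade:", update_hidden(amplitude, end_to_end_latency_ms=55.0))
```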

Requirements for Gaze-contingent HMDs

Since some currently available HMDs are equipped with eye trackers, it may be possible to use them as gaze-contingent HMDs. In contrast to desktop-based gaze-dependent setups, these could enable stimuli to be presented at specific positions in the FOV, independent of both eye movements and head movements, which would allow the presentation of gaze-contingent stimuli in VEs.

In order to assess whether this is possible, it is first necessary to estimate the minimum latency requirements for GCDs. When using a remote eye tracker and a desktop screen to create a gaze-contingent setup, no differences in eye movements were observed between an artificial latency of 5 ms and 15 ms (Loschky and McConkie 2000). At an added latency of 45 ms, however, Loschky and McConkie (2000) observed slightly longer fixation durations, although even then, the performance in a search task was not altered.

In a detection task of gaze-contingent peripheral blur, performance did not significantly decrease until a latency of 60 ms was reached (Loschky and Wolverton 2007). Although these observations do not yield a clear maximum latency for a GCD, they suggest that the required latency lies somewhere between 15 and 60 ms.

Displays in currently available HMDs have a frame rate of about 90 Hz (a frame-to-frame interval of 11.1 ms). However, to implement a VR-GCD, not only the frame-to-frame interval but the overall end-to-end latency from a gaze shift to an updated image on the HMD needs to be considered. Within that time frame, multiple processing steps must be completed. First, motion data from all available tracking sensors needs to be recorded. Then, the tracked signal needs to be used to estimate the current state of head and eyes with sufficient accuracy and precision. Next, the VE needs to be updated based on this estimate. Finally, the updated VE needs to be rendered into a stereo image that is then displayed on the HMD.
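The end-to-end latency can be thought of as the sum of these pipeline stages plus, in the worst case, one full frame of display delay. A simple budget sketch with hypothetical stage durations:

```python
# Simple end-to-end latency budget for a gaze-contingent HMD pipeline.
# The stage durations are hypothetical placeholders, not measured values.

pipeline_ms = {
    "sensor capture and eye image exposure": 8.0,
    "gaze estimation": 4.0,
    "virtual environment update": 2.0,
    "stereo rendering": 6.0,
    "display scan-out (one 90 Hz frame, worst case)": 1000.0 / 90.0,
}

total_ms = sum(pipeline_ms.values())
for stage, duration in pipeline_ms.items():
    print(f"{stage:<48s}{duration:6.1f} ms")
print(f"{'total end-to-end latency':<48s}{total_ms:6.1f} ms")
```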

The latency between a change in position and the corresponding change on the display can be determined externally by simultaneously recording the onset of the HMD's position change and the display with a high-speed camera. For head tracking, current HMDs have motion-to-photon latencies between 21 and 42 ms, which lies within the latency range required for GCDs (Warburton et al. 2023; Niehorster, Li, and Lappe 2017).
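With such an external high-speed recording, the latency follows from the number of camera frames between the onset of the physical movement and the first visible change on the display. A small sketch of this frame-counting arithmetic; the frame rate and indices are illustrative:

```python
# Frame-counting arithmetic for an external motion-to-photon measurement.
# Frame indices are illustrative; in practice they are read off the
# high-speed recording of the HMD and its display.

CAMERA_FPS = 1000.0           # high-speed camera frame rate (assumed)

motion_onset_frame = 1520      # first frame in which the HMD starts to move
display_change_frame = 1551    # first frame in which the displayed image changes

latency_ms = (display_change_frame - motion_onset_frame) / CAMERA_FPS * 1000.0
print(f"motion-to-photon latency: {latency_ms:.1f} ms")  # -> 31.0 ms
```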

Because the HMD occludes the eyes, a different method is required to measure the latency of VR eye trackers. We present this method in the following chapter.

The development of HMDs has come a long way since 1968. Prototypes have become commercially available products that are used for research. At the same time, by continuously innovating new eye tracking methods, we have increased our knowledge about our visual perception and how it is influenced by the way we move our eyes. In order to investigate the potential use of VR with eye tracking in psychophysical research, for example to present strictly controlled visual stimuli while simultaneously measuring head, eye and walking movements, we must be able to estimate the latency of the tracking sensors. In the following study, we address this need by measuring the time required by currently available HMDs to register an actual saccade, record the corresponding eye tracking data and finally display a change on the screen depending on the recorded data.

References

Abdlkarim, Diar, Massimiliano Di Luca, Poppy Aves, Mohamed Maaroufi, Sang-Hoon Yeo, R. Chris Miall, Peter Holland, and Joeseph M. Galea. 2024. “A Methodological Framework to Assess the Accuracy of Virtual Reality Hand-Tracking Systems: A Case Study with the Meta Quest 2.” Behavior Research Methods 56 (2): 1052–63. https://doi.org/10.3758/s13428-022-02051-8.
Alexander, Robert G., and Susana Martinez-Conde. 2019. “Fixational Eye Movements.” In Eye Movement Research: An Introduction to Its Scientific Foundations and Applications, edited by Christoph Klein and Ulrich Ettinger, 73–115. Cham: Springer International Publishing. https://doi.org/10.1007/978-3-030-20085-5_3.
Anthes, Christoph, Rubén Jesús García-Hernández, Markus Wiedemann, and Dieter Kranzlmüller. 2016. “State of the Art of Virtual Reality Technology.” In 2016 IEEE Aerospace Conference, 1–19. https://doi.org/10.1109/AERO.2016.7500674.
Aytekin, Murat, Jonathan D. Victor, and Michele Rucci. 2014. “The Visual Input to the Retina During Natural Head-Free Fixation.” Journal of Neuroscience 34 (38): 12701–15. https://doi.org/10.1523/JNEUROSCI.0229-14.2014.
Bahill, A. Terry, Michael R. Clark, and Lawrence Stark. 1975. “The Main Sequence, a Tool for Studying Human Eye Movements.” Mathematical Biosciences 24 (3): 191–204. https://doi.org/10.1016/0025-5564(75)90075-9.
Baird, Paul N., Seang-Mei Saw, Carla Lanca, Jeremy A. Guggenheim, Earl L. Smith III, Xiangtian Zhou, Kyoko Ohno-Matsui, et al. 2020. “Myopia.” Nature Reviews Disease Primers 6 (1): 99. https://doi.org/10.1038/s41572-020-00231-4.
Baker, Simon. 2022. “Pulse Width Modulation (PWM).” TFTCentral. https://www.tftcentral.co.uk. https://tftcentral.co.uk/articles/pulse_width_modulation.
Bang, Kiseung, Youngjin Jo, Minseok Chae, and Byoungho Lee. 2021. “Lenslet VR: Thin, Flat and Wide-FOV Virtual Reality Display Using Fresnel Lens and Lenslet Array.” IEEE Transactions on Visualization and Computer Graphics 27 (5): 2545–54. https://doi.org/10.1109/TVCG.2021.3067758.
Berryhill, Marian E., Tanya Chiu, and Howard C. Hughes. 2006. “Smooth Pursuit of Nonvisual Motion.” Journal of Neurophysiology 96 (1): 461–65. https://doi.org/10.1152/jn.00152.2006.
Bertera, James H., and Keith Rayner. 2000. “Eye Movements and the Span of the Effective Stimulus in Visual Search.” Perception & Psychophysics 62 (3): 576–85. https://doi.org/10.3758/BF03212109.
Bierman, Andrew, Mariana G. Figueiro, and Mark S. Rea. 2011. “Measuring and Predicting Eyelid Spectral Transmittance.” Journal of Biomedical Optics 16 (6): 067011. https://doi.org/10.1117/1.3593151.
Bolte, B., and M. Lappe. 2015. “Subliminal Reorientation and Reposition in Immersive Virtual Environments Using Saccadic Suppression.” IEEE Transactions on Visualization and Computer Graphics 21 (4): 545–52. https://doi.org/10.1109/TVCG.2015.2391851.
Bridgeman, Bruce, Derek Hendry, and Lawrence Stark. 1975. “Failure to Detect Displacement of the Visual World During Saccadic Eye Movements.” Vision Research 15 (6): 719–22. https://doi.org/10.1016/0042-6989(75)90290-4.
Burr, D. C., J. Ross, P. Binda, and M. C. Morrone. 2010. “Saccades Compress Space, Time and Number.” Trends in Cognitive Sciences 14 (12): 528–33. https://doi.org/10.1016/j.tics.2010.09.005.
Buswell, G. T. 1935. How People Look at Pictures: A Study of the Psychology and Perception in Art. Chicago: University of Chicago Press.
Caltrider, David, Abhishek Gupta, and Koushik Tripathy. 2023. Evaluation of Visual Acuity. StatPearls Publishing. http://europepmc.org/books/NBK564307.
Chen, Hai-Wei, Jiun-Haw Lee, Bo-Yen Lin, Stanley Chen, and Shin-Tson Wu. 2018. “Liquid Crystal Display and Organic Light-Emitting Diode Display: Present Status and Future Perspectives.” Light: Science & Applications 7 (3): 17168–68. https://doi.org/10.1038/lsa.2017.168.
Cornsweet, T. N., and H. D. Crane. 1973. “Accurate Two-Dimensional Eye Tracker Using First and Fourth Purkinje Images.” J. Opt. Soc. Am. 63 (8): 921–28. https://doi.org/10.1364/JOSA.63.000921.
Cullen, Kathleen E., and Jefferson E. Roy. 2004. “Signal Processing in the Vestibular System During Active Versus Passive Head Movements.” Journal of Neurophysiology 91 (5): 1919–33. https://doi.org/10.1152/jn.00988.2003.
Curcio, Christine A., Kenneth R. Sloan, Robert E. Kalina, and Anita E. Hendrickson. 1990. “Human Photoreceptor Topography.” Journal of Comparative Neurology 292 (4): 497–523. https://doi.org/10.1002/cne.902920402.
Davis, Arthur, and Frank Kühnlenz. 2007. “Optical Design Using Fresnel Lenses.” Optik & Photonik 2 (4): 52–55. https://doi.org/10.1002/opph.201190287.
Davis, James, Yi-Hsuan Hsieh, and Hung-Chi Lee. 2015. “Humans Perceive Flicker Artifacts at 500 Hz.” Scientific Reports 5 (1): 7861. https://doi.org/10.1038/srep07861.
Davson, Hugh. 1990. Physiology of the Eye. Bloomsbury Academic. https://doi.org/10.1007/978-1-349-09997-9.
Delabarre, E. B. 1898. “A Method of Recording Eye-Movements.” The American Journal of Psychology 9 (4): 572–74. http://www.jstor.org/stable/1412191.
Deubel, H., W. X. Schneider, and B. Bridgeman. 1996. “Postsaccadic Target Blanking Prevents Saccadic Suppression of Image Displacement.” Vision Research 36 (7): 985–96.
Diamond, Mark R., John Ross, and M. C. Morrone. 2000. “Extraretinal Control of Saccadic Suppression.” Journal of Neuroscience 20 (9): 3449–55. https://doi.org/10.1523/JNEUROSCI.20-09-03449.2000.
Ditchburn, R. W. 1955. “Eye-Movements in Relation to Retinal Action.” Optica Acta: International Journal of Optics 1 (4): 171–76. https://doi.org/10.1080/713818684.
Ditchburn, R. W., and B. L. Ginsborg. 1953. “Involuntary Eye Movements During Fixation.” The Journal of Physiology 119 (1): 1–17. https://doi.org/10.1113/jphysiol.1953.sp004824.
Dodge, Raymond, and Thomas Sparks Cline. 1901. “The Angle Velocity of Eye Movements.” Psychological Review 8 (2): 145–57. https://doi.org/10.1037/h0076100.
Dorr, Michael, and Peter J. Bex. 2011. “A Gaze-Contingent Display to Study Contrast Sensitivity Under Natural Viewing Conditions.” In Human Vision and Electronic Imaging XVI, edited by Bernice E. Rogowitz and Thrasyvoulos N. Pappas, 7865:78650Y. International Society for Optics and Photonics; SPIE. https://doi.org/10.1117/12.872502.
Doughty, Michael J. 2001. “Consideration of Three Types of Spontaneous Eyeblink Activity in Normal Humans: During Reading and Video Display Terminal Use, in Primary Gaze, and While in Conversation.” Optometry and Vision Science 78 (10). https://journals.lww.com/optvissci/fulltext/2001/10000/consideration_of_three_types_of_spontaneous.11.aspx.
Drewes, Jan, Sascha Feder, and Wolfgang Einhäuser. 2021. “Gaze During Locomotion in Virtual Reality and the Real World.” Frontiers in Neuroscience 15. https://doi.org/10.3389/fnins.2021.656913.
Droll, Jason A., Mary M. Hayhoe, Jochen Triesch, and Brian T. Sullivan. 2005. “Task Demands Control Acquisition and Storage of Visual Information.” Journal of Experimental Psychology: Human Perception and Performance 31 (6): 1416–38. https://doi.org/10.1037/0096-1523.31.6.1416.
Duchowski, A. T. 2007. Eye Tracking Methodology. Springer London. https://doi.org/10.1007/978-1-84628-609-4.
Duchowski, Andrew T., Nathan Cournia, and Hunter Murphy. 2004. “Gaze-Contingent Displays: A Review.” CyberPsychology & Behavior 7 (6): 621–34. https://doi.org/10.1089/cpb.2004.7.621.
Einhäuser, Wolfgang, Frank Schumann, Stanislavs Bardins, Klaus Bartl, Guido Böning, Erich Schneider, and Peter König. 2007. “Human Eye-Head Co-Ordination in Natural Exploration.” Network: Computation in Neural Systems 18 (3): 267–97. https://doi.org/10.1080/09548980701671094.
Evinger, C. 1995. “A Brain Stem Reflex in the Blink of an Eye.” Physiology 10 (4): 147–53. https://doi.org/10.1152/physiologyonline.1995.10.4.147.
Eizenman, M., P. E. Hallett, and R. C. Frecker. 1985. “Power Spectra for Ocular Drift and Tremor.” Vision Research 25 (11): 1635–40. https://doi.org/10.1016/0042-6989(85)90134-8.
Ferree, C. E., G. Rand, and E. T. Harris. 1933. “Intensity of Light and Area of Illuminated Field as Interacting Factors in Size of Pupil.” Journal of Experimental Psychology 16 (3): 408. https://doi.org/10.1037/h0072100.
Fetter, Michael. 2007. “Vestibulo-Ocular Reflex.” Developments in Ophthalmology 40: 35–51. https://doi.org/10.1159/000100348.
Foulsham, T. 2015. “Eye Movements and Their Functions in Everyday Tasks.” Eye 29 (2): 196–99. https://doi.org/10.1038/eye.2014.275.
Gajewski, Daniel A., Aaron M. Pearson, Michael L. Mack, Francis N. Bartlett, and John M. Henderson. 2005. “Human Gaze Control in Real World Search.” In Attention and Performance in Computational Vision, edited by Lucas Paletta, John K. Tsotsos, Erich Rome, and Glyn Humphreys, 83–99. Berlin, Heidelberg: Springer Berlin Heidelberg. https://doi.org/10.1007/978-3-540-30572-9_7.
Gaspar, John G., Nathan Ward, Mark B. Neider, James Crowell, Ronald Carbonari, Henry Kaczmarski, Ryan V. Ringer, Aaron P. Johnson, Arthur F. Kramer, and Lester C. Loschky. 2016. “Measuring the Useful Field of View During Simulated Driving with Gaze-Contingent Displays.” Human Factors 58 (4): 630–41. https://doi.org/10.1177/0018720816642092.
Geisler, Wilson S., and Jeffrey S. Perry. 2002. “Real-Time Simulation of Arbitrary Visual Fields.” In Proceedings of the 2002 Symposium on Eye Tracking Research & Applications, 83–87. ETRA ’02. New York, NY, USA: Association for Computing Machinery. https://doi.org/10.1145/507072.507090.
Gibaldi, Agostino, and Silvio P. Sabatini. 2021. “The Saccade Main Sequence Revised: A Fast and Repeatable Tool for Oculomotor Analysis.” Behavior Research Methods 53 (1): 167–87. https://doi.org/10.3758/s13428-020-01388-2.
Gilchrist, Iain D., Valerie Brown, and John M. Findlay. 1997. “Saccades Without Eye Movements.” Nature 390 (6656): 130–31. https://doi.org/10.1038/36478.
Gremmler, Svenja, and Markus Lappe. 2017. “Saccadic Suppression During Voluntary Versus Reactive Saccades.” Journal of Vision 17 (8): 8–8. https://doi.org/10.1167/17.8.8.
Haarmeier, Thomas, and Peter Thier. 1999. “Impaired Analysis of Moving Objects Due to Deficient Smooth Pursuit Eye Movements.” Brain 122 (8): 1495–1505. https://doi.org/10.1093/brain/122.8.1495.
Haarmeier, Thomas, Peter Thier, Marc Repnow, and Dirk Petersen. 1997. “False Perception of Motion in a Patient Who Cannot Compensate for Eye Movements.” Nature 389 (6653): 849–52. https://doi.org/10.1038/39872.
Haitao, Huang, Dong Ruijun, Han Na, Bai Jiarong, Wu Yulong, Li Ke, Wang Chenru, Ma Zhanshan, Chen Lili, and Zhang Hao. 2022. “Research on Stray Light Affecting the Imaging of Fresnel Lens in Virtual Reality Equipment.” SID Symposium Digest of Technical Papers 53 (1): 827–29. https://doi.org/10.1002/sdtp.15620.
Hammack, Bill. 2015. “How a Film Projector Works.” YouTube. https://www.youtube.com/watch?v=En__V0oEJsU&t=250s.
Harris, Laurence R. 1994. “Visual Motion Caused by Movements of the Eye, Head and Body.” Visual Detection of Motion 1: 397–435. https://www.yorku.ca/harris/pubs/1994visualmotioncausedbymovementsofeyeheadandbody.pdf.
Hashiba, M., T. Matsuoka, S. Baba, and S. Watanabe. 1996. “Non-Visually Induced Smooth Pursuit Eye Movements Using Sinusoidal Target Motion.” Acta Otolaryngologica. Supplementum 525: 158–62. http://europepmc.org/abstract/MED/8908293.
Hayhoe, Mary M., David G. Bensinger, and Dana H. Ballard. 1998. “Task Constraints in Visual Working Memory.” Vision Research 38 (1): 125–37. https://doi.org/10.1016/S0042-6989(97)00116-8.
Hayhoe, Mary M., Anurag Shrivastava, Ryan Mruczek, and Jeff B. Pelz. 2003. “Visual Memory and Motor Planning in a Natural Task.” Journal of Vision 3 (1): 6–6. https://doi.org/10.1167/3.1.6.
Hibbard, Paul B., Loes C. J. van Dam, and Peter Scarfe. 2020. “The Implications of Interpupillary Distance Variability for Virtual Reality.” In 2020 International Conference on 3D Immersion (IC3D), 1–7. https://doi.org/10.1109/IC3D51119.2020.9376369.
Hirsch, Joy, and Christine A. Curcio. 1989. “The Spatial Resolution Capacity of Human Foveal Retina.” Vision Research 29 (9): 1095–1101. https://doi.org/10.1016/0042-6989(89)90058-8.
Holzwarth, Valentin, Joy Gisler, Christian Hirt, and Andreas Kunz. 2021. “Comparing the Accuracy and Precision of SteamVR Tracking 2.0 and Oculus Quest 2 in a Room Scale Setup.” In Proceedings of the 2021 5th International Conference on Virtual and Augmented Reality Simulations, 42–46. ICVARS ’21. New York, NY, USA: Association for Computing Machinery. https://doi.org/10.1145/3463914.3463921.
Huey, Edmund B. 1898. “Preliminary Experiments in the Physiology and Psychology of Reading.” The American Journal of Psychology 9 (4): 575–86. http://www.jstor.org/stable/1412192.
Hutton, S. B. 2019. “Eye Tracking Methodology.” In Eye Movement Research: An Introduction to Its Scientific Foundations and Applications, edited by Christoph Klein and Ulrich Ettinger, 277–308. Cham: Springer International Publishing. https://doi.org/10.1007/978-3-030-20085-5_8.
Irwin, David E. 1991. “Information Integration Across Saccadic Eye Movements.” Cognitive Psychology 23 (3): 420–56. https://doi.org/10.1016/0010-0285(91)90015-G.
Jia, Jerry, Tsz Tai Chan, Trisha Lian, and Kevin W. Rio. 2023. “Local Pupil Swim in Virtual- and Augmented-Reality: Root Cause and Perception Model.” Journal of the Society for Information Display 31 (5): 230–40. https://doi.org/10.1002/jsid.1210.
Jolly, Paul, Yuanhao H. Li, Michele A. Cox, Ashley M. Clark, Bin Yang, Ruitao Lin, Zhetuo Zhao, and Michele Rucci. 2023. “Microsaccades in Head-Free High-Acuity Tasks.” Journal of Vision 23 (9): 5817–17. https://doi.org/10.1167/jov.23.9.5817.
Judd, Charles H., Cloyd N. McAllister, and W. M. Steele. 1905. “General Introduction to a Series of Studies of Eye Movements by Means of Kinetoscopic Photographs.” Psychological Review Monographs 7 (1): 1–16.
Karson, Craig N., Karen Faith Berman, Edward F. Donnelly, Wallace B. Mendelson, Joel E. Kleinman, and Richard Jed Wyatt. 1981. “Speaking, Thinking, and Blinking.” Psychiatry Research 5 (3): 243–46. https://doi.org/10.1016/0165-1781(81)90070-6.
Kirchner, Johannes, Tamara Watson, Niko A. Busch, and Markus Lappe. 2022. “Timing and Kinematics of Horizontal Within-Blink Saccades Measured by EOG.” Journal of Neurophysiology 127 (6): 1655–68. https://doi.org/10.1152/jn.00076.2022.
Kirchner, Johannes, Tamara Watson, and Markus Lappe. 2022. “Real-Time MRI Reveals Unique Insight into the Full Kinematics of Eye Movements.” eNeuro 9 (1). https://doi.org/10.1523/ENEURO.0357-21.2021.
Koerfer, Krischan, Tamara Watson, and Markus Lappe. 2024. “Inability to Pursue Nonrigid Motion Produces Instability of Spatial Perception.” Science Advances 10 (45): eadp6204. https://doi.org/10.1126/sciadv.adp6204.
Konrad, Robert, Anastasios Angelopoulos, and Gordon Wetzstein. 2020. “Gaze-Contingent Ocular Parallax Rendering for Virtual Reality.” ACM Trans. Graph. 39 (2). https://doi.org/10.1145/3361330.
Kowacs, P. A., E. J. Piovesan, L. C. Werneck, H. Fameli, and H. Pereira da Silva. 2004. “Headache Related to a Specific Screen Flickering Frequency Band.” Cephalalgia 24 (5): 408–10. https://doi.org/10.1111/j.1468-2982.2004.00686.x.
Kowacs, P. A., E. J. Piovesan, L. C. Werneck, H. Fameli, A. C. Zani, and H. P. da Silva. 2005. “Critical Flicker Frequency in Migraine. A Controlled Study in Patients Without Prophylactic Therapy.” Cephalalgia 25 (5): 339–43. https://doi.org/10.1111/j.1468-2982.2004.00861.x.
Land, Michael F. 2009. “Vision, Eye Movements, and Natural Behavior.” Visual Neuroscience 26 (1): 51–62. https://doi.org/10.1017/s0952523808080899.
Land, Michael F., N. Mennie, and J. Rusted. 1999. “The Roles of Vision and Eye Movements in the Control of Activities of Daily Living.” Perception 28 (11): 1311–28. https://doi.org/10.1068/p2935.
Land, Michael F., and Benjamin W. Tatler. 2009. Looking and Acting: Vision and Eye Movements in Natural Behaviour. New York, NY, US: Oxford University Press. https://doi.org/10.1093/acprof:oso/9780198570943.001.0001.
Lappe, Markus, Holger Awater, and Bart Krekelberg. 2000. “Postsaccadic Visual References Generate Presaccadic Compression of Space.” Nature 403 (6772): 892–95. https://doi.org/10.1038/35002588.
Latour, P. L. 1962. “Visual Threshold During Eye Movements.” Vision Research 2 (7): 261–62. https://doi.org/10.1016/0042-6989(62)90031-7.
Lisberger, Stephen G. 2015. “Visual Guidance of Smooth Pursuit Eye Movements.” Annual Review of Vision Science 1: 447–68. https://doi.org/10.1146/annurev-vision-082114-035349.
Liu, Zhaojun, Chun-Ho Lin, Byung-Ryool Hyun, Chin-Wei Sher, Zhijian Lv, Bingqing Luo, Fulong Jiang, et al. 2020. “Micro-Light-Emitting Diodes with Quantum Dots in Display Technology.” Light: Science & Applications 9 (1): 83. https://doi.org/10.1038/s41377-020-0268-1.
Loschky, Lester C., and George W. McConkie. 2000. “User Performance with Gaze Contingent Multiresolutional Displays.” In Proceedings of the 2000 Symposium on Eye Tracking Research & Applications, 97–103. ETRA ’00. New York, NY, USA: Association for Computing Machinery. https://doi.org/10.1145/355017.355032.
Loschky, Lester C., and Gary S. Wolverton. 2007. “How Late Can You Update Gaze-Contingent Multiresolutional Displays Without Detection?” ACM Transactions on Multimedia Computing, Communications, and Applications 3 (4). https://doi.org/10.1145/1314303.1314310.
Luo, Zhenyi, Yuqiang Ding, Qian Yang, and Shin-Tson Wu. 2024. “Ghost Image Analysis for Pancake Virtual Reality Systems.” Opt. Express 32 (10): 17211–19. https://doi.org/10.1364/OE.523196.
Macele, Philipp, and Jan Mueggenburg. 2024. “Playing with the Eyes. A Media History of Eye Tracking.” In Disability and Video Games: Practices of En-/Disabling Modes of Digital Gaming, edited by Markus Spöhrer and Beate Ochsner, 117–43. Cham: Springer International Publishing. https://doi.org/10.1007/978-3-031-34374-2_5.
Magnusson, Måns, Ilmari Pyykkö, and Bo Norrving. 1986. “The Relationship of Optokinetic Nystagmus to Pursuit Eye Movements, Vestibular Nystagmus and to Saccades in Humans: A Clinical Study.” Acta Oto-Laryngologica 101 (5-6): 361–70. https://doi.org/10.3109/00016488609108620.
Mankowska, Natalia D., Anna B. Marcinkowska, Monika Waskow, Rita I. Sharma, Jacek Kot, and Pawel J. Winklewski. 2021. “Critical Flicker Fusion Frequency: A Narrative Review.” Medicina 57 (10). https://doi.org/10.3390/medicina57101096.
Marsh, Dave. 2017. “Temporal Rate Conversion.” https://learn.microsoft.com/en-us/previous-versions/windows/hardware/design/dn642112(v=vs.85)?redirectedfrom=MSDN.
Martinez-Conde, Susana, Stephen L. Macknik, and David H. Hubel. 2004. “The Role of Fixational Eye Movements in Visual Perception.” Nature Reviews Neuroscience 5 (3): 229–40. https://doi.org/10.1038/nrn1348.
Martinez-Conde, Susana, Jorge Otero-Millan, and Stephen L. Macknik. 2013. “The Impact of Microsaccades on Vision: Towards a Unified Theory of Saccadic Function.” Nature Reviews Neuroscience 14 (2): 83–96. https://doi.org/10.1038/nrn3405.
Matsumiya, Kazumichi, and Keiji Uchikawa. 2003. “The Role of Presaccadic Compression of Visual Space in Spatial Remapping Across Saccadic Eye Movements.” Vision Research 43 (18): 1969–81. https://doi.org/10.1016/S0042-6989(03)00301-8.
Mauderer, Michael, David R. Flatla, and Miguel A. Nacenta. 2016. “Gaze-Contingent Manipulation of Color Perception.” In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, 5191–5202. CHI ’16. New York, NY, USA: Association for Computing Machinery. https://doi.org/10.1145/2858036.2858320.
McConkie, George W., and Keith Rayner. 1975. “The Span of the Effective Stimulus During a Fixation in Reading.” Perception & Psychophysics 17 (6): 578–86. https://doi.org/10.3758/BF03203972.
Merchant, John, Richard Morrissette, and James L. Porterfield. 1974. “Remote Measurement of Eye Direction Allowing Subject Motion over One Cubic Foot of Space.” IEEE Transactions on Biomedical Engineering BME-21 (4): 309–17. https://doi.org/10.1109/TBME.1974.324318.
Miladinović, Aleksandar, Christian Quaia, Simone Kresevic, Miloš Ajčević, Laura Diplotti, Paola Michieletto, Agostino Accardo, and Stefano Pensiero. 2024. “High-Resolution Eye-Tracking System for Accurate Measurement of Short-Latency Ocular Following Responses: Development and Observational Study.” JMIR Pediatr Parent 7 (December): e64353. https://doi.org/10.2196/64353.
Miller, Donald T., David R. Williams, G. Michael Morris, and Junzhong Liang. 1996. “Images of Cone Photoreceptors in the Living Human Eye.” Vision Research 36 (8): 1067–79. https://doi.org/10.1016/0042-6989(95)00225-1.
Mordi, John A, and Kenneth J Ciuffreda. 1998. “Static Aspects of Accommodation: Age and Presbyopia.” Vision Research 38 (11): 1643–53. https://doi.org/10.1016/S0042-6989(97)00336-2.
Morrone, M. C., J. Ross, and D. Burr. 2005. “Saccadic Eye Movements Cause Compression of Time as Well as Space.” Nature Neuroscience 8 (7): 950–54. https://doi.org/10.1038/nn1488.
Murphy, Hunter, and Andrew T. Duchowski. 2002. “Perceptual Gaze Extent & Level of Detail in VR: Looking Outside the Box.” In ACM SIGGRAPH 2002 Conference Abstracts and Applications, 228. SIGGRAPH ’02. New York, NY, USA: Association for Computing Machinery. https://doi.org/10.1145/1242073.1242241.
Niehorster, Diederick C., Li Li, and Markus Lappe. 2017. “The Accuracy and Precision of Position and Orientation Tracking in the HTC Vive Virtual Reality System for Scientific Research.” I-Perception 8 (3): 2041669517708205. https://doi.org/10.1177/2041669517708205.
Nikonorov, Artem, Roman Skidanov, Vladimir Fursov, Maksim Petrov, Sergey Bibikov, and Yuriy Yuzifovich. 2015. “Fresnel Lens Imaging with Post-Capture Image Processing.” In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops. https://doi.org/10.1109/CVPRW.2015.7301373.
O’Sullivan, Carol, John Dingliana, and Sarah Howlett. 2003. “Eye-Movements and Interactive Graphics.” In The Mind’s Eye: Cognitive and Applied Aspects of Eye Movement Research, edited by J. Hyönä, R. Radach, and H. Deubel. Amsterdam: North-Holland. https://doi.org/10.1016/B978-044451020-4/50030-X.
Patney, Anjul, Marco Salvi, Joohwan Kim, Anton Kaplanyan, Chris Wyman, Nir Benty, David Luebke, and Aaron Lefohn. 2016. “Towards Foveated Rendering for Gaze-Tracked Virtual Reality.” ACM Transactions on Graphics (TOG) 35 (6): 179. https://doi.org/10.1145/2980179.2980246.
Poletti, Martina, Chiara Listorti, and Michele Rucci. 2013. “Microscopic Eye Movements Compensate for Nonhomogeneous Vision Within the Fovea.” Current Biology 23 (17): 1691–95. https://doi.org/10.1016/j.cub.2013.07.007.
Prasad, Sashank, and Steven L. Galetta. 2011. “Chapter 1 - Anatomy and Physiology of the Afferent Visual System.” In Neuro-Ophthalmology, edited by Christopher Kennard and R. John Leigh, 102:3–19. Handbook of Clinical Neurology. Elsevier. https://doi.org/10.1016/B978-0-444-52903-9.00007-8.
Rashbass, C. 1961. “The Relationship Between Saccadic and Smooth Tracking Eye Movements.” The Journal of Physiology 159 (2): 326–38. https://doi.org/10.1113/jphysiol.1961.sp006811.
Rayner, Keith. 2014. “The Gaze-Contingent Moving Window in Reading: Development and Review.” Visual Cognition 22 (3-4): 242–58. https://doi.org/10.1080/13506285.2013.879084.
Richardson, Daniel, and Michael Spivey. 2008. “Eye-Tracking: Characteristics and Methods.” In Encyclopedia of Biomaterials and Biomedical Engineering. https://doi.org/10.1201/b18990-101.
Robinson, David A. 1963. “A Method of Measuring Eye Movement Using a Scleral Search Coil in a Magnetic Field.” IEEE Transactions on Bio-Medical Electronics 10 (4): 137–45. https://doi.org/10.1109/TBMEL.1963.4322822.
Rolfs, Martin. 2009. “Microsaccades: Small Steps on a Long Way.” Vision Research 49 (20): 2415–41. https://doi.org/10.1016/j.visres.2009.08.010.
Ross, John, M. Concetta Morrone, and David C. Burr. 1997. “Compression of Visual Space Before Saccades.” Nature 386 (6625): 598–601. https://doi.org/10.1038/386598a0.
Rottach, Klaus G., Vallabh E. Das, Walter Wohlgemuth, Ari Z. Zivotofsky, and R. John Leigh. 1998. “Properties of Horizontal Saccades Accompanied by Blinks.” Journal of Neurophysiology 79 (6): 2895–2902. https://doi.org/10.1152/jn.1998.79.6.2895.
Rucci, Michele, and Martina Poletti. 2015. “Control and Functions of Fixational Eye Movements.” Annual Review of Vision Science 1: 499–518. https://doi.org/10.1146/annurev-vision-082114-035742.
Rui, Congshan, Lei Zhao, Tianwen Hou, Jiaojiao Liang, and Chaohao Wang. 2023. “Pupil Swim Measurement and Analysis of Near-Eye Displays.” SID Symposium Digest of Technical Papers 54 (S1): 371–74. https://doi.org/10.1002/sdtp.16306.
Sakai, Hideki. 2023. “Perception of Brightness When the Eyes Are Closed.” Color Research & Application 48 (1): 63–68. https://doi.org/10.1002/col.22832.
Scarfe, Peter, and Andrew Glennerster. 2019. “The Science Behind Virtual Reality Displays.” Annual Review of Vision Science 5: 529–47. https://doi.org/10.1146/annurev-vision-091718-014942.
Schütz, Alexander C., Doris I. Braun, and Karl R. Gegenfurtner. 2011. “Eye Movements and Perception: A Selective Review.” Journal of Vision 11 (5): 9–9. https://doi.org/10.1167/11.5.9.
Skavenski, A. A., R. M. Hansen, R. M. Steinman, and B. J. Winterson. 1979. “Quality of Retinal Image Stabilization During Small Natural and Artificial Body Rotations in Man.” Vision Research 19 (6): 675–83. https://doi.org/10.1016/0042-6989(79)90243-8.
Smit, A. C., J. A. M. Van Gisbergen, and A. R. Cools. 1987. “A Parametric Analysis of Human Saccades in Different Experimental Paradigms.” Vision Research 27 (10): 1745–62. https://doi.org/10.1016/0042-6989(87)90104-0.
Spauschus, Alexander, J. Marsden, David M. Halliday, Jay R. Rosenberg, and P. Brown. 1999. “The Origin of Ocular Microtremor in Man.” Experimental Brain Research 126 (4): 556–62. https://doi.org/10.1007/s002210050764.
Steinbach, Martin J. 1976. “Pursuing the Perceptual Rather Than the Retinal Stimulus.” Vision Research 16 (12): 1371–76. https://doi.org/10.1016/0042-6989(76)90154-1.
Stevenson, S. B., F. C. Volkmann, J. P. Kelly, and L. A. Riggs. 1986. “Dependence of Visual Suppression on the Amplitudes of Saccades and Blinks.” Vision Research 26 (11): 1815–24. https://doi.org/10.1016/0042-6989(86)90133-1.
Sutherland, Ivan E. 1965. “The Ultimate Display.” In Proceedings of the IFIP Congress, 2:506–8. New York.
Sutherland, Ivan E. 1968. “A Head-Mounted Three Dimensional Display.” In Proceedings of the December 9-11, 1968, Fall Joint Computer Conference, Part I, 757–64. AFIPS ’68 (Fall, Part I). New York, NY, USA: Association for Computing Machinery. https://doi.org/10.1145/1476589.1476686.
Tatler, Benjamin W., Mary M. Hayhoe, Michael F. Land, and Dana H. Ballard. 2011. “Eye Guidance in Natural Vision: Reinterpreting Salience.” Journal of Vision 11 (5): 5–5. https://doi.org/10.1167/11.5.5.
Tatler, Benjamin W., Nicholas J. Wade, Hoi Kwan, John M. Findlay, and Boris M. Velichkovsky. 2010. “Yarbus, Eye Movements, and Vision.” I-Perception 1 (1): 7–27. https://doi.org/10.1068/i0382.
Thier, Peter, and Uwe J Ilg. 2005. “The Neural Basis of Smooth-Pursuit Eye Movements.” Current Opinion in Neurobiology 15 (6): 645–52. https://doi.org/10.1016/j.conb.2005.10.013.
Tuten, William S., and Wolf M. Harmening. 2021. “Foveal Vision.” Current Biology 31 (11): R701–3. https://doi.org/10.1016/j.cub.2021.03.097.
Vinnikov, Margarita, Robert S. Allison, and Dominik Swierad. 2008. “Real-Time Simulation of Visual Defects with Gaze-Contingent Display.” In Proceedings of the 2008 Symposium on Eye Tracking Research & Applications, 127–30. ETRA ’08. New York, NY, USA: Association for Computing Machinery. https://doi.org/10.1145/1344471.1344504.
Virk, Rizwan. 2025. “Chapter 4 - Intelligent Virtual Characters, AGI, and the Future of Metaverse.” In Human-Centered Metaverse, edited by Chang S. Nam, Donggil Song, and Heejin Jeong, 51–73. Morgan Kaufmann. https://doi.org/10.1016/B978-0-443-21996-2.00008-5.
Volkmann, Frances C. 1986. “Human Visual Suppression.” Vision Research 26 (9): 1401–16. https://doi.org/10.1016/0042-6989(86)90164-1.
Volkmann, Frances C., Lorrin A. Riggs, Aimee G. Ellicott, and Robert K. Moore. 1982. “Measurements of Visual Suppression During Opening, Closing and Blinking of the Eyes.” Vision Research 22 (8): 991–96. https://doi.org/10.1016/0042-6989(82)90035-9.
Wallach, Hans, and Charles Lewis. 1966. “The Effect of Abnormal Displacement of the Retinal Image During Eye Movements.” Perception & Psychophysics 1 (1): 25–29. https://doi.org/10.3758/BF03207816.
Warburton, Matthew, Mark Mon-Williams, Faisal Mushtaq, and J. Ryan Morehead. 2023. “Measuring Motion-to-Photon Latency for Sensorimotor Experiments with Virtual Reality Systems.” Behavior Research Methods 55 (7): 3658–78. https://doi.org/10.3758/s13428-022-01983-5.
Wheatstone, Charles. 1838. “On Some Remarkable, and Hitherto Unobserved, Phenomena of Binocular Vision.” Philosophical Transactions of the Royal Society of London 128: 371–94. https://doi.org/10.1098/rstl.1838.0019.
Wu, Tiffany C., and John K. Tsotsos. 2025. “Real-World Visual Search Goes Beyond Eye Movements: Active Searchers Select 3D Scene Viewpoints Too.” PLOS ONE 20 (7): 1–30. https://doi.org/10.1371/journal.pone.0319719.
Wu, Ruei-Jr, Ashley M. Clark, Michele A. Cox, Janis Intoy, Paul C. Jolly, Zhetuo Zhao, and Michele Rucci. 2023. “High-Resolution Eye-Tracking via Digital Imaging of Purkinje Reflections.” Journal of Vision 23 (5): 4–4. https://doi.org/10.1167/jov.23.5.4.
Yarbus, Alfred L. 1967. “Saccadic Eye Movements.” In Eye Movements and Vision, 129–46. Boston, MA: Springer US. https://doi.org/10.1007/978-1-4899-5379-7_5.
Young, Laurence R., and David Sheena. 1975a. “Eye-Movement Measurement Techniques.” American Psychologist 30 (3): 315–30. https://doi.org/10.1037/0003-066X.30.3.315.
Young, Laurence R., and David Sheena. 1975b. “Survey of Eye Movement Recording Methods.” Behavior Research Methods & Instrumentation 7 (5): 397–429. https://doi.org/10.3758/BF03201553.
Zuber, B. L., and L. Stark. 1966. “Saccadic Suppression: Elevation of Visual Threshold Associated with Saccadic Eye Movements.” Experimental Neurology 16 (1): 65–79. https://doi.org/10.1016/0014-4886(66)90087-2.
Zuber, B. L, L Stark, and M Lorber. 1966. “Saccadic Suppression of the Pupillary Light Reflex.” Experimental Neurology 14 (3): 351–70. https://doi.org/10.1016/0014-4886(66)90120-8.