Maze Experiment #4

Maze Experiment #4

Factors affecting representations



The results of experiment III suggested that VE’s allow accurate and useful survey knowledge of large-scale real world environments. However, the VE used in experiment III had features that were designed specifically to assist the formation of survey knowledge. Experiment IV was conducted to determine what minimal features of a VE are necessary to allow survey knowledge. We compared the representations of people after experience with a wireframe VE, a fully rendered VE maze, and a real-world walk-through of a maze.

A partial account of this experiment was presented at the 39th annual meeting of the Psychonomics Society, Dallas TX, 21 Nov. 1998.

A postscript file of this paper can be downloaded here.
Or click the Acrobat icon to download a .pdf file of the paper Download

ABSTRACT

Spatial knowledge in virtual environments (VE’s) is often evaluated using performance measures acquired in the VE. We show that pointing errors measured in a VE are highly predictive of pointing errors in the real world; however, errors in distance estimations made in a VE are not as predictive of distance errors in the real world. We examine factors that affect bearing and distance estimations made in real vs. virtual environments and find that gender is highly influential.

INTRODUCTION

In the last decade, there has been considerable interest in using computer-generated environments (virtual environments – or VE’s) for training spatial knowledge. Because VE’s are able to depict three-dimensional spaces interactively, they offer a promising medium for training people about the spatial characteristics of places and situations that are rare, remote, or dangerous. For example, VE’s can be used to train firefighters about a building’s layout before they must enter it to put out a fire (see Bliss, Tidwell, & Guest, 1997). Another promising application of VE technology is as a research tool for understanding human spatial cognition. In addition to enabling participants to explore large spaces within the confines of the laboratory, VE’s also allow experimenters more control over stimulus characteristics than they typically have in the real world. Indeed, several recent neurophysiological studies of spatial cognition have used VE’s as stimuli to draw conclusions about real-world spatial cognition (Aguirre & D’Esposito, 1997; Maguire et al., 1998).
Training and research applications that use VE’s as substitutes for real-world spaces raise two important questions. First, when spatial knowledge is acquired in a VE, what is the degree to which measurements taken in the VE can substitute for similar ones in the real-world? This question is especially important for training applications in which assessment of the trainee’s capabilities must be made before transfer. Second, before generalizing results found in a VE to real-world cognition, it is important to know the degree to which spatial knowledge acquired in a VE is comparable to that acquired in the real-world. A recent study by Ruddle, Payne, and Jones (1997) focused our attention on these questions. Ruddle et al. (1997) trained people to learn the spatial layout of a virtual office building and later measured their knowledge of it while they were in the VE. They then compared these results directly with those from a widely-cited study of spatial cognition in a similar but real office building (Thorndyke & Hayes-Roth, 1982). Before making such comparisons between measurements acquired in a VE and those acquired in the real-world, we feel that it is necessary to assess the degree to which the understanding of a virtual space predicts that of the real world space on which it was modeled. It is also important to understand the degree to which spatial knowledge acquired in a VE is systematically different than that acquired in the real world.
We addressed these issues by exposing people to two maze environments—one virtual, the other real. We then tested participants’ knowledge of distances and directions between objects in these mazes. To examine the transfer of spatial knowledge, participants were also tested in a real-world maze after learning in a virtual one.

METHOD

Participants
The participants were 27 students (12 men) enrolled in an introductory Psychology course at the University of Washington. Twenty-one of the participants received extra credit for their participation. The remaining six were paid $10 per hour.
Materials
The real-world environments were two 4.88 m x 4.88 m mazes constructed from 2.13 m black curtains hanging from an overhead grid of cables. The system of cables and curtains allowed the experimenter to reconfigure the mazes rapidly between conditions of the experiment. Mazes were covered with white fabric to reduce the amount of directed light entering them. Four prominent landmarks (in the real world : three large cardboard letters and a cardboard box; in the VE : a ball, a violin, a sword, and a gun) were placed at fixed locations in either maze. Figure 1 illustrates the configuration of the mazes.


Figure 1. Schematic maps of the mazes used in the experiment. T’s denote the location of the targets to which participants pointed and estimated distances.

For the virtual condition of the experiment, these mazes were modeled in World Up by SENSE8 and run on a Pentium Pro 200 using a Diamond FireGL 3000 graphics accelerator board. We used color and fog effects (but no texture mapping) in the VE maze to enhance its interpretability. Participants viewed the VE while sitting 38 cm from a 35 cm x 26.5 cm monitor with a resolution of 1152 x 900 (32K colors, 76 Hz refresh). The speed of this system was approximately 12.64 frames per second. Navigation in the VE was controlled with a Thrustmaster PFCS joystick, providing three degrees of freedom of movement: translation in either dimension along the ground as well as the ability to pan the viewpoint around the vertical axis (yaw).

Procedure
Participants were first trained to navigate in a VE with a joystick until they could complete a ‘virtual obstacle course’ in under five minutes. They then practiced pointing and estimating distances in feet and the experimenter thoroughly explained the nature of their tasks to them. Participants then learned both a real-world and a VE maze. The order of presentation and the maze configuration were counterbalanced across participants.
In the virtual maze, participants were given as much time as they wanted (minimum of seven minutes) to explore and learn the relative locations of objects. When the participant indicated that he or she had adequately learned the virtual maze, we tested his or her knowledge of it : first in the VE (virtual measurements), and then in the real world (transfer measurements).
Each participant provided nine bearing and six distance estimations while in the VE. For bearing estimations, participants were placed directly in front of one of the maze objects and instructed to rotate their viewpoint to the direction of one of the three other targets. A set of cross-hairs superimposed on the monitor screen helped participants align the direction of their viewpoint on the target. After each bearing estimation, participants estimated the distance to the target in feet. The three symmetric distance estimations (e.g. A to B after having already estimated the distance between B and A) were not used in subsequent analyses. For the transfer phase of the experiment, the experimenter escorted the participant into the real world version of the (virtual) maze that had just been learned. Participants made eight bearing and distance estimates from four fixed locations to various unseen (and un-passed) objects in the maze. Bearing estimations in the real world were measured with a dial mounted on a 1.04 m stand. Participants rotated the dial to point in the direction of each target, and the direction was recorded to the nearest degree. After each bearing estimation, participants estimated the distance to the target in feet.
Each participant was also given as much time as they wanted (minimum of four minutes) to explore a real-world maze (see Figure 1). After learning the real-world maze, participants made eight bearing and distance estimations within it using the dial described above. Table 1 summarizes the three main types of measurements that we obtained.


Table 1. The three measurement types taken repeatedly from each participant.

RESULTS

Mean signed distance and bearing errors for the three types of measurements (virtual, transfer, and real) were computed for each participant by averaging the differences between estimations and actual quantities. Averaged over all participants, mean signed bearing errors (Mreal = -0.45°, SD = 14.06; Mtranfer = -0.59°, SD = 23.84; Mvirtual = -5.48°, SD = 20.97) were not significantly different from zero, nor were they significantly different between measurement types (F(2, 52) = 0.52, p = .60). Figure 2 illustrates participants’ signed bearing errors for each of the measurement types.
Distances were generally overestimated. This overestimation was significant for the real world (M = 0.66 feet, SD = 1.02)(t(26) = 3.40, p = .002) and transfer measures (M = 0.51 feet, SD = 1.03)(t(26) = 2.57, p = .02) and approached significance for the virtual measure (M = 0.53 feet, SD = 1.50) (t(26) = 1.85, p=.08). However, there was no significant difference in signed distance errors between the three measurement types (F(2, 52) = 0.18, p = .84). Because there were no significant differences between virtual, transfer, and real signed errors for both distances and bearings, the analyses that follow use only unsigned errors.

Relationships between measurement types
In general, correlations between unsigned bearing errors were higher than those for distance errors. The correlation between real and virtual bearing errors (r(27) = .63, p < .001) was higher than that between real and virtual distance errors (r(27) = .46, p = .02), however, this was not a statistically significant difference. Similarly, the correlation between transfer task and virtual bearing errors (r(27) = .72, p < .001) was higher (though not significantly higher) than that between transfer task and virtual distance errors (r(27) = .43, p = .03). Table 2 presents the intercorrelations, means, standard deviations, and estimated reliabilities of the bearing and distance errors for the three measurement types.


Figure 2. Bearing estimations for men and women relative to the true bearing in the three phases of the experiment.
The centroid of the estimations is indicated by the arrow head.


Table 2. Means, standard deviations, reliabilities, and Pearson intercorrelations between distance and bearing errors for the three measurement types.

Factors affecting bearing and distance errors
Figure 3 illustrates the relationship between the amount of time participants spent learning each maze and the degree to which they made errors in bearing estimations. In general, people who were more disoriented were those who had spent more time learning the maze. Jackknifed correlations between the time spent learning the maze and the error in bearing estimations were greatest for the transfer (r(27) = .515, p = .01) measure, and were similar for the real (r(27) = .386, p = .05) and virtual (r(27) = .386, p = .05) measures. Because learning time exerted a significant effect on bearing errors, we used it as a covariate in the analyses that follow.
Gender significantly influenced the accuracy with which participants made bearing judgments. In general, men’s errors in bearing estimation (Mreal = 16.70°, SD = 6.75; Mtransfer = 24.10°, SD = 7.21; Mvirtual = 17.46°, SD = 11.10) were smaller than those for women (Mreal = 20.23°, SD = 6.46; Mtransfer = 39.60°, SD = 20.41). Figure 2 shows that the error in women’s bearing estimations was particularly high in the VE (Mvirtual = 52.15°, SD = 29.31). A 2 (gender [between] ) by 3 (measurement type – real/transfer/VE [within] ) mixed effects ANCOVA on unsigned bearing errors (treating the average time spent learning the mazes as a covariate) revealed a significant interaction between gender and measurement type (F(2,48) = 6.62, p = .003) and a significant main effect of gender (F(1,24) = 6.96, p = .01). These effects are illustrated in Figure 4.


Figure 3. Bearing error as a function of time spent learning the maze.


Figure 4. The effects of gender and measurement type on errors in bearing estimation.

Very different effects were found for errors in distance estimations. Distance errors were more affected by measurement type (see table 2) than by gender or the interaction of gender and measurement type. A 2 (gender) x 3(measurement type : real/transfer/VE) ANCOVA conducted on unsigned distance errors (and using the mean time spent learning the mazes as a covariate) revealed no significant main effects or interactions, although the effect of environment approached significance (F(2,48) = 3.14, p = .052). These data are illustrated in Figure 5.


Figure 5. The effect of gender and measurement type on distance errors.

A common scale for angles and distances
In order to compare the relative accuracy of distance estimations (which are measured in feet) and bearing estimations (which are measured in degrees), we correlated judged interpoint bearings with actual bearings (and judged with actual interpoint distances) for each participant. After standardizing these correlations (with Fisher’s r-to-z transformation), we performed a 2 (gender) x 3 (measurement type : real/transfer/virtual) x 2 (data source : bearings/distances) ANOVA to examine the factors influencing relative accuracy of bearings and distances. The most significant effects were those of the data source (Mbearings = 2.10, SD = 0.57; Mdistances = 1.11, SD = 0.48) (F(1,25) = 242.2, p < .001) and measurement type (Mreal = 1.96, SD = 0.45; Mtransfer = 1.47, SD = 0.63; Mvirtual = 1.39, SD = 0.65) (F(2,50) = 19.05, p <.001) however, gender and several interactions were also prominent. Figure 6 illustrates the effects of gender, data source, and measurement type on relative accuracy of bearing and distance judgments.


Figure 6. Effects on the correlations between judged and actual distances and judged and actual bearings.

DISCUSSION

One of the aims of this experiment was to address the degree to which measuring spatial knowledge in a VE adequately predicts subsequent performance in the real-world. Our results suggest that a person’s ability to point to unseen objects in a VE is quite predictive (r » .60) of their ability to do so in the real world. On the other hand, the accuracy of distance judgments in a VE does not transfer as well to the real world (r » .44). The lower correlations between virtual and real distance judgments are probably due in part to the lower reliability of errors in distance judgments. Much of the difference between the predictive capability of bearing and distance errors may also be due to range effects, and future work should aim at determining the degree to which these findings generalize to larger environments. It is clear that in our room-sized maze environment, it was easier for participants to become severely disoriented than to make grossly inaccurate distance estimates. The extent to which this trend also occurs in larger spaces is an open question. Despite these caveats, the finding that errors in distances and errors in bearings are differentially affected by factors such as gender and measurement type provides some support for the notion that bearings and distances are generated from distinct psychological processes, rather than both deriving from a common conception. It is important to note that disorientation in virtual mazes was particularly severe for women. Several (7) women – and no men – had average bearing errors in excess of 40° for both the virtual and transfer phases of the experiment. However, the gender difference between real world errors was much smaller. These results corroborate earlier findings that understanding the spatial characteristics of VE’s may be more challenging for women than for men (Waller, Hunt, & Knapp, 1998; Astur, Ortiz, & Sutherland, 1998). Gender differences in the ability to use and understand VE’s is an important issue for future research.

ACKNOWLEDGEMENTS

This research was supported by the Office of Naval Research grant N00014-96-0380, Earl Hunt, Principal Investigator. Correspondence concerning this article should be addressed to David Waller, Department of Psychology Box 351525, University of Washington, Seattle, WA 98195. Electronic mail may be sent to dwaller@u.washington.edu.

REFERENCES

Aguirre, G. K., & Desposito, M. (1997). Environmental knowledge is subserved by separable dorsal/ventral neural areas. Journal of Neuroscience, 17(7), 2512-2518.

Astur, R. S., Ortiz, M. L., & Sutherland, R. J. (1998). A characterization of performance by men and women in a virtual Morris water task: A large and reliable sex difference. Behavioural Brain Research, 93(1-2), 185-190.

Bliss, J. P., Tidwell, P. D., & Guest, M. A. (1997). The effectiveness of virtual reality for administering spatial navigation training to firefighters. Presence: Teleoperators and Virtual Environments, 6, 73-86.

Maguire, E. A., Burgess, N., Donnett, J. G., Frackowiak, R. S. J., Frith, C. D., & Okeefe, J. (1998). Knowing where and getting there: A human navigation network. Science, 280(5365), 921-924.

Ruddle, R. A., Payne, S. J., & Jones, D. M. (1997). Navigating buildings in "desk-top" virtual environments: Experimental investigations using extended navigational experience. Journal of Experimental Psychology: Applied, 3, 143-159.

Thorndyke, P. W., & Hayes-Roth, B. (1982). Differences in spatial knowledge acquired from maps and navigation. Cognitive Psychology, 14, 560-589.

Waller, D., Hunt, E., & Knapp, D. (1998). The transfer of spatial knowledge in virtual environment training. Presence: Teleoperators and Virtual environments, 7(2), 129-143.

[ return to top ]