Virtual strike and dip - Advancing inclusive and accessible field geology

ABSTRACT


The "field" environment: real, virtual, and implementation for remote learning
The field may be the single most prominent element defining geosciences.Processes relevant to Earth sciences happen in the field, and their phenomenological traces are observable in that physical space.Thus, fieldwork and the educational components of field trips and field camps are frequently held in the highest regard (Orion and Hofstein, 1994;Elkins and Elkins, 2007;Pyle, 2009;Semken et al., 2018).Fieldwork remains a graduation requirement for most geoscience programs despite increasing concerns over it being inaccessible to many students, predominantly from underrepresented groups, as a result of financial, cultural, physical, and safety barriers.
Frequently, physical locations are hard to reach, or they may be impossible, dangerous, or too expensive to access (e.g., the location is on a different continent, in a restricted area, or only existed in the past) (Slater, 1999;Bowman and McMahan, 2007), or from the spring of 2020 to at least into the summer of 2021, physical field experiences are hindered by the global COVID-19 pandemic.Furthermore, recent studies have shown that the field experience is not inclusive and may be hindering retention and diversity within geoscience undergraduate programs (Hall et al., 2004;Giles et al., 2020;Morales et al., 2020).In contrast, virtual field trips can allow instructors to expose students to widely accessible, relevant, and authentic learning experiences independent of time and space (e.g.Stumpf et al., 2008;Bursztyn et al., 2017;Mead et al. 2019;Klippel et al. 2020).Leveraging increasingly accessible high-resolution computing devices for education has the potential to positively impact student engagement (Witmer and Singer, 1998;IJsselsteijn and Riva, 2003) and efforts to integrate emerging technology into the classroom to improve undergraduate success in introductory geoscience courses have further demonstrated the importance of experiential learning exemplified best by field trips (Cunningham and Lansiquot, 2019;Dolphin et al., 2019;Lansiquot and MacDonald, 2019;Moysey and Lazar, 2019).While there is some positive evidence that compares actual and virtual field trips (e.g., Klippel et al. 2019, Marshall et al. under review), considering fieldwork without the field (i.e. in a virtual environment) is a challenging concept for Earth science educators.Consequently, virtual and remote learning in the geosciences has remained a niche product and it required the COVID-19 pandemic to explore remote learning opportunities for place-based education at scale and across disciplines.We have seen a dramatic influx of efforts (e.g.numerous NAGT Workshops; Earth Educators Rendezvous, 2020) and papers since 2020 that detail the creative ways a community, deprived of their traditional educational methods, has responded to distancing constraints and travel bans (e.g., Andrews et al., 2020;Bethune, 2020;Madon, 2020;Rotzein et al., 2020;Sajjadi et al. 2020;Tibaldi et al., 2020, Rotzein et al., 2021;Whitmeyer and Dordevic, 2021).
Virtual environments, immersive or desktop-based, allow for creating realistic and flexible experiences for (virtual) field trips and the learning activities (e.g., measuring geologic structures and building mental models of spatial orientation and scale of landscape features) that are essential to practice on these field trips.Examining digital twins of outcrops through magnification, collecting samples, or measuring the stratigraphy are, with recent technological advances, straightforward to realize virtually.Over the last four years we and others have been building this capacity through combining efficient data collection in the form of 360° images, high resolution images, virtual outcrop models, and simple measuring tools.What we identified as missing are more complex geological tools and pedagogies for the application and practice of concepts such as strike and dip.Strike and dip measurements and rock identification are the fundamental aspects of any geologic map.Taking and interpreting such field measurements both require physical practice and are fundamentally essential for geoscience education, but are generally not covered extensively in virtual environments (see eRock; Cawood and Bond, 2018 for exceptions).Strike and dip measurements allow students and professionals alike to interpret structures in the Earth's crust and reconstruct deformed regional areas.It is through strike and dip that a geologist understands the regional geology from deposition to deformation.

Why field geology? Spatial reasoning in the geosciences
Students in the geosciences are frequently required to reason about objects or features that occur at spatial scales too large or small to be directly observed (Gagnier et al., 2017) or hidden from view (Shipley et al., 2013;Ormand et al., 2014;Almquist et al., 2018;Zhao and Klippel, 2019;Atit et al., 2020).As a result, faculty frequently describe students' difficulty with spatial visualization as one of the barriers to success in the geosciences (e.g.Barab and Dede, 2007;Titus and Horsman, 2009;Atit et al., 2020).In particular, spatial visualization is critical to success in courses such as sedimentology and stratigraphy, structural geology, and field techniques (Gagnier et al., 2017).Tectonic and sedimentary processes usually form geo-spatially predictable features, deducible from patterns observed in surface data when one is capable of visualizing the 3D geometry (Alles and Riggs, 2011).Students who possess the spatial visualization abilities necessary to succeed in these courses are also more likely to continue in the geosciences (Titus and Horsman, 2009).
The development of geological reasoning skills can be scaffolded by introducing students to a sequence of exercises starting with prototypical, accessible, and understandable physical locations, and also by introducing more experiential practice opportunities at the lower-level prior to the more challenging applications found in subsequent upper-level geoscience courses.In many postsecondary institutions, the concept of strike and dip as geological measurements is introduced in an introductory physical geology course.Later, students practice taking these measurements extensively in a field methods course, apply these methods through different lenses of geologic interpretation in subsequent focus courses, and conclude with a capstone summer field course: Field Camp.Visualizing the 3D forms and structures of our planet is a critical skill for the geosciences, and the foundation of this skill lies in a solid understanding of geological maps and strike and dip measurements.

A place for virtual and immersive technologies in place-based learning
Place-based learning, such as field trips, combines the practices found in problem-based learning and experiential learning to foster a sense of place that generates an authentic learning environment, something valued across disciplines from social to physical sciences.Virtual environments, and especially immersive virtual environments, allow for creating learning environments grounded in the same learning theories and pedagogies as place-based education.
Associated theories are discussed from different angles such as discovery, inquiry, and problembased learning as well as experiential learning (Kolb, 2014).The focus of this article is not on learning theories and as such we are not providing an in-depth discussion of the different approaches.Similarities of these approaches are grounded in a constructivist perspective on learning (Winn, 1993;Dalgarno, 2002) building on the power of contextualizing learning through integrating prior knowledge and experience in addition to the context in which the content is embedded.Bangera and Bronwell (2014) found that benefits of these approaches include that they may offer a more effective and accessible starting point for students, including minority, lowincome, and first-generation college students and can provide students with a greater ability to use scientific thinking in other aspects of their lives.These approaches, and in particular discoverybased learning, have also been found to be key to successful STEM education (PCAST, 2012).
What role can virtual and immersive technologies play in discovery-based courses and fostering equity and access to STEM education such as geoscience field trips?The theoretical basis for the transformative nature, especially of immersive technologies for education, is rapidly growing (Dede, 2009;O'Connor and Domingo, 2017;Liu et al., 2020;Parong and Mayer, 2020;Wu et al., 2020).Characteristics of virtual and immersive technologies lend themselves to realize placebased learning (Semken et al., 2018), experiential and embodied learning (Johnson-Glenberg 2018) as well as designing environments for discovery-based learning.Placing learners into the real-world with a specific problem that is relevant to a location provides a more direct connection of key learning points that students can understand and use to become more engaged (Powers, 2004;Bursztyn et al., 2020).Designing virtual environments in which students' learning activities are scaffolded by exercises and instruction is at the core of discovery-based learning (McComas, 2014).Geological processes can sometimes be difficult to visualize during field trips due to vast spatial and time scales-this is one area in the discipline that iVR can offer a distinct advantage.
The blending of place-based and discovery-based learning, especially in immersive, virtual environments allows for the "perceptual blending of the real and the virtual world with its placebased authenticity" to enable better learning experiences (Barab and Dede, 2007, p. 2).The geosciences have long been either explicitly or implicitly using experiential, place-based exercises to foster discovery-based learning in their curriculum through, for example, field trips (Semken et al., 2018;Atit et al., 2020).Entering the 2020 Field Camp season, a crucial component of most traditional geoscience programs, instructors and students were faced with limited options: no field camp, limited and socially distanced field camp, or virtual field camp.Here it is pertinent to channel the virtuality momentum into constructive, critical, and empirically-grounded discussions of the future and utility of VR for geoscience education.It is important to note that virtual and immersive virtual experiences cannot only be designed to mimic actual field experience but that they offer opportunities beyond physical reality such as reacting to the learner in real-time (Lopes and Bidarra, 2011;Vandewaetere et al., 2013;Sajjadi et al., 2014;Shute et al., 2016).This paper presents a virtual Strike and Dip tool (SaD) in a web-based desktop virtual reality (dVR) environment.In addition to posing many challenges, the COVID-19 pandemic induced transition to primarily online teaching also presented geoscience educators with a new opportunity to improve introductory field-mapping instruction to be more inclusive if we are able to recreate strike and dip lab experiences through virtual environments.SaD is an interactive experience created for the purpose of guiding students to think spatially for critical geological applications by taking strike and dip measurements from 3D models of geological structures.The SaD tool mimics an introductory geologic mapping lab where students are taught strike and dip measurements using a set of angled boards with accompanying rock samples staged around a classroom (or open space) to reveal an imagined geologic structure.We have replicated this experience and traditional pedagogies in the virtual world with SaD and its series of digital planes and corresponding virtual rock samples (high resolution 3D digital models downloaded from Sketchfab ™ .With this tool, students can interactively learn what strike and dip measurements are, practice the basics of field mapping using strike and dip, as well as practice taking measurements using a variety of geological structure types.The SaD tool mimics geoscience place-based learning experiences and combines them with the flexibility and scalability of dVR.A small-scale pilot assessment (eleven participants) using the dVR SaD interface and an accompanying mapping assignment was completed in Fall 2020 and presented at a workshop (Bursztyn et al., 2021).Building on the pilot study we improved the design iteratively and rolled out SaD as a large-scale study in a 250 student introductory geoscience class.We present here a more in-depth discussion of SaD, the newly conducted empirical evaluation and analysis, a critical discussion of results showing important considerations for the future of virtual geosciences, and our vision for future SaD and virtual geoscience toolkit developments.

The Strike and Dip tool
The representation of 3D geologic structures in 2D form requires several standard map notations, the most important of which are strike and dip measurements.New learners are typically introduced to taking strike and dip measurements using the "right hand rule" (RHR) convention.
There are a few variations of the RHR, but a commonly used one (and the one used in this study) is as follows: hold the right hand flat, with the palm down on the planar geologic feature, thumb extended at 90º degrees to fingers, and fingers pointing down dip (Fig. 1).Within the SaD tool, RHR is an optional feature that can be toggled off or on per user preference.Strike and dip is often a challenging concept to teach to new learners of geology in the best of times, but the COVID-19 pandemic presented geoscience educators with a new challenge: removing the in-person field trip instruction that provides guided practice in taking strike and dip measurements.Therefore, what were deemed the fundamental components of in-person field instruction for learning to measure geologic structures (identifying strike and dip planes and manipulating a compass to determine their orientation in space), were the primary focus of the SaD tool.The primary components of the SaD tool are the Compass Tool and the Small Data Panel/Data Set (Fig. 2a;2b).The strike and dip data are recorded in the tool in the All Data Sets panel (Fig. 2b).Users can navigate around a 3D digital environment to locations where they can measure the strike and dip of various slopes (platforms or outcrops).The user can locate their position via the Mini World Map or full screen World Map (Fig. 2c).Once the user is positioned close to the slope they would like to measure, they orient the position and rotation of the compass tool (using the compass control panel) to correspond to the strike or dip measurements.In the virtual environment levels, "station locations" are specifically laid out to correspond to the assignment maps.There are four different setting levels within the SaD tool; from least to most challenging they are: bumper cubes, bumper rocks, cubes, and rocks.The two cubes levels have field stations set up within the virtual environment as rectangular planes with a virtual hand sample rock floating above (Fig. 3a).The cubes levels have very obvious planar surfaces for taking strike and dip measurements.The two rocks levels have their field stations set up with rectangular planes draped with rock "skins" that give an appearance closer to an outcrop (Fig. 3b).Depending on the complexity of the rock texture of the "skin", the planar surfaces within the rocks level environments are more challenging to precisely identify.The two bumper levels have an algorithm that flags the strike and dip measurements in red if they are greater than 10º and 5º off, respectively.These flags enable self-correction by the students and facilitate only recording correct measurements in the data display panel.

Participants
A total of 147 undergraduate students (with an average age of 19.73) participated in this study.
Out of this population, 98 students self-identified as male, 44 as female, three as other, and two preferred not to answer.All students were recruited from an introductory geoscience class (Geosc 001 -Physical Geology) at The Pennsylvania State University in the Fall 2020 semester.This class was chosen for the introductory nature of material taught including the strike and dip content already in the course curriculum.The SaD experience was embedded in this course as a laboratory assignment and students were awarded course credit for their participation.In essence, the laboratory was conducted in a context equivalent to the traditional face-to-face environment.

Procedure
The lab exercise was administered with the help of teaching assistants (TAs).Following the same procedure as the traditional in-person lab that the SaD dVR experience replaced, students were assigned pre-lab homework readings.During the lecture they were presented the standard introductory material on geologic maps and mapping, such as how to interpret the geologic rule of v's, measuring and plotting strike and dip on a map, drawing contacts, and constructing basic cross-sections.Earlier in the semester students completed a geologic mapping exercise from their lab workbooks for which they were provided strike and dip measurements.This lab exercise was graded and returned to the students prior to their introduction to the SaD tool for their virtual field mapping activity.At the beginning of the SaD lab, students were shown an introductory video tutorial demonstrating how to access and utilize the SaD tool through an online dVR environment.
Navigation between "field stations" within the environment using arrow keys and/or mouse, proper hand placement for right hand rule, measurement of strike and dip, as well as using the mini map feature are all demonstrated within this tutorial video.TAs provided additional office hours after the lab session and online video resources (which included a longer comprehensive tutorial video and written instructions for the SaD tool as well as a video tutorial on the basics of geologic mapping and drawing a cross-section).Participants in this study used the SaD tool at the beginner (least challenging) bumper cubes level.
As for the traditional in-person lab exercise, the SaD mapping activity was completed in a single 3-hour lab session and consisted of two parts, both tasking the students with gathering information (strike and dip, rock descriptions) with which to compile a geologic map, legend, cross section, and interpretation of geologic events that formed the area.Students were given blank base maps and fill-in-the-blank field notes to complete as they worked in the virtual environment.This aspect of the assignment tasked the students with transcribing the data as they would in the real world and practice active mapping.Students were also provided with the rock identifications for the map areas to reduce the number of tasks they had to complete in their single lab session.The first mapping activity (Map 1) of the assignment was an optional "practice" map with five rock types, six field stations, and relatively simple geologic relationships to interpret (Fig. 4).The second mapping activity of the assignment (Map 2) was classified as the "real" map with 15 field stations and slightly more complex geologic relationships; this is the map that was evaluated for their grade in this lab assignment.Grading of the lab exercise included evaluation of 1) the map itself, 2) the field notes, 3) the cross-section, 4) the explanation, and 5) the interpretation of geologic events that formed the area (Fig. 4).

Assessment measures and analyses
The participants' experiences and learning with the SaD tool were assessed using self-reported questionnaires (Appendix A).All of the questionnaire items are from established and validated instruments (summarized and adapted by Lee et al., 2010 andKlingenberg, 2020).As part of the demographic information, participants were asked to report on their age, gender, major and minor fields of study, and year of study.Furthermore, participants were asked to report on their familiarity with navigating geographical software such as ArcGIS, as well as their familiarity with playing computer games.Direct student learning of geologic mapping constructs (i.e. via lab grades) was not assessed for this study as the focus was on the experience of the participants using the SaD tool for the purpose of learning the basics of geologic mapping.

Quantitative assessment and analyses
After interacting with the SaD tool, the experiences (learning and general) of the participants were measured in light of representational fidelity, immediacy of control, perceived usefulness, perceived ease of use, motivation, control and active learning, reflective thinking, perceived learning effectiveness, satisfaction, and self-efficacy (Table 1, see also Appendix A for the full question list).All constructs were measured on a scale of 1 to 5 and individual items were averaged and collapsed into the final construct score.
In order to maintain an unbiased distribution into the low/high categories, cases where a participant scored exactly the same as the median (3 for geographical software familiarity, and 4 for gaming familiarity) were excluded.Using this approach 53 participants were identified belonging to the low-Software Familiarity category, 41 to high-Software Familiarity, 47 to low-Gaming Familiarity, and 66 to high-Gaming Familiarity.The experience and learning metrics of participants were compared based on these categories using the independent samples t-test or, alternatively, Mann-Whitney U test in case of non-normal distribution.
In addition to geographical software and gaming familiarity, we also explored the effect of gender on the experiences and learning of participants.As such, the experiences and learning metrics of 98 male participants were compared with 44 female participants.Two-way ANOVAs were conducted to explore the interaction effect between geographical software/gaming familiarity and gender on the measured experience and learning metrics reported in Table 1.All analyses were performed using IBM SPSS Statistics 22. Table 1.Metrics from participant questionnaire and their respective explanations

Metric
Explanation Representational fidelity 1  The degree of realism within the virtual environment.

Immediacy of control 1
The ability to change position/direction and manipulate objects within the virtual environment.

Perceived usefulness 1
Two metrics for "usability" where 1) usefulness relates to the terms: important, relevant, useful, valuable; and 2) ease of use relates to the terms: convenient, controllable, easy, unburdensome.Perceived ease of use 1

Motivation 1
Intrinsic interest based on autonomy and competence; within virtual environment derived from user control over what/when is viewed.

Control and active learning 1
Active involvement in the learning process; learners make their own decisions about the pace, order, and flow of learning activities while completing the task.

Reflective thinking 1
The generation of curiosity or confusion about what is seen being used as a catalyst for learning new concepts by making sense of observations.

Perceived learning effectiveness 1
Two metrics for "learning" in the affective domain where 1) perceived effectiveness relates to generation of understanding, meaning, and interest in the topic; and 2) satisfaction relates to gaining knowledge through the virtual environment, including appreciation for the learning experience.Satisfaction 1

Self-efficacy 2
The degree of confidence in understanding of the topics practiced through the virtual experience. 1,2metrics derived from 1  Lee et al., 2002 and 2 Klingenberg, 2020

Qualitative assessment and analyses
Within the survey, two open-ended questions were asked from the participants about their experiences with the SaD tool: 1) "How was your learning experience using this tool?Describe how you felt about practicing geologic mapping in a virtual environment." 2) "How did your experience using the strike and dip tool change between the first and second mapping activities?Explain within the context of the technology (ease of use, functionality, etc.)" Combined with the quantitative analyses, qualitative analyses provide deeper insights into how the SaD tool was perceived by the participants.Based on the structured content analysis approach proposed by Schreier (2012), two independent coders examined the responses of participants and inductively generated codes that would capture their content.The coders reached agreement by grouping and rearranging the codes into the final schemas (one for each question) based on the most frequent codes.Inter-rater reliability tests based on Cohen's Kappa were also conducted for the finalized results.To further understand these results, we examined the associations between geographical software familiarity and gaming familiarity groupings (high/low) and each of the codes using a chi-square test of independence and a post-hoc test with Bonferroni correction (resulting in an adjusted alpha of 0.0125).

Quantitative analysis
We first looked at the scores for the different measured metrics (Table 1) averaged over all participants to analyze the overall assessment of the SaD tool.The results summarized in Table 2 show slightly above-average scores for the representational fidelity and motivation metrics, and well-above-average scores for immediacy of control, perceived usefulness, perceived ease of use, control and active learning, reflective thinking, perceived learning effectiveness, satisfaction, and self-efficacy.These scores indicate a positive overall evaluation of the SaD tool, implying that it succeeded in eliciting a good experience for users, and therefore can be considered an effective learning instrument.
As a second step, we were interested in how the experience with the SaD tool was impacted by individual differences between the participants related to past exposure to geographical software and video games.The sampled population reported a slightly above-average score for familiarity with navigating geographical software (M=2.86,SD=1.25), and a well-above average score for familiarity with gaming (M=3.91,SD=1.23).The results from the analyses comparing the survey scores of participants based on their software and gaming familiarity groupings are reported in Tables 3 and 4 that follow.For control and active learning, scores of students belonging to the high gaming familiarity group were higher than the low gaming familiarity group (M=3.5, SD=0.85 and M=3.12, SD=0.9, respectively; t(111) = 2.253, p < 0.05).For perceived learning effectiveness, scores of students belonging to the high gaming familiarity group were higher than the low gaming familiarity group (Mdn = 3.43 and Mdn = 3, respectively; U(Nlow = 47, Nhigh= 66) = 1147.000,z = -2.357,p = 0.018).
With respect to gender, our results indicate that male students (M = 3.48, SD = 0.83) reported significantly higher scores for self-efficacy than female students (M = 3.12, SD = 0.85), t(139) = 2.329, p < 0.05).No other significant differences for gender were shown to exist.Finally, we were interested in investigating the possible interactions between geographical software/gaming familiarity and gender on the experience and learning metrics of participants.
Two-way analyses of variance (ANOVAs) were conducted for this inquiry (see Table 5 for complete results) and revealed no statistically significant results.
Our results indicate that the individual differences among students in light of their prior familiarity with navigating geographical software as well as their familiarity with gaming has a pronounced effect on their experiences.The unveiled trend indicates that higher familiarity with either geographical software or gaming leads to a significantly better experience with the SaD tool.Importantly, no effects of gender or significant interactions between software/game familiarity and gender on the experience and learning metrics of participants were observed.total n for high and low software and gaming familiarity includes unspecified gender 1

Qualitative analysis
The results from our qualitative analysis of the two open-ended survey questions are reported in Tables 6 and 7 that follow.With respect to the first open-ended question, "How was your learning experience using this tool?",almost 18% of participants reported that the tool was easy to use while nearly 17% reported that the tool was difficult to use (Table 6).For example, two contrasting participant comments are: "it was easy to navigate" and "I felt confused and overwhelmed on the program almost the entire time I was using it…".Related to useability, almost 11% of participants indicated that the controls for using the tool are not intuitive, e.g.: "it was very frustrating to try and rotate the compass to the right spot…".Another 8% indicated that the tool had a high and steep learning curve, e.g.: "firstly, I thought it is hard but then I got used to it".Furthermore, about 12.5% of participants had performance issues such as lagging and crashing, e.g.: "it was a little slow, as it did not respond immediately to my inputs…".
Importantly, a little over 15% of participants reported that the tool has increased their interest in learning the topic and 22% reported that they perceived the tool as effective for learning, while only 6% reported that they did not perceive the tool to be effective for learning.For example, two contrasting participant comments about the experience are: "…I felt like I was doing actual work…" and "…I think that an in-person experience would be more effective to understand strike and dip…".Related to the latter example, 11% of participants indicated that they would prefer the real environment to the virtual for learning about this topic.
Finally, our results show that 49% of the sampled population had an overall positive impression of the tool whereas only 17% and 13% reported an overall negative or overall mixed impression, respectively.Others did not express clear inclination.
A chi-square test of independence revealed that participants with low geographical software familiarity had a much higher overall negative impression (29.5 %) compared to those with a high geographical software familiarity (2.85 %), χ 2 (1, N = 79) = 9.52, p < 0.01.With respect to the second question, "How did your experience … change between … mapping activities?", 62.5% of participants reported that their experience improved from the first to the second mapping activity (Table 7).More than half of those who reported an improvement to their experience explicitly mentioned that their experience was easier in the second mapping activity because of practicing in the first mapping activity.About 20% of participants reported that their experience remained the same, and 18% reported that their experience worsened from the first to the second mapping activity.From those who reported that their experience worsened, 12.4% stated that the second mapping activity was more difficult and almost 8% stated that they experienced more lag in the second mapping activity.A chi-square test of independence revealed no significant differences between geographical software familiarity categories or gaming familiarity categories and the codes.In summary, the qualitative analysis of the second question indicates that more exposure to the SaD tool improves the overall experience for users but the fact that second activity is more demanding in terms of required graphic power resulted in more performance issues.1

DISCUSSION AND CONCLUSIONS
Using the SaD tool, an entirely remote introductory field mapping exercise was successfully completed by students during the COVID-19 pandemic.This field mapping exercise replicated exactly, in the digital world, the tasks the students would have normally completed in an in-person lab: measuring strike and dip of staged "outcrops", using those data to assemble a map, and interpreting the geologic history for that "region".Using traditional aspects in a new way, this environment not only taught students how to visualize the orientation of strike and dip on a rock plane, but also how to correctly line up a compass using the RHR convention.It also challenged students to conceptualize and infer overall geologic relationships using the measurements they took at each individual 3D outcrop model.From a teaching perspective, the SaD tool also provides three distinct advantages: 1) the time required to set up a staged beginner mapping area is conserved, which in turn permits 2) multiple mapping environments to be explored by the students (e.g."practice" Map 1 followed by "real" Map 2) with different levels of challenge (e.g.bumper cubes vs bumper rocks) available to facilitate individualized learning; as well as 3) the bumper setting flagging incorrect measurements, providing the opportunity for self-correction.In a regular face-to-face introductory mapping lab, there is realistically only time to set up one staged mapping environment and during the exercise, the instructor is trying to assist individual students with a wide range of issues from using their left hand, to holding the compass upside down, to having made and mapped several incorrect measurements without realizing their error.SaD dramatically increased efficient instruction through error flagging alone.
Both quantitative and qualitative results suggest that the students reacted overall positively to the SaD tool.Further, qualitative results suggest that SaD was an effective learning instrument for the mapping exercise, as participants reported an increase in understanding of strike and dip from Map 1 to Map 2. These findings are in agreement with those from the earlier pilot study (n=11) using the same software (Bursztyn et al., 2021) and suggest that SaD can be considered an effective learning instrument.The quantitative results indicate that students familiar with other geographical software or gaming software had a much better experience in light of representational fidelity, immediacy of control, perceived usefulness, perceived ease of use, control and active learning, perceived learning effectiveness, satisfaction, and self-efficacy compared to those who were unfamiliar.This is important as it suggests that introducing students to virtual learning environments more frequently will have positive effects on their learning experience.
Although the results of the qualitative analysis are valuable on their own, when considering the prior individual experiences of users in relation to their open-ended feedback, interesting themes emerge.When comparing participants in the high geographical software familiarity group to those in the low geographical software familiarity group, we see that those in the high familiarity group perceived the tool to be much easier to use and controls to be more intuitive.Similarly, participants in the high familiarity group experienced less performance issues and had a less steep learning curve.It was also the case that participants in this group had a lower tendency to claim preference for the real environment over the virtual one and these participants determined the tool to be effective for learning at much higher rates than those in the low familiarity group.The high geographical software familiarity grouping reported a much higher overall positive impression and much lower overall negative impression of the tool.Finally, a very similar trend is seen when comparing participants of high and low gaming familiarity.Apart from performance issues and learning curve, in almost all the other metrics, participants in the high gaming familiarity group reported a much better experience than those in the low gaming familiarity group.The qualitative results align with the quantitative results, which further strengthens the conclusion that students with higher geographical software familiarity and to some degree, gaming familiarity, gained more cognitively and psychologically from their SaD experience.Our results corroborate observations made in other experiments evaluating the importance and impact of prior familiarity with similar software on the experiences and performance of learners in virtual environments (Bagher et al., under review).Importantly, the absence of effects of gender on the participants' experience and learning metrics suggests an equitable learning experience across gender demographics.
To further explore some of the feedback received through the open-ended questions, we address comments geared towards issues with usability, fidelity to real world environments, and limitations with software.

Usability and fidelity to learning mapping in the real world
Notably, most of the negative comments with the SaD tool are with regard to lag and frustration of becoming familiar with the settings and controls (Tables 5 and 6) and not the sometimes confusing aspect of taking and interpreting strike and dip measurements.Within this lab, the 3D virtual outcrops presented had easy to determine strike planes.Because the RHR convention was represented with a digital right hand that could be manipulated, users could easily determine dip direction and therefore angle.Furthermore, because participants were using the tool with the beginner bumper setting, they were alerted to any incorrect measurements instantaneously.
In the field without a perfectly staged 3D outcrop, it is sometimes difficult to determine the true strike of a lithologic unit, and therefore easy to accidentally measure an apparent plane instead of a true one.Although no "lag" time is associated with field mapping (except perhaps prolonged snack breaks), good, easily determinable strike and dip outcrops are not always abundant.This forces introductory students to learn and practice strike and dip on outcrops that are overly complicated for new learners.For example, Appalachian State students must travel one to two hours each way to the Valley and Ridge Province where they learn how to map in "sedimentary" units that are, in reality, slightly metamorphosed meta-sedimentary rocks, and sometimes have slight foliation or crystallization.Furthermore, the region is heavily deformed with outcropregional sized folds and faults.Finding appropriate outcrops for introductory students is difficult and those that are found are on steep terrain and therefore not wholly accessible.
Interestingly enough, most of the comments made about the reactiveness of the controls are variations of comments heard as an instructor from students in the field.For example, "Initially I didn't know how to use it, so it was frustrating…."and "…At first it was a bit overwhelming, but with some instruction it became much easier and quicker to use…".This is a common comment from students at the end of the semester in a field methods course.Another comment, "It was very frustrating to try and rotate the compass to the right spot…" or "…I struggled with getting everything in place each time…", is a staple in regards to placing the compass when students first get into the field.The comment regarding only seeing one strike and dip measurement at a time ("…was not effective in learning because I was only able to see one strike dip at a time and could not figure out how they related to each other spatially…") is also not an uncommon struggle in the field.Most places do not have kilometer long outcrops in which to visualize the structures of the whole area.One must actively map each individual strike and dip measurement one at a time, only interpreting the structures once there are enough points across the map to put together the geologic story.Similarly, SaD users may also view their "map" with the World Map feature (Fig. 2c) and visualize the region in its entirety.Lastly, the comment "I felt confused and overwhelmed on the program almost the entire time I was using it" is so common in the field that many instructors address this as a known occurrence and the statement is frequently countered with some version of 'You may be lost the majority of the time, the key is to recognize when you are "found" and to fill in the gaps.'Despite the participants in this study having never actually mapped geology before, let alone in a real-world environment, there were several confident comments that in person experience would be more effective for learning and alleviating confusion than the SaD tool.These comments are difficult to address with their "the grass is always greener" perspective.This type of perspective was seen in a study by Stumpf et al., (2008), who found that students exposed to an in-person only field trip claimed preference for the virtual version while students in the virtual field trip group decreed the opposite.The thought that nothing can compare with a real-world field trip is predominant among some geologists but it is one that is exclusive and unimaginative.With the development of realistic virtual desktop environments, iVR experiences, and platforms like Sketchfab and Open Topography, along with public access to texture and material designers like Substance by Adobe, it is becoming more possible and pertinent to develop virtual environments that mimic real world structures, and therefore their value for replicating place-or discovery-based learning (e.g.O'Connor and Domingo, 2017;Atit et al., 2020;Nesbit et al., 2020;Parong and Mayer, 2020;Riquelme et al., 2020;Wu et al., 2020).With iVR, users can even navigate through and interact with virtual environments in a very realistic way, which we suggest is also valuable in discoverybased learning (e.g.Liu et al., 2020;Parong and Mayer, 2020;Wu et al., 2020;Métois et al., 2021).
The results of this study point to a mix of positive evaluation and room for improvements of the SaD tool.Considering that SaD is still evolving, it is expected to receive comments related to usability issues from the participants.Such comments can help us better identify the shortcomings of this tool and plan for future improvements.It is important to emphasize that our results also indicate that a high number of participants perceived the tool as useful for their learning and the overall impression of the tool is positive.

Procedural limitations
For this study, SaD was used in a single lab session following an earlier workbook-style mapping exercise.Although all students were assigned the earlier mapping exercise, only those who completed it had it returned and available for their reference during the subsequent SaD lab activity.Furthermore, it is unclear how many students, if any, referred back to this exercise for reminders or guidance during the SaD exercise.Because the SaD lab was administered during a single lab session, Map 1 (the practice map) was made optional to alleviate the pressure of potential time constraints.Consequently, not all students completed the practice map prior to the main assignment (Map 2).The small-scale pilot study (Bursztyn et al., 2021) built in two work and submission sessions to the exercise with instructor feedback following the first "practice" mapping activity.We were unable to follow this procedure due to curriculum scheduling complications for the present study and this limitation resulted in students either opting to not complete the first mapping exercise, or completing both with the pressure of time-constraints.
Within the dVR experience itself, participants were limited to using the SaD tool restricted to the beginner bumper cubes setting.With bumper cubes incorrect measurements are flagged, but students do not know why they are wrong or how to correct themselves.It will be important to develop the SaD tool to include adaptive interventions such as individualized embedded hints and mapping guidance that would facilitate the learning experience of beginner mappers using the bumper settings.In VR environments it is feasible to implement adaptive learning strategies, such as adaptive interventions, hints, and feedback (Peirce and Wade, 2010;Zaharias et al., 2012), in addition to more dynamic strategies in the form of difficulty and learning content adjustments within the learning experience (Hocine et al., 2015;Streicher and Smeddinck, 2016).Such strategies can support personalized experiences for learners exhibiting different levels of abilities and competencies in relation to the learning experience.In the case of our bumper settings, adaptive interventions might provide feedback on the nature of the error the user has made.It will also be important to study the effect of including such adaptive interventions into the learning environment, both on student learning and on user experience.
Finally, from an individual differences perspective, the exclusion of ethnicity from the participant questionnaire survey was an oversight not realized until too late in the procedure to be corrected.
Critically examining individual differences in the context of the learning experience will continue to be of utmost importance moving forward.Furthermore, in this study, we did not collect the scores from the student work.Individual differences are not only important to consider for the useability of the instrument, but also critical to examine the effect the tool has on student learning.

Technical limitations
Several students experienced technical difficulties including their computers crashing, the SaD tool lagging, and difficulty maneuvering within the virtual environment.Between the pilot study (Bursztyn et al., 2021) and this study we tried to address the lagging concern, knowing that many students would not have access to gaming computers with high-powered video cards.Visual lag can be reduced by minimizing the complexity of the 3D rock models through reducing the number of polygons for each 3D model.However, the tradeoff in this regard is that the 3D models with reduced polygons will at some point become no longer recognizable as particular rock types.We have since been exploring other avenues such as applying detailed texture maps over simplified geometries.Through the use of programs such as Adobe Substance highly detailed textures can be created that give the appearance of complex 3D geometry, many of which are digital twins for diagnostic rock textures.These textures can then be applied to 3D models with simple geometry (such as cubes) while retaining the visual appearance and detail of highly complex 3D models but without creating lag.

OUTLOOK: ADVANCING INCLUSIVITY, ACCESSIBILITY, AND REALISM
Beyond the students' technical difficulties, we also recognize that the interaction fidelity of maneuvering in a two-dimensional dVR environment representing a complex 3D natural environment is limited.Navigation within such an environment is complex and requires training (key combinations, mouse and/or trackpad to maneuver and position the compass vs walking up to a surface and using one's hands).On the other hand, it has been shown that virtual environments, especially developed for web-based distribution and mobile devices, can remove barriers to accessibility and create a culture of inclusion in geoscience classrooms (O'Sullivan and Kearney, 2018;Chenrai and Jitmahantakul, 2019).In recent years, field experiences have been critically looked at from different perspectives.To name some of the prominent challenges: field trips pose troubling accessibility issues excluding students with disabilities but also students who cannot afford to participate due to time or financial constraints.Field work is further challenged by an increasing awareness of harassment that is happening in the field, which is often targeting women and minority students and faculty who do not conform to the stereotypical mainstream conceptions of fieldwork, that is, it is a white, male-dominated domain.Marín-Spiotta et al. (2020) call out this issue, comparing it to the Vegas Rule, criticizing the understanding that "what happens in the field, stays in the field".For the diversity of students who self-select out of geoscience programs to avoid the physical and/or emotional burden of required field mapping experiences, the promise of virtual mapping with digital twin environments such as provided by SaD may provide a solution that facilitates their access, safety, and also retention.
In light of the new openness to virtual experiences, it is essential to critically look at the opportunities (i.e., breaking down long standing barriers of accessibility and inclusion) and challenges that remote learning offers to Earth educators.To establish remote learning opportunities as alternative pathways in geoscience education, we need tools as well as empirical studies that critically examine the opportunities, the challenges, and the feasibility of virtual learning experiences.Many studies remain anecdotal (e.g., Marshall et al., under review) but it is time to establish research frameworks and to connect place-based education with established assessments and practices in virtual and immersive learning (Klippel et al. 2020;Petersen et al. 2020).Immersive virtual reality (iVR) is inherently a three-dimensional (3D), spatial medium (Maceachren and Brewer, 2004) and therefore offers a natural interface to all representations of data that, too, are three-dimensional in nature.
present study as it has been shown that low spatial ability learners can benefit more from a desktop VR application in comparison with high spatial ability learners (Lee et al., 2009).Future empirical evaluations of SaD comparing immersive versus non-immersive instances will include a stronger focus on spatial abilities.
The SaD tool continues to be developed and evolve with each iteration into becoming a more realistic digital twin for teaching field geology technique.The next steps for this tool are mapped out, focused on creating 3D models that mirror real world lithologic features (including, but not limited to, individual sand grains, identifiable fossils, foliation and crystalline textures).As a community, we are ever closer to creating complete, realistic virtual environments for an inclusive and accessible geology field class with world class "outcrops" that mimic those one sees in the classic geology field camps and trips hosted in the Western United States.

Figure 1 .
Figure 1.A schematic of how one measures strike and dip on an outcrop.(a) One measures strike and dip on the planar surface of a rock.The strike represents the line at which the planar rock surface intersects with any horizontal plane.The dip angle is the angle between that dipping surface and the horizontal plane.(b) An example of one using RHR in the real world and (c) in the SaD virtual field environment.

Figure 2 .
Figure 2. The SaD HUD (Heads Up Display).The HUD is composed of all the tools visible on screen throughout the program.Each tool can be toggled on/off depending on user preference.(a) The main HUD displays the Small Data Window, where the user's most recent strike and dip measurements are displayed.The Tour Guide allows the user to view which stop they are presently located.The Mini World Map (red outline) shows the user their location in a miniature view.The user may view the compass with more ease using the UI Compass Face as they are measuring the orientation of the rock with the Compass Tool.The Compass Control Panel is used to position the

Figure 3 .
Figure 3.The two main settings: cubes and rocks.Each may be used with the bumper prefix to allow the user an error-flagging buffer when measuring platform/outcrop orientation (+/-10° strike, 5° dip).(a) shows the level cubes which replicates the classroom beginner technique of using a platform to practice taking strike and dip measurements.(b) An example of the rocks level, which features 3D outcrops.

Figure 4 .
Figure 4.The before and after examples of Map 1.(a) Students are initially given a blank map with space to fill in the explanation and cross-section.Students are expected to fill in field notes and interpretation of geologic events on a separate piece of paper.(b) A completed map and accompanying cross-section.

Table 2
low familiarity group (M=3.64,SD=0.83andM=3.16, SD=0.89, respectively; t(91) = 2.651, 395 p = 0.01).For a complete reporting of these results refer to Table3.396 Our results indicate statistically significant differences (by a combination of independent samples t-tests and the Mann-Whitney test in the case of non-normal distribution) for almost all the metrics in the general and learning experiences of students grouped by low and high software familiarity.For representational fidelity, scores of the high software familiarity group were higher than those in the low software familiarity group (M=3.46,SD=0.95 and M=2.59, SD=0.92, respectively; t(92) = 4.461, p < 0.001).For immediacy of control, scores in the high familiarity group were higher higher than in the low familiarity group (M=3.4,SD=0.92 and M=2.9, SD=0.97, respectively; t(92) = 2.570, p = 0.012).Lastly, scores for self-efficacy were greater in the high familiarity group than in the

Table 3 .
Results of independent samples t-test comparing students grouped by software familiarity similar trend in the results was observed for students grouped by gaming familiarity.Our results indicate statistically significant differences (by a combination of independent samples t-tests and the Mann-Whitney test in the case of non-normal distribution) for almost all the metrics in the general and learning experiences of students grouped by low and high gaming familiarity.For *P < 0.05; **P < 0.001; italics denote metrics with non-normal distribution for which Mann-Whitney test was also used A 8, respectively; t(111) = 2.483, p < 0.05).For perceived ease of use, scores of students belonging to the high gaming familiarity group were higher than the low gaming familiarity group (M=3.42,SD=0.74 and M=2.95, SD=0.8, respectively; t(110) = 3.459, p < 0.01).

Table 4 .
Results of independent samples t-test comparing students grouped by gaming familiarity

1 438 Table 5 .
Results of ANOVA examining interaction effects of geographical software familiarity and gender and gaming familiarity and gender

Table 6 .
Qualitative analysis results for open-ended learning experience question The post-hoc test with Bonferroni correction was in agreement that negative impressions are significantly more common for participants in the low geographical software familiarity category (p < 0.01).No other significant differences between the geographical software familiarity categories or game familiarity categories were observed.1 H-SF=high software familiarity, L-SF=low software familiarity, H-GF=high gaming familiarity, L-GF=low gaming familiarity 1

Table 7 .
Qualitative analysis results for open-ended experience change across activities question