Document [original]

Citation: Hinricher, N.; König, S.;

Schröer, C.; Backhaus, C. Influence of

Virtual Reality on User Evaluation of

Prototypes in the Development

Process—A Comparative Study with

Control Rooms for Onshore Drilling

Rigs. Appl. Sci. 2023,13, 8319.

https://doi.org/10.3390/

app13148319

Academic Editor: João Marcelo

Teixeira

Received: 15 June 2023

Revised: 12 July 2023

Accepted: 13 July 2023

Published: 18 July 2023

Licensee MDPI, Basel, Switzerland.

This article is an open access article

distributed under the terms and

conditions of the Creative Commons

Attribution (CC BY) license (https://

creativecommons.org/licenses/by/

4.0/).

applied

sciences

Article

Influence of Virtual Reality on User Evaluation of Prototypes in

the Development Process—A Comparative Study with Control

Rooms for Onshore Drilling Rigs

Niels Hinricher 1,* , Simon König 1, Chris Schröer 1and Claus Backhaus 2

Center for Ergonomics and Medical Engineering, FH Münster University of Applied Sciences, Bürgerkamp 3,

48565 Steinfurt, Germany; simon.koenig@fh-muenster.de (S.K.); chris.schroeer@fh-muenster.de (C.S.)

Institute of Psychology and Ergonomics, Technical University Berlin, Fasanenstraße 1, 10623 Berlin, Germany;

claus.backhaus@fh-muenster.de

*Correspondence: niels.hinricher@fh-muenster.de

Abstract:

User evaluations of prototypes in virtual reality (VR) offer high potential for products

that require resource-intensive prototype construction, such as drilling rigs. This study examined

whether the user evaluation of a VR prototype for controlling an onshore drilling rigproduces results

comparable to an evaluation in the real world. Using a between-subject design, 16 drilling experts

tested a prototype in VR and reality. The experts performed three different work processes and evalu-

ated their satisfaction based on task performance, user experience, and usability via standardized

questionnaires. A test leader evaluated the effectiveness of the work process execution using a 3-level

rating scheme. The number of user interactions and time on task were recorded. There were no

significant differences in the effectiveness, number of interactions required, perceived usability, and

satisfaction with respect to task performance. In VR, the drilling experts took significantly more time

to complete tasks and rated the efficiency of the VR prototype significantly higher. Overall, the real-

world evaluation provided more insights into prototype optimization. Nevertheless, several usability

issues have been identified in VR. Therefore, user evaluations in VR are particularly suitable in the

early development phases to identify usability issues, without the need to produce real prototypes.

Keywords:

virtual prototype; usability test; virtual reality (VR); oil and gas industry; human–technology

interaction; control room design

1. Introduction

The development of new products with complex human–machine interfaces ideally

follows a user-centered design process [

]. In this process, prototypes are iteratively

developed based on user requirements, as evaluated by users in usability tests. However,

prototyping is time-consuming and expensive [

]. To reduce these costs, companies use

virtual prototypes at the beginning of their development process [3].

Virtual reality (VR) technology offers the possibility of visualizing and experiencing

virtual prototypes in detail. Users can test and evaluate the prototypes early in the de-

velopment process by simulating the anticipated real-world environment in which the

prototypes will eventually be utilized [

]. A literature review by Freitas et al. [

] shows that

the application areas of user evaluation in VR are the automotive (37%), engineering (26%),

and academic or unspecified (37%) sectors. In all three fields, VR is primarily applied to

review design. In VR design reviews, the prototype is presented to the development team,

experts, and users in three dimensions, thereby revealing the optimization potential and

improving the efficiency of the development process [6–13].

User testing is one of the most reliable methods for evaluating the usability of a proto-

type. Future users operate the new product and perform typical work processes [

]. Initial

studies showed that the results of user testing in VR can be transferred to reality. However,

Appl. Sci. 2023,13, 8319. https://doi.org/10.3390/app13148319 https://www.mdpi.com/journal/applsci

Appl. Sci. 2023,13, 8319 2 of 20

in these studies, the prototypes and products were not tested in a pure VR environment,

but in mixed reality environments [

–

]. In these tests, virtual environments were mixed

with real controls, such as a steering wheel, because the lack of haptic feedback can limit

the prototype evaluation [19,20].

However, purely virtual prototypes are advantageous, particularly during the early

development phases. Different concepts and combinations of human–machine interfaces

can be tested, independent of control elements or other real content. In addition, user

tests can be performed regardless of location, allowing users anywhere in the world to be

included in testing.

Virtual user tests are particularly useful for complex machines and devices with a high

demand for operational safety, such as control rooms, drilling rigs, and medical devices.

To minimize possible usability issues and the resulting hazards to the environment and

humans, numerous prototypes need to be tested, which results in high development costs

and long development times. However, studies examining complex devices or machines in

VR are scarce. Aromaa et al. [

] conducted a user test in VR using a tunnel-boring machine.

In this study, the influence of two different transparency levels of a machine boom on the

work performance was investigated. The findings of the study enabled the identification of

the preferred transparency level among the operators. Bergroth et al. [

] investigated the

suitability of VR for evaluating the control rooms in nuclear power plants. The test subjects

rated VR as a suitable means for evaluating the control rooms. However, neither study

validated their results with an evaluation in the real world.

User tests in VR offer high potential for the development of offshore or onshore drilling

rigs, which feature human–machine interfaces comprising various controls and displays.

Onshore drilling rigs are used for deep drilling operations in the exploration for oil, gas,

and geothermal resources on land. A driller is responsible for the drilling process. This

person controls the drilling process from a driller cabin, monitors several displays, and

operates the technical equipment of the drilling rig. Catastrophes such as the Deepwater

Horizon explosion in the Gulf of Mexico show the importance of the user-oriented design

of human–machine interfaces in this industry [23].

Using VR, the number, suitability, and layout of displays and controls can be examined

first in purely virtual tests before real controls are mixed with the virtual content, or before

real prototypes are produced. Prototypes in VR can be changed more easily; therefore, more

prototypes can be tested compared to the traditional user-centered development process.

Overall, there is a lack of studies investigating a product during the development

process to determine whether a purely virtual user test yields comparable findings to a

user test with a physical prototype. The literature review by Gutemberg Junior et al. [

]

on the application of VR in product development in the oil and gas industry shows that no

studies exploring user testing in VR are available for this industry.

Therefore, this study is accompanied by a multiyear project to develop a new prototype

for controlling onshore drilling rigs. Figure 1shows the prototype model. The front screen

displays the process-relevant data that must be monitored by the driller. Consoles, such

as joysticks and rotary controls, are controls for operating the various machines on a rig.

Additional controls and functions are available in a newly designed user interface that is

operated with touchscreens. Typical operations performed with the prototype can be found

in the Supplementary Materials (Figures S1–S3).

In this study, user tests were conducted in VR and reality using the prototype shown

in Figure 1. The goal was to investigate whether there were significant differences between

the tests in terms of the number and types of user errors, user experience, user acceptance,

and the time required to perform the work.

Appl. Sci. 2023,13, 8319 3 of 20

Appl. Sci. 2023, 13, x FOR PEER REVIEW 3 of 20

Figure 1. Model of the prototype developed for operating a drilling rig. Displays for process-rele-

vant data are shown on the front screen. The side consoles contain touchscreens and operating ele-

ments for controlling machines and equipment of the drilling rig.

In this study, user tests were conducted in VR and reality using the prototype shown

in Figure 1. The goal was to investigate whether there were significant diﬀerences between

the tests in terms of the number and types of user errors, user experience, user acceptance,

and the time required to perform the work.

2. Materials and Methods

2.1. Participants

To investigate whether user evaluation in VR is comparable to that in real-world set-

tings, the prototype was tested by drilling experts in a between-subjects design in reality

and VR. Thus, the drilling experts tested either the real prototype or the virtual one.

The construct validity of VR simulations is typically assessed by comparing the work

processes of experts and novices [25,26]. Therefore, in this study, the VR prototype was

also tested by novices. The novices were students from a university environment. Table 1

presents the data of the participants. All novices were enrolled in a bachelor’s or master’s

degree program with a technical focus at the time of the study, and they reported using

computers daily. None of the participants had any prior experience with VR systems at

the time of the study. Work experience refers to the operation of an onshore drilling rig.

Table 1. Subject data: gender, age, and work experience with operating drilling rigs.

Trial Group Gender m/f [n] Age ± SD [a] Work Experience [a]

VR Drilling Experts 8/0 42 ± 5 12 ± 10

VR Drilling Novice 10/0 26 ± 3 0

Real Drilling Experts 8/0 40 ± 7 9 ± 6

2.2. Experimental Setup

2.2.1. Real Prototype

Figure 2 shows the experimental setup for the user test in real-world settings. The

prototype shown in Figure 1 was manufactured in physical form. User interface mockups

in the form of click dummies were displayed on the touchscreens on the side consoles of

the prototype. The click dummies were created using Adobe XD software (version 36.0,

Adobe XD, Adobe Inc., San Jose, CA, USA), and contained 321 interfaces.

On the front screen, the participants were presented with indicators of process-rele-

vant data. The indicators, predominantly gauges and bar graphs, were also created using

Adobe XD and could be modified in the background depending on the participant’s in-

teractions with Wizard of Oz [27]. The Wizard of Oz technique is an experimental tech-

nique used for simulating systems that are impossible or expensive to implement [28,29].

Figure 1.

Model of the prototype developed for operating a drilling rig. Displays for process-relevant

data are shown on the front screen. The side consoles contain touchscreens and operating elements

for controlling machines and equipment of the drilling rig.

2. Materials and Methods

2.1. Participants

To investigate whether user evaluation in VR is comparable to that in real-world

settings, the prototype was tested by drilling experts in a between-subjects design in reality

and VR. Thus, the drilling experts tested either the real prototype or the virtual one.

The construct validity of VR simulations is typically assessed by comparing the work

processes of experts and novices [

]. Therefore, in this study, the VR prototype was

also tested by novices. The novices were students from a university environment. Table 1

presents the data of the participants. All novices were enrolled in a bachelor’s or master’s

degree program with a technical focus at the time of the study, and they reported using

computers daily. None of the participants had any prior experience with VR systems at the

time of the study. Work experience refers to the operation of an onshore drilling rig.

Table 1. Subject data: gender, age, and work experience with operating drilling rigs.

Trial Group Gender m/f [n] Age ±SD [a] Work Experience [a]

VR Drilling Experts 8/0 42 ±5 12 ±10

VR Drilling Novice 10/0 26 ±3 0

Real Drilling Experts 8/0 40 ±7 9 ±6

2.2. Experimental Setup

2.2.1. Real Prototype

Figure 2shows the experimental setup for the user test in real-world settings. The

prototype shown in Figure 1was manufactured in physical form. User interface mockups

in the form of click dummies were displayed on the touchscreens on the side consoles of

the prototype. The click dummies were created using Adobe XD software (version 36.0,

Adobe XD, Adobe Inc., San Jose, CA, USA), and contained 321 interfaces.

On the front screen, the participants were presented with indicators of process-relevant

data. The indicators, predominantly gauges and bar graphs, were also created using Adobe

XD and could be modified in the background depending on the participant’s interactions

with Wizard of Oz [

]. The Wizard of Oz technique is an experimental technique used for

simulating systems that are impossible or expensive to implement [28,29].

The participants were filmed using two cameras (GoPro Hero 5; GoPro Inc., San

Mateo, CA, USA) during the tests. One camera was located above the front screen, and

the participants were filmed from the front. The second camera was positioned behind

the participants, and it filmed their interactions with the controls and touchscreens. The

Appl. Sci. 2023,13, 8319 4 of 20

cameras were connected via Bluetooth to a tablet (Samsung Note 10; Samsung Electronics Co.,

Suwon-si, Republic of Korea). The experiments were conducted in an empty office room.

Appl. Sci. 2023, 13, x FOR PEER REVIEW 4 of 20

Figure 2. Experimental setup for user test in real-world setting: Participants performed tasks with

the prototype. The “Wizard of Oz” changed the indicators on the front screen depending on the

participant’s interactions. The test leader evaluated the execution of the tasks, using a 3-level scale

(Table 2).

The participants were filmed using two cameras (GoPro Hero 5; GoPro Inc., San

Mateo, CA, USA) during the tests. One camera was located above the front screen, and the

participants were filmed from the front. The second camera was positioned behind the

participants, and it filmed their interactions with the controls and touchscreens. The cam-

eras were connected via Bluetooth to a tablet (Samsung Note 10; Samsung Electronics Co.,

Suwon-si, Republic of Korea). The experiments were conducted in an empty oﬃce room.

2.2.2. VR Prototype

The virtual prototype was created in the Unity 2020.2.3f1 development environment

(Unity Technologies, San Francisco, CA, USA) and programmed using the C# language

(Microsoft Corporation, Redmond, WA, USA). Visualization was performed using a Valve

Index head-mounted display (HMD) (Valve Corporation, Bellevue, WA, USA), PC with

an i7 processor, and a GeForce GTX 1070 graphics card (NVIDIA Inc., Santa Clara, CA,

USA).

Figure 3 shows the experimental setup for the user test in VR. The participants sat in

an industrial seat that is also used in onshore drilling rigs. An HMD was used to display

the new prototype to the participants. The participants were able to freely interact with

the prototype using controllers (Valve Index). The controllers used integrated sensors to

detect the hand and finger positions, making it possible for the controls to be understood

as in reality. In addition, natural interactions with the touchscreen were possible when the

participants spread their index fingers away from the controller. This was detected by

sensors, and the splayed finger was visualized using VR. Grasping in VR was performed

Figure 2.

Experimental setup for user test in real-world setting: Participants performed tasks with

the prototype. The “Wizard of Oz” changed the indicators on the front screen depending on the

participant’s interactions. The test leader evaluated the execution of the tasks, using a 3-level

scale (Table 2).

2.2.2. VR Prototype

The virtual prototype was created in the Unity 2020.2.3f1 development environment

(Unity Technologies, San Francisco, CA, USA) and programmed using the C# language

(Microsoft Corporation, Redmond, WA, USA). Visualization was performed using a Valve

Index head-mounted display (HMD) (Valve Corporation, Bellevue, WA, USA), PC with an

i7 processor, and a GeForce GTX 1070 graphics card (NVIDIA Inc., Santa Clara, CA, USA).

Figure 3shows the experimental setup for the user test in VR. The participants sat in

an industrial seat that is also used in onshore drilling rigs. An HMD was used to display

the new prototype to the participants. The participants were able to freely interact with

the prototype using controllers (Valve Index). The controllers used integrated sensors to

detect the hand and finger positions, making it possible for the controls to be understood

as in reality. In addition, natural interactions with the touchscreen were possible when

the participants spread their index fingers away from the controller. This was detected by

sensors, and the splayed finger was visualized using VR. Grasping in VR was performed

by spreading all fingers away from the controller, guiding the controller to the joystick, and

then grasping the controller again.

Appl. Sci. 2023,13, 8319 5 of 20

Appl. Sci. 2023, 13, x FOR PEER REVIEW 5 of 20

by spreading all fingers away from the controller, guiding the controller to the joystick,

and then grasping the controller again.

Figure 3. Experimental setup for the user test in VR: participants performed tasks using the VR

prototype. The test leader evaluated the execution of the tasks, using a 3-level scale (Table 2).

The designs on the touchscreen and indicators on the front screen corresponded to

those of the real test. A camera (GoPro Hero 5; GoPro Inc., USA) documented the state-

ments and interactions of the participants during the tests.

2.3. Experimental Procedure

2.3.1. Simulated Work Processes

The central work processes for geological drilling using the rotary drilling method

are “drilling,” “connection making,” and “tripping” [30]. During drilling, a rotating drill

bit mechanically crushes the rock to be drilled through. Subsequently, the crushed rock is

conveyed to the surface using a drilling fluid pumped through a drill string. Torque is

applied by a top drive, which is connected to the drill string, and the drive may be moved

vertically in a mast. The drill string suspended in the mast is typically composed of several

drill pipes with an approximate length of 9 m. After every 9 m (or more) of drilling, a new

drill pipe is screwed onto the drill string. This process is called “connection making”. To

continue the drilling process, the top drive must be reconnected to the drill string. This

process is called “top drive connection”. When the drill bit needs to be changed (for wear

reasons), the drill string is pulled out of the ground step-by-step to sequentially unscrew

the individual drill pipes. This process is called “tripping” [31]. During the drilling pro-

cess, the driller must observe the indicators and adjust the target parameters as necessary.

Figure 3.

Experimental setup for the user test in VR: participants performed tasks using the VR

prototype. The test leader evaluated the execution of the tasks, using a 3-level scale (Table 2).

The designs on the touchscreen and indicators on the front screen corresponded

to those of the real test. A camera (GoPro Hero 5; GoPro Inc., San Mateo, CA, USA)

documented the statements and interactions of the participants during the tests.

2.3. Experimental Procedure

2.3.1. Simulated Work Processes

The central work processes for geological drilling using the rotary drilling method

are “drilling”, “connection making”, and “tripping” [

]. During drilling, a rotating drill

bit mechanically crushes the rock to be drilled through. Subsequently, the crushed rock

is conveyed to the surface using a drilling fluid pumped through a drill string. Torque is

applied by a top drive, which is connected to the drill string, and the drive may be moved

vertically in a mast. The drill string suspended in the mast is typically composed of several

drill pipes with an approximate length of 9 m. After every 9 m (or more) of drilling, a new

drill pipe is screwed onto the drill string. This process is called “connection making”. To

continue the drilling process, the top drive must be reconnected to the drill string. This

process is called “top drive connection”. When the drill bit needs to be changed (for wear

reasons), the drill string is pulled out of the ground step-by-step to sequentially unscrew

the individual drill pipes. This process is called “tripping” [

]. During the drilling process,

Appl. Sci. 2023,13, 8319 6 of 20

the driller must observe the indicators and adjust the target parameters as necessary. In the

“top drive connection” and “tripping” processes, the driller must simultaneously observe

indicators and operate controls located on the side consoles. Since this study focuses on

interactions between the driller and prototype, the processes “top drive connection” and

“tripping” were simulated.

2.3.2. Real Prototype

At the beginning of the tests, the participants were provided a standardized introduc-

tion to the test procedure and prototype. The controls on the side consoles and indicators

on the front screen were explained to the participants. Subsequently, the participants

performed 20 different introductory tasks, such as “logging in”, “opening inside blowout

preventer” (IBOP), and “displaying camera image of the mast”. These tasks were process-

independent, and were used to test menu structures. All tasks and instructions for the

participants can be found in the Supplementary Materials.

The tripping process was simulated following the introductory tasks. The participants

were asked to pull the drill pipe, set it down, and move the top drive back onto the drill

string. The use cases consisted of 17 tasks (Figure 4). After the tripping process, the “top

drive connection” process was simulated. The use cases consisted of eight tasks (Figure 5).

Finally, participants were asked to repeat the tripping process. The learnability of the

prototype was investigated by comparing two tripping simulations.

After each process execution, the participants completed an after-scenario question-

naire (ASQ) (see Section 2.4.2). At the end of the experiment, the participants filled out the

user experience questionnaire (UEQ) (Section 2.4.3) and system usability scale (Section 2.4.4).

Finally, semi-structured interviews were conducted with the experts (Section 2.4.6).

2.3.3. VR Prototype

The drilling experts in the VR environment were instructed according to the same

standardized procedure as the real-world drilling prototype. Subsequently, the test partici-

pants were shown how to use the controllers in VR to grip the control elements and operate

a touchscreen. The drilling experts operated each control element once. The novices were

shown a video of the basics of the rotary-drilling procedure and work processes to be

carried out in the study, prior to receiving standardized instruction on the prototype and

VR controllers. The subsequent procedure was identical to the user test in the real world.

The test participants completed the questionnaires after removing the VR HMD.

2.4. Measures

2.4.1. Task Success Rate

A drilling expert and usability expert evaluated the performance of the tasks according

to a 3-level rating scheme (Table 2). After the participants completed the test, their ratings

were compared. In case of differences, the video material was reviewed and a rating was

agreed upon.

Table 2. Criteria for assessing task success rate.

Evaluation Description

Good Fast operation without assistance

Error-free execution

Medium Prolonged hesitation before operation

Errors are corrected without indications by the test leader

Poor Execution of the task after assistance of the test leader

For analysis, the ratings were presented as stacked bar graphs (Figures 4and 5). Each

task was rated individually. The bars indicate the relative frequencies of the evaluation

Appl. Sci. 2023,13, 8319 7 of 20

levels (green = good, yellow = medium, red = poor). Subsequently, success rates were

calculated using the following formula (Nielsen [32]):

success rate =∑good +∑medium·0.5

participants·tasks ×100

The calculated success rates were averaged and individually evaluated for each task

and usage scenario.

2.4.2. Satisfaction with the Task Performance

After each usage scenario, the participants completed the ASQ. Participants rated the

following questions with a seven-point Likert scale ranging from strongly agree to strongly

disagree. The scale assigns values ranging from one (strongly agree) to seven (strongly

disagree). For the evaluation, the arithmetic mean of all of the ratings per prototype

was determined.

•“Overall, I am satisfied with the ease of completing the tasks in this scenario”.

•

“Overall, I am satisfied with the amount of time it took to complete the tasks in

this scenario”.

2.4.3. User Experience

User experience was measured using the UEQ [

]. The UEQ consists of 26 bipolar

items divided into the following six dimensions:

•Attractiveness: Describes the overall impression of the product.

•

Perspicuity: Describes a user’s feeling that the interaction with a product is easy,

predictable, and controllable.

•Efficiency: Describes how quickly and efficiently the user can use the product.

•Dependability: Describes the feeling of being in control of the system.

•Stimulation: Describes the user’s interest and enthusiasm for the product.

•Novelty: Describes whether product design is perceived as innovative or creative.

Participants rated the items using a seven-point Likert scale [

]. Each box on the

Likert scale was assigned a value between

−

3 and +3. +3 corresponds to an adjective

with positive connotation. The scores were averaged per dimension and reported as the

UEQ score.

2.4.4. User Acceptance

User acceptance was measured using the system usability scale (SUS) [

]. The SUS is

an effective and simple method for evaluating the user acceptance of a system, and consists

of 10 alternating positive and negative statements. Each statement was given a point

between one and five. Depending on the phrasing of the item (positive/negative), a 5-point

score corresponds to either the statement “strongly agree” or “strongly disagree”. The

results were expressed as a score between 0 (negative) and 100 (positive). This 100-point

scale facilitated the comparison of different products [36].

2.4.5. Number of User Interactions and Time for Use Scenario

To check whether the participants interacted with the prototype comparably in VR

and reality, the number of user interactions and completion times were recorded for each

use scenario. Grasping the joystick, turning the control knob, and pressing a button on the

touchscreen were considered as interactions. The processing time was defined as the time

between reading the use scenario aloud and completing it.

2.4.6. Semi-Structured Interview on Satisfaction with the Prototype

Following the usability tests, semi-structured interviews were conducted with experts.

The interview questions are presented in Table 3. The participants who tested the virtual

prototype remained in the VR environment during the interviews. The interviews were

Appl. Sci. 2023,13, 8319 8 of 20

evaluated using content analysis, according to Mayring [

]. Hence, the interviews were

transcribed, and categories were inductively formed. These categories were then quantified.

Potential improvements to the prototype were identified via two workshops with usability

experts (n = 2) and engineers developing drilling rigs (n = 3). Separate workshops were

conducted for the VR and real prototypes.

Table 3. Interview questions.

Category Question

General operation

How do you evaluate the operation?

Which functions are you missing?

Are there functions that would not be directly accessible during

safety-critical moments?

Would you prefer this concept to the previous operating concept?

Touchscreens

How do you like the touchscreens?

Could you read everything on the touchscreens?

Do the touch fields have an appropriate size?

Control elements

How do you like the control elements on the side consoles?

Are there any physical control elements that you would prefer to have

implemented as touch functions?

Are there touch functions that you would prefer to have implemented as

physical control elements?

Iron roughneck

How do you rate the control of the iron roughneck via the touchscreens?

Indicators How do you rate the indicator screen?

Could you read indicators correctly?

2.5. Statistical Analysis

Statistical analyses were performed using SPSS Statistics software (version 27, IBM,

Armonk, NY, USA). A t-test for independent samples was used to examine whether the

experts achieved significantly (

= 0.05) different results in the user test when using the VR

prototype compared to the real prototype. Evaluation parameter success rates, ASQ scores,

UEQ scores, SUS scores, number of interactions, and task completion time presented in

Section 2.4 were compared.

To examine construct validity, the t-test for independent samples was used to deter-

mine whether the evaluation parameters differed significantly between the VR drilling

expert and novice groups (

= 0.05). However, the differences between the novices in VR

and experts in the real world were not examined.

All of the participant groups performed the tripping process twice. The t-test for

independent samples was used to examine whether the parameter success rates, ASQ

scores, numbers of interactions, and task completion times significantly improved when

performed for the second time (α= 0.05).

3. Results

3.1. Task Success Rate

With the real prototype, the drilling experts achieved a mean success rate of 88

12%

when performing the introductory tasks. With the VR prototype, the experts achieved

a success rate of 87

14%, while the novices achieved a success rate of 87

17%. No

significant differences were observed (p> 0.05). The success rates for each task are provided

in the Supplementary Materials.

Figure 4presents the success rates of the tripping process. The drilling experts achieved

a mean success rate of 91

13% with the real prototype. For the VR prototype, the experts

scored a mean success rate of 91

12%. The novices achieved a mean success rate of

76 ±18%

with the VR prototype. The difference between drilling experts and novices in

the mean success rate for the VR prototype was significant (p= 0.006). The definitions of

the individual tasks can be found in the Glossary.

Appl. Sci. 2023,13, 8319 9 of 20

Moreover, Figure 4shows that experts had issues with the same tasks in both the VR

and real prototype environments. For example, experts were sometimes unable to extend

the iron roughneck (IR) and adjust its height in both VR and reality (Tasks 7 and 8). An iron

roughneck is a machine used for screwing and unscrewing a drill string. Several novices

had issues in Tasks 9–15. The experts had no difficulty in performing these tasks.

Appl. Sci. 2023, 13, x FOR PEER REVIEW 9 of 20

significant diﬀerences were observed (p > 0.05). The success rates for each task are pro-

vided in the Supplementary Materials.

Figure 4 presents the success rates of the tripping process. The drilling experts

achieved a mean success rate of 91 ± 13% with the real prototype. For the VR prototype,

the experts scored a mean success rate of 91 ± 12%. The novices achieved a mean success

rate of 76 ± 18% with the VR prototype. The diﬀerence between drilling experts and nov-

ices in the mean success rate for the VR prototype was significant (p = 0.006). The defini-

tions of the individual tasks can be found in the Glossary.

Moreover, Figure 4 shows that experts had issues with the same tasks in both the VR

and real prototype environments. For example, experts were sometimes unable to extend

the iron roughneck (IR) and adjust its height in both VR and reality (Tasks 7 and 8). An

iron roughneck is a machine used for screwing and unscrewing a drill string. Several nov-

ices had issues in Tasks 9–15. The experts had no diﬃculty in performing these tasks.

Figure 4. Comparison of the success rates for the simulation of the tripping process. The bars indi-

cate the relative frequencies of the evaluation levels (green = good, yellow = medium, red = poor).

Figure 5 lists the success rates of the simulation of the top drive connection process.

The experts achieved a mean success rate of 94 ± 5% with the real prototype. For VR, the

experts achieved a success rate of 95 ± 9%. The novices achieved a mean success rate of 84

± 19%. There were no significant diﬀerences (p > 0.05).

Figure 5. Comparison of success rates for the simulation of the top drive (TD) connection process.

The bars indicate the relative frequencies of the evaluation levels (green = good, yellow = medium,

red = poor)

1 Pull drill string out of PWR slips 100% 64% 65%

2 Open PWR slips 94% 100% 80%

3 Pull up drill string 88% 100% 100%

4 Close PWR slips 69% 79% 85%

5 Place drill string into PWR slips 94% 86% 55%

6 Call up IR menu 100% 86% 50%

7 Extend IR 63% 71% 55%

8 Set IR height 63% 71% 50%

9 Close button clamp 100% 100% 95%

10 Unscrew drill string 100% 100% 100%

11 Extend Link Tilt 100% 100% 90%

12 Lower Elevator 100% 100% 70%

13 Open Elevator 100% 100% 75%

14 Activate active float 100% 100% 95%

15 Lower top drive to rig floor 100% 100% 85%

16 Extend and retract Link Tilt 86% 93% 55%

17 Close Elevator 100% 100% 95%

Total: 91% 91% 76%

Tasks Real Prototype Experts VR Prototype Experts VR Prototype Novices

1 Activate TD mode "Spin" 94% 100% 65%

2 Lower TD 100% 100% 50%

3 Close Clamp 94% 79% 95%

4 Activate TD mode "Torque" 94% 100% 90%

5 Open IBOP 88% 100% 95%

6 Activate TD mode "Drill" 100% 86% 95%

7 Open Clamp 88% 100% 100%

Total: 94% 95% 84%

Tasks Real Prototype Experts VR Prototype Experts VR Prototype Novices

Figure 4.

Comparison of the success rates for the simulation of the tripping process. The bars indicate

the relative frequencies of the evaluation levels (green = good, yellow = medium, red = poor).

Figure 5lists the success rates of the simulation of the top drive connection process.

The experts achieved a mean success rate of 94

5% with the real prototype. For VR, the

experts achieved a success rate of 95

9%. The novices achieved a mean success rate of

84 ±19%. There were no significant differences (p> 0.05).