Continuous Temporal Contiguity Narration-Animation for Learning in Virtual Reality

Saved in:
Bibliographic Details
Title: Continuous Temporal Contiguity Narration-Animation for Learning in Virtual Reality
Language: English
Authors: Laurie Porte (ORCID 0000-0003-2132-2706), Jean-Michel Boucheix (ORCID 0000-0003-0830-0609), Stéphane Argon, Clémence Rougeot
Source: Journal of Computer Assisted Learning. 2026 42(2).
Availability: Wiley. Available from: John Wiley & Sons, Inc. 111 River Street, Hoboken, NJ 07030. Tel: 800-835-6770; e-mail: cs-journals@wiley.com; Web site: https://www.wiley.com/en-us
Peer Reviewed: Y
Page Count: 23
Publication Date: 2026
Document Type: Journal Articles
Reports - Research
Education Level: Junior High Schools
Middle Schools
Secondary Education
Descriptors: Foreign Countries, Junior High School Students, Computer Simulation, Simulated Environment, Multimedia Materials, Time Factors (Learning), Visual Learning, Multisensory Learning, Aural Learning, Computer Uses in Education, Educational Technology, Biology, Science Education, Recall (Psychology), Comprehension
Geographic Terms: France
DOI: 10.1002/jcal.70218
ISSN: 0266-4909
1365-2729
Abstract: Background: Temporal contiguity between visual and verbal (narrated) information could strongly influence learning from multimedia animations, especially in the case of a rich visual context such as Virtual Reality. Compared to spatial contiguity, less research has been carried out on the impact of the time lag, for example, the de-synchronisation, between visual (pictures) and verbal (text) information during animation-based learning and mainly on adults. Further, the results of this research were mixed, especially in the case of continuous temporal contiguity. The material conventionally used was composed of short, schematic animations and not of a rich, visual, learning context and no studies have been carried out on temporal contiguity in virtual reality. Objectives: In order to help satisfy this need for research, we used a multimedia lesson, in VR (desktop) about organic matter decomposition in a forest environment to investigate the influence of the (de)-synchronisation of, for example, time lag between, narrated text and dynamic pictures in rich PC-VR video. The theoretical framework of the study involved a consistent combination of the Cognitive Theory of Multimedia Learning (CTML, Mayer 2014) and of the Time-Based Resource-Sharing model of working memory (TBRS, Barrouillet and Camos 2021). Method: In an experimental pre-post-test learning paradigm, the effects of five new temporal contiguity conditions on various recall measures, comprehension, and also new episodic memory tasks were tested on 227 junior secondary school participants (second year: 12 years old). They were invited to study an on-screen Virtual Reality lesson on the European forest soil decomposition mechanism. The delays between the narrated words and the corresponding animations were: -6, -2, 0, +2, +6 s, respectively. Prior knowledge, working memory capacity, and spatial abilities were controlled. Results and Conclusions: Temporal gaps, even -2 s had a significant negative effect on recall and comprehension. Performances were influenced by working memory capacity. However, the episodic memory task results suggested that text-picture integration with long text segments was greater than previous literature has shown. Learners were relatively aware of time lags. Implications: In the future, precise time-locked correspondence between spoken text and dynamic pictures might turn out to be of high importance for immersive VR, because of the high degree of interactivity, for example, user control of the environment dynamics. These results are consistent with the CTML and the TBRS models, and they suggest that it would be worthwhile to incorporate episodic memory measures into the CTML.
Abstractor: As Provided
Entry Date: 2026
Accession Number: EJ1500453
Database: ERIC
Description
Abstract:Background: Temporal contiguity between visual and verbal (narrated) information could strongly influence learning from multimedia animations, especially in the case of a rich visual context such as Virtual Reality. Compared to spatial contiguity, less research has been carried out on the impact of the time lag, for example, the de-synchronisation, between visual (pictures) and verbal (text) information during animation-based learning and mainly on adults. Further, the results of this research were mixed, especially in the case of continuous temporal contiguity. The material conventionally used was composed of short, schematic animations and not of a rich, visual, learning context and no studies have been carried out on temporal contiguity in virtual reality. Objectives: In order to help satisfy this need for research, we used a multimedia lesson, in VR (desktop) about organic matter decomposition in a forest environment to investigate the influence of the (de)-synchronisation of, for example, time lag between, narrated text and dynamic pictures in rich PC-VR video. The theoretical framework of the study involved a consistent combination of the Cognitive Theory of Multimedia Learning (CTML, Mayer 2014) and of the Time-Based Resource-Sharing model of working memory (TBRS, Barrouillet and Camos 2021). Method: In an experimental pre-post-test learning paradigm, the effects of five new temporal contiguity conditions on various recall measures, comprehension, and also new episodic memory tasks were tested on 227 junior secondary school participants (second year: 12 years old). They were invited to study an on-screen Virtual Reality lesson on the European forest soil decomposition mechanism. The delays between the narrated words and the corresponding animations were: -6, -2, 0, +2, +6 s, respectively. Prior knowledge, working memory capacity, and spatial abilities were controlled. Results and Conclusions: Temporal gaps, even -2 s had a significant negative effect on recall and comprehension. Performances were influenced by working memory capacity. However, the episodic memory task results suggested that text-picture integration with long text segments was greater than previous literature has shown. Learners were relatively aware of time lags. Implications: In the future, precise time-locked correspondence between spoken text and dynamic pictures might turn out to be of high importance for immersive VR, because of the high degree of interactivity, for example, user control of the environment dynamics. These results are consistent with the CTML and the TBRS models, and they suggest that it would be worthwhile to incorporate episodic memory measures into the CTML.
ISSN:0266-4909
1365-2729
DOI:10.1002/jcal.70218