The Effect of Multimodal Input on L2 Learners' Reading Comprehension: A Preregistered Eye-Tracking Study

Saved in:
Bibliographic Details
Title: The Effect of Multimodal Input on L2 Learners' Reading Comprehension: A Preregistered Eye-Tracking Study
Language: English
Authors: Tetiana Tytko (ORCID 0000-0002-1501-9279), Bronson Hui (ORCID 0000-0003-1822-3109), Nick B. Pandža (ORCID 0000-0003-2503-0778)
Source: Modern Language Journal. 2026 110(1):210-239.
Availability: Wiley. Available from: John Wiley & Sons, Inc. 111 River Street, Hoboken, NJ 07030. Tel: 800-835-6770; e-mail: cs-journals@wiley.com; Web site: https://www.wiley.com/en-us
Peer Reviewed: Y
Page Count: 30
Publication Date: 2026
Document Type: Journal Articles
Reports - Research
Education Level: Adult Education
Descriptors: Linguistic Input, Second Language Learning, Reading Comprehension, Eye Movements, Adult Students, Adult Learning, Spanish, Illustrations
DOI: 10.1111/modl.70023
ISSN: 0026-7902
1540-4781
Abstract: Multimodal materials (e.g., written text supplemented by images and/or audio) are commonplace in language classrooms. While they have been consistently shown to be beneficial for vocabulary acquisition, the efficacy of multimodal input in scaffolding text comprehension is less clear. Conflicting findings have also been reported in terms of the relationship between comprehension and attention to pictures as measured by eye tracking. In this preregistered study, we provide further empirical evidence to this research base by testing a new population, adult beginners, and by including a reading-only, no-image condition as the baseline. In a counterbalanced within-subject design, 65 learners of Spanish were exposed to different parts of a story on an eye tracker under three experimental conditions: reading only (RO), reading + image (RI), and reading + image + audio (RIA). Results revealed that comprehension was higher in the RIA and RI than in the RO condition, indicative of the usefulness of the audio and/or the image. The number of looks at the images was higher in the RIA condition, confirming that the audio allowed readers to attend to the pictorial information provided. Attention to the image, however, positively predicted comprehension scores only for the RIA condition.
Abstractor: As Provided
Notes: https://osf.io/hq57n/?view_only=ed878de940e44cadade885f21efeea45
Entry Date: 2026
Accession Number: EJ1500536
Database: ERIC
Description
Abstract:Multimodal materials (e.g., written text supplemented by images and/or audio) are commonplace in language classrooms. While they have been consistently shown to be beneficial for vocabulary acquisition, the efficacy of multimodal input in scaffolding text comprehension is less clear. Conflicting findings have also been reported in terms of the relationship between comprehension and attention to pictures as measured by eye tracking. In this preregistered study, we provide further empirical evidence to this research base by testing a new population, adult beginners, and by including a reading-only, no-image condition as the baseline. In a counterbalanced within-subject design, 65 learners of Spanish were exposed to different parts of a story on an eye tracker under three experimental conditions: reading only (RO), reading + image (RI), and reading + image + audio (RIA). Results revealed that comprehension was higher in the RIA and RI than in the RO condition, indicative of the usefulness of the audio and/or the image. The number of looks at the images was higher in the RIA condition, confirming that the audio allowed readers to attend to the pictorial information provided. Attention to the image, however, positively predicted comprehension scores only for the RIA condition.
ISSN:0026-7902
1540-4781
DOI:10.1111/modl.70023