NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 116 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Kurniasih, Nia; Emilia, Emi; Sujatna, Eva Tuckyta Sari – International Journal of Language Testing, 2023
This study aimed at evaluating a PISA-like reading test developed by teachers participating in the teacher training for teaching PISA-like reading. To serve this purpose, an experimental test was administered to 107 students aged 15-16 using a set of text and questions constructed according to the criteria of the PISA Reading test Level 1. Item…
Descriptors: International Assessment, Foreign Countries, Achievement Tests, Secondary School Students
Peer reviewed Peer reviewed
Direct linkDirect link
Huilin Chen; Zhiqing Lin; Qipeng Chen; Peida Zhan – Language Assessment Quarterly, 2025
Because of the need to provide fine-grained longitudinal diagnostic feedback, longitudinal cognitive diagnosis is an emerging approach that integrates cross-sectional cognitive diagnosis models (CDM) with longitudinal data analysis techniques. By adopting the generalized longitudinal higher-order log-linear CDM, this study attempted to track the…
Descriptors: Longitudinal Studies, Diagnostic Tests, Clinical Diagnosis, Reading Skills
Peer reviewed Peer reviewed
Direct linkDirect link
Ji, Xuejun Ryan; Wu, Amery D. – Educational Measurement: Issues and Practice, 2023
The Cross-Classified Mixed Effects Model (CCMEM) has been demonstrated to be a flexible framework for evaluating reliability by measurement specialists. Reliability can be estimated based on the variance components of the test scores. Built upon their accomplishment, this study extends the CCMEM to be used for evaluating validity evidence.…
Descriptors: Measurement, Validity, Reliability, Models
Peer reviewed Peer reviewed
Direct linkDirect link
Jana Welling; Timo Gnambs; Claus H. Carstensen – Educational and Psychological Measurement, 2024
Disengaged responding poses a severe threat to the validity of educational large-scale assessments, because item responses from unmotivated test-takers do not reflect their actual ability. Existing identification approaches rely primarily on item response times, which bears the risk of misclassifying fast engaged or slow disengaged responses.…
Descriptors: Foreign Countries, College Students, Guessing (Tests), Multiple Choice Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Daniel Kasper; Katrin Schulz-Heidorf; Knut Schwippert – Sociological Methods & Research, 2024
In this article, we extend Liao's test for across-group comparisons of the fixed effects from the generalized linear model to the fixed and random effects of the generalized linear mixed model (GLMM). Using as our basis the Wald statistic, we developed an asymptotic test statistic for across-group comparisons of these effects. The test can be…
Descriptors: Models, Achievement Tests, Foreign Countries, International Assessment
Peer reviewed Peer reviewed
Direct linkDirect link
Chen, Huilin; Cai, Yuyang; de la Torre, Jimmy – Language Assessment Quarterly, 2023
This study uses a cognitive diagnosis model (CDM) approach to investigate the associations among specific L2 reading subskills. Participants include 1,203 Year-4 English major college students randomly selected from the nationwide test takers of Band 8 of Test for English Majors (TEM8), a large-scale English proficiency test for senior English…
Descriptors: Foreign Countries, Second Language Learning, English (Second Language), Majors (Students)
Peer reviewed Peer reviewed
Direct linkDirect link
Mehrazmay, Roghayeh; Ghonsooly, Behzad; de la Torre, Jimmy – Applied Measurement in Education, 2021
The present study aims to examine gender differential item functioning (DIF) in the reading comprehension section of a high stakes test using cognitive diagnosis models. Based on the multiple-group generalized deterministic, noisy "and" gate (MG G-DINA) model, the Wald test and likelihood ratio test are used to detect DIF. The flagged…
Descriptors: Test Bias, College Entrance Examinations, Gender Differences, Reading Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Buyukatak, Emrah; Anil, Duygu – International Journal of Assessment Tools in Education, 2022
The purpose of this research was to determine classification accuracy of the factors affecting the success of students' reading skills based on PISA 2018 data by using Artificial Neural Networks, Decision Trees, K-Nearest Neighbor, and Naive Bayes data mining classification methods and to examine the general characteristics of success groups. In…
Descriptors: Classification, Accuracy, Reading Tests, Achievement Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Lohmann, Julian F.; Zitzmann, Steffen; Voelkle, Manuel C.; Hecht, Martin – Large-scale Assessments in Education, 2022
One major challenge of longitudinal data analysis is to find an appropriate statistical model that corresponds to the theory of change and the research questions at hand. In the present article, we argue that "continuous-time models" are well suited to study the continuously developing constructs of primary interest in the education…
Descriptors: Longitudinal Studies, Structural Equation Models, Time, Achievement Tests
Peer reviewed Peer reviewed
Direct linkDirect link
George, Ann Cathrice; Robitzsch, Alexander – International Journal of Testing, 2021
Modern large-scale studies such as the Progress in International Reading Literacy Study (PIRLS) do not only report reading competence of students on a global reading scale but also report reading on the level of reading subskills. However, the number of and the dependencies between the subskills are frequently discussed. In this study, different…
Descriptors: Foreign Countries, Grade 4, Achievement Tests, International Assessment
Peer reviewed Peer reviewed
Direct linkDirect link
Trendtel, Matthias; Robitzsch, Alexander – Journal of Educational and Behavioral Statistics, 2021
A multidimensional Bayesian item response model is proposed for modeling item position effects. The first dimension corresponds to the ability that is to be measured; the second dimension represents a factor that allows for individual differences in item position effects called persistence. This model allows for nonlinear item position effects on…
Descriptors: Bayesian Statistics, Item Response Theory, Test Items, Test Format
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Çigdemir, Seval – International Journal of Progressive Education, 2022
This study aims to examine the individual and environmental factors affecting the reading comprehension level through the structural equation model. To test the research questions, the relational scanning model, one of the quantitative research methods, was adopted. The research was conducted in Ankara in the 2019-2020 and 2020-2021 academic…
Descriptors: Environmental Influences, Reading Comprehension, Structural Equation Models, Elementary School Students
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Tatarinova, Galiya; Neamah, Nour Raheem; Mohammed, Aisha; Hassan, Aalaa Yaseen; Obaid, Ali Abdulridha; Ismail, Ismail Abdulwahhab; Maabreh, Hatem Ghaleb; Afif, Al Khateeb Nashaat Sultan; Viktorovna, Shvedova Irina – International Journal of Language Testing, 2023
Unidimensionality is an important assumption of measurement but it is violated very often. Most of the time, tests are deliberately constructed to be multidimensional to cover all aspects of the intended construct. In such situations, the application of unidimensional item response theory (IRT) models is not justifieddue to poor model fit and…
Descriptors: Item Response Theory, Test Items, Language Tests, Correlation
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Ningsih, Tutuk; Yuwono, Dwi Margo; Sholehuddin, M. Sugeng; Suharto, Abdul Wachid Bambang – Journal of Social Studies Education Research, 2021
Learning at home not only provides written assignments that are changed in electronic form but must also reflect student learning outcomes at home. Likewise, researchers use literary reading to avoid students getting bored with learning Indonesian language literacy and character education. However, improving literacy skills is not just reading…
Descriptors: Indonesian, Computer Assisted Testing, Fiction, Literacy
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Viengsang, Raveewan; Wasanasomsithi, Punchalee – LEARN Journal: Language Education and Acquisition Research Network, 2022
In recent decades, there has been an attempt to introduce the concept of "assessment for learning" into English language classrooms based on a belief that assessment can be utilized to assist learners in the learning process, not just for teachers to make judgments and decisions. In so doing, the learning-oriented assessment frameworks…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Reading Tests
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8