NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 1 to 15 of 23,065 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Kim, Stella Y.; Lee, Won-Chan – Journal of Educational Measurement, 2023
The current study proposed several variants of simple-structure multidimensional item response theory equating procedures. Four distinct sets of data were used to demonstrate feasibility of proposed equating methods for two different equating designs: a random groups design and a common-item nonequivalent groups design. Findings indicated some…
Descriptors: Item Response Theory, Equated Scores, Monte Carlo Methods, Research Methodology
Peer reviewed Peer reviewed
Direct linkDirect link
Karoline A. Sachse; Sebastian Weirich; Nicole Mahler; Camilla Rjosk – International Journal of Testing, 2024
In order to ensure content validity by covering a broad range of content domains, the testing times of some educational large-scale assessments last up to a total of two hours or more. Performance decline over the course of taking the test has been extensively documented in the literature. It can occur due to increases in the numbers of: (a)…
Descriptors: Test Wiseness, Test Score Decline, Testing Problems, Foreign Countries
Peer reviewed Peer reviewed
Direct linkDirect link
Kelly Edwards; James Soland – Educational Assessment, 2024
Classroom observational protocols, in which raters observe and score the quality of teachers' instructional practices, are often used to evaluate teachers for consequential purposes despite evidence that scores from such protocols are frequently driven by factors, such as rater and temporal effects, that have little to do with teacher quality. In…
Descriptors: Classroom Observation Techniques, Teacher Evaluation, Accuracy, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025
Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…
Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment
Peer reviewed Peer reviewed
Direct linkDirect link
Fellinghauer, Carolina; Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2023
This simulation study investigated to what extent departures from construct similarity as well as differences in the difficulty and targeting of scales impact the score transformation when scales are equated by means of concurrent calibration using the partial credit model with a common person design. Practical implications of the simulation…
Descriptors: True Scores, Equated Scores, Test Items, Sample Size
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Lu, Ru; Kim, Sooyeon – ETS Research Report Series, 2021
This study evaluated the impact of subgroup weighting for equating through a common-item anchor. We used data from a single test form to create two research forms for which the equating relationship was known. The results showed that equating was most accurate when the new form and reference form samples were weighted to be similar to the target…
Descriptors: Equated Scores, Weighted Scores, Raw Scores, Test Items
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Tom Benton – Practical Assessment, Research & Evaluation, 2025
This paper proposes an extension of linear equating that may be useful in one of two fairly common assessment scenarios. One is where different students have taken different combinations of test forms. This might occur, for example, where students have some free choice over the exam papers they take within a particular qualification. In this…
Descriptors: Equated Scores, Test Format, Test Items, Computation
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Bal-Sezerel, Bilge; Atesgöz, N. Nazli; Kirisçi, Nilgün – Journal of Theoretical Educational Science, 2023
The Flynn effect, which advocated that there was a rise in the global IQ score, was widely accepted by the relevant scientific community. However, there are recent research findings that this effect has been reversed. In this study, both Flynn and anti-Flynn effects were investigated. The purpose of this study is to analyze students' general,…
Descriptors: Intelligence Tests, Scores, Elementary School Students, Intelligence Quotient
Peer reviewed Peer reviewed
Direct linkDirect link
Li, Dongmei; Kapoor, Shalini – Educational Measurement: Issues and Practice, 2022
Population invariance is a desirable property of test equating which might not hold when significant changes occur in the test population, such as those brought about by the COVID-19 pandemic. This research aims to investigate whether equating functions are reasonably invariant when the test population is impacted by the pandemic. Based on…
Descriptors: Test Items, Equated Scores, COVID-19, Pandemics
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Kim, Sooyeon; Walker, Michael E. – ETS Research Report Series, 2021
Equating the scores from different forms of a test requires collecting data that link the forms. Problems arise when the test forms to be linked are given to groups that are not equivalent and the forms share no common items by which to measure or adjust for this group nonequivalence. We compared three approaches to adjusting for group…
Descriptors: Equated Scores, Weighted Scores, Sampling, Multiple Choice Tests
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Abigail R. Vild; Maggie E. Wilson; Christopher A. Was – Journal of Research in Education, 2025
Theories of self-regulated learning suggest a positive link between knowledge monitoring accuracy (the ability to predict test performance) and performance on tests. Put differently, students who accurately monitor their knowledge of course content more efficiently regulate study of course materials. However, a plethora of literature indicates…
Descriptors: Student Satisfaction, Undergraduate Students, Scores, Prediction
Peer reviewed Peer reviewed
Direct linkDirect link
Stefan O'Grady – TESOL Journal, 2025
Task-based language assessment represents a major component of task-based language teaching syllabi. Current perspectives emphasise the importance of tasks in the assessment process, suggesting that adherence to influential models of language production during task design yields predictable test outcomes. The current study contends that the…
Descriptors: Task Analysis, Language Tests, Evaluators, Rating Scales
Peer reviewed Peer reviewed
Direct linkDirect link
Lauren E. Bates; Sarah J. Myers; Edward L. DeLosh; Matthew G. Rhodes – Psychology Learning and Teaching, 2025
The present work assessed a quizzing method that combines the benefits of retrieval practice and feedback, whereby learners must continue taking quizzes until they achieve a perfect score with feedback provided (i.e., "mastery quizzing"). Across four experiments (n = 952; age 18-76, M = 37.10, SD = 11.61; 50% female, 48% male, 2% other…
Descriptors: Mastery Tests, Retention (Psychology), Evaluation Methods, Adults
Blake H. Heller – Annenberg Institute for School Reform at Brown University, 2024
In 2016, the GED® introduced college readiness benchmarks designed to identify testers who are academically prepared for credit-bearing college coursework. The benchmarks are promoted as awarding college credits or exempting "college-ready" GED® graduates from remedial coursework. I show descriptive evidence that those identified as…
Descriptors: High School Equivalency Programs, College Readiness, Eligibility, Benchmarking
Peer reviewed Peer reviewed
Direct linkDirect link
Marion Durbahn; Michael Rodgers; Marijana Macis; Elke Peters – Studies in Second Language Acquisition, 2024
This study aimed to investigate the relationship between lexical coverage and TV viewing comprehension. Previous studies have indicated that 95% to 98% of lexical coverage may be needed for reading comprehension (Hu & Nation, 2000). To understand informal listening passages, lower coverage figures (95%-90%) may suffice. However, no study has…
Descriptors: Television Viewing, Lexicology, Comprehension, Visual Aids
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8  |  9  |  10  |  11  |  ...  |  1538