ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	3
Since 2016 (last 10 years)	13
Since 2006 (last 20 years)	20

Source

Educational Measurement:…

Publication Type

Journal Articles	24
Reports - Research	24
Speeches/Meeting Papers	1

Education Level

Higher Education	5
Postsecondary Education	2
Secondary Education	2
Elementary Education	1
Elementary Secondary Education	1
Grade 6	1
High Schools	1

Audience

Location

Canada	1
Ireland	1
Netherlands	1
United Kingdom	1

Laws, Policies, & Programs

No Child Left Behind Act 2001

Assessments and Surveys

SAT (College Admission Test)	2
ACT Assessment	1
National Assessment of…	1
Program for International…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 24 results Save | Export

The Role of Response Style Adjustments in Cross-Country Comparisons--A Case Study Using Data from the PISA 2015 Questionnaire

Peer reviewed

Direct link

Ulitzsch, Esther; Lüdtke, Oliver; Robitzsch, Alexander – Educational Measurement: Issues and Practice, 2023

Country differences in response styles (RS) may jeopardize cross-country comparability of Likert-type scales. When adjusting for rather than investigating RS is the primary goal, it seems advantageous to impose minimal assumptions on RS structures and leverage information from multiple scales for RS measurement. Using PISA 2015 background…

Descriptors: Response Style (Tests), Comparative Analysis, Achievement Tests, Foreign Countries

A Longitudinal Diagnostic Model with Hierarchical Learning Trajectories

Peer reviewed

Direct link

Zhan, Peida; He, Keren – Educational Measurement: Issues and Practice, 2021

In learning diagnostic assessments, the attribute hierarchy specifies a sequential network of interrelated attribute mastery processes, which makes a test blueprint consistent with the cognitive theory. One of the most important functions of attribute hierarchy is to guide or limit the developmental direction of students and then form a…

Descriptors: Longitudinal Studies, Models, Comparative Analysis, Diagnostic Tests

A Comparison of Two Alternate Scaling Approaches Employed for Task Analyses in Credentialing Examination Development

Peer reviewed

Direct link

Fidler, James R.; Risk, Nicole M. – Educational Measurement: Issues and Practice, 2019

Credentialing examination developers rely on task (job) analyses for establishing inventories of task and knowledge areas in which competency is required for safe and successful practice in target occupations. There are many ways in which task-related information may be gathered from practitioner ratings, each with its own advantage and…

Descriptors: Job Analysis, Scaling, Licensing Examinations (Professions), Test Construction

A Model-Data-Fit-Informed Approach to Score Resolution in Performance Assessments

Peer reviewed

Direct link

Wind, Stefanie A.; Walker, A. Adrienne – Educational Measurement: Issues and Practice, 2021

Many large-scale performance assessments include score resolution procedures for resolving discrepancies in rater judgments. The goal of score resolution is conceptually similar to person fit analyses: To identify students for whom observed scores may not accurately reflect their achievement. Previously, researchers have observed that…

Descriptors: Goodness of Fit, Performance Based Assessment, Evaluators, Decision Making

Rubric Rating with MFRM versus Randomly Distributed Comparative Judgment: A Comparison of Two Approaches to Second-Language Writing Assessment

Peer reviewed

Direct link

Sims, Maureen E.; Cox, Troy L.; Eckstein, Grant T.; Hartshorn, K. James; Wilcox, Matthew P.; Hart, Judson M. – Educational Measurement: Issues and Practice, 2020

The purpose of this study is to explore the reliability of a potentially more practical approach to direct writing assessment in the context of ESL writing. Traditional rubric rating (RR) is a common yet resource-intensive evaluation practice when performed reliably. This study compared the traditional rubric model of ESL writing assessment and…

Descriptors: Scoring Rubrics, Item Response Theory, Second Language Learning, English (Second Language)

Systematic Comparison of Decision Accuracy of Complex Compensatory Decision Rules Combining Multiple Tests in a Higher Education Context

Peer reviewed

Direct link

Yocarini, Iris E.; Bouwmeester, Samantha; Smeets, Guus; Arends, Lidia R. – Educational Measurement: Issues and Practice, 2018

This real-data-guided simulation study systematically evaluated the decision accuracy of complex decision rules combining multiple tests within different realistic curricula. Specifically, complex decision rules combining conjunctive aspects and compensatory aspects were evaluated. A conjunctive aspect requires a minimum level of performance,…

Descriptors: Comparative Analysis, Decision Making, Accuracy, Higher Education

Reliably Assessing Growth with Longitudinal Diagnostic Classification Models

Peer reviewed

Direct link

Madison, Matthew J. – Educational Measurement: Issues and Practice, 2019

Recent advances have enabled diagnostic classification models (DCMs) to accommodate longitudinal data. These longitudinal DCMs were developed to study how examinees change, or transition, between different attribute mastery statuses over time. This study examines using longitudinal DCMs as an approach to assessing growth and serves three purposes:…

Descriptors: Longitudinal Studies, Item Response Theory, Psychometrics, Criterion Referenced Tests

On Natural Variation in Grades in Higher Education, and Its Implications for Assessing Effectiveness of Educational Innovations

Peer reviewed

Direct link

Boevé, Anja J.; Meijer, Rob R.; Beldhuis, Hans J. A.; Bosker, Roel J.; Albers, Casper J. – Educational Measurement: Issues and Practice, 2019

To investigate the effect of innovations in the teaching-learning environment, researchers often compare study results from different cohorts across years. However, variance in scores can be attributed to both random fluctuation and systematic changes due to the innovation, complicating cohort comparisons. In the present study, we illustrate how…

Descriptors: Grades (Scholastic), Foreign Countries, Teaching Methods, Educational Innovation

An Investigation of Undefined Cut Scores with the Hofstee Standard-Setting Method

Peer reviewed

Direct link

Wyse, Adam E.; Babcock, Ben – Educational Measurement: Issues and Practice, 2017

This article provides an overview of the Hofstee standard-setting method and illustrates several situations where the Hofstee method will produce undefined cut scores. The situations where the cut scores will be undefined involve cases where the line segment derived from the Hofstee ratings does not intersect the score distribution curve based on…

Descriptors: Cutting Scores, Evaluation Methods, Standard Setting (Scoring), Comparative Analysis

How Should Colleges Treat Multiple Admissions Test Scores?

Peer reviewed

Direct link

Mattern, Krista; Radunzel, Justine; Bertling, Maria; Ho, Andrew D. – Educational Measurement: Issues and Practice, 2018

The percentage of students retaking college admissions tests is rising. Researchers and college admissions offices currently use a variety of methods for summarizing these multiple scores. Testing organizations such as ACT and the College Board, interested in validity evidence like correlations with first-year grade point average (FYGPA), often…

Descriptors: College Admission, Scores, Correlation, College Entrance Examinations

Predicting Freshman Grade-Point Average from Test Scores: Effects of Variation within and between High Schools

Peer reviewed

Direct link

Koretz, D.; Langi, M. – Educational Measurement: Issues and Practice, 2018

Most studies predicting college performance from high-school grade point average (HSGPA) and college admissions test scores use single-level regression models that conflate relationships within and between high schools. Because grading standards vary among high schools, these relationships are likely to differ within and between schools. We used…

Descriptors: Prediction, High School Students, Grade Point Average, Scores

Five Methods for Estimating Angoff Cut Scores with IRT

Peer reviewed

Direct link

Wyse, Adam E. – Educational Measurement: Issues and Practice, 2017

This article illustrates five different methods for estimating Angoff cut scores using item response theory (IRT) models. These include maximum likelihood (ML), expected a priori (EAP), modal a priori (MAP), and weighted maximum likelihood (WML) estimators, as well as the most commonly used approach based on translating ratings through the test…

Descriptors: Cutting Scores, Item Response Theory, Bayesian Statistics, Maximum Likelihood Statistics

The Impact of Measurement Error on the Accuracy of Individual and Aggregate SGP

Peer reviewed

Direct link

McCaffrey, Daniel F.; Castellano, Katherine E.; Lockwood, J. R. – Educational Measurement: Issues and Practice, 2015

Student growth percentiles (SGPs) express students' current observed scores as percentile ranks in the distribution of scores among students with the same prior-year scores. A common concern about SGPs at the student level, and mean or median SGPs (MGPs) at the aggregate level, is potential bias due to test measurement error (ME). Shang,…

Descriptors: Error of Measurement, Accuracy, Achievement Gains, Students

Examining the Reliability of Student Growth Percentiles Using Multidimensional IRT

Peer reviewed

Direct link

Monroe, Scott; Cai, Li – Educational Measurement: Issues and Practice, 2015

Student growth percentiles (SGPs, Betebenner, 2009) are used to locate a student's current score in a conditional distribution based on the student's past scores. Currently, following Betebenner (2009), quantile regression (QR) is most often used operationally to estimate the SGPs. Alternatively, multidimensional item response theory (MIRT) may…

Descriptors: Item Response Theory, Reliability, Growth Models, Computation

Predicting College Performance of Homeschooled versus Traditional Students

Peer reviewed

Direct link

Yu, Martin C.; Sackett, Paul R.; Kuncel, Nathan R. – Educational Measurement: Issues and Practice, 2016

The prevalence of homeschooling in the United States is increasing. Yet little is known about how commonly used predictors of postsecondary academic performance (SAT, high school grade point average [HSGPA]) perform for homeschooled students. Postsecondary performance at 140 colleges and universities was analyzed comparing a sample of traditional…

Descriptors: Predictor Variables, Academic Achievement, College Students, Home Schooling

Previous Page | Next Page »

Pages: 1 | 2

Comparative Analysis	24
Scores	8
Simulation	7
Foreign Countries	5
Correlation	4
Evaluation Methods	4
Item Response Theory	4
College Entrance Examinations	3
Cutting Scores	3
Grade Point Average	3
Longitudinal Studies	3
Mathematics Tests	3
Models	3
Regression (Statistics)	3
Test Items	3
Academic Achievement	2
Accuracy	2
Achievement Tests	2
Classification	2
College Freshmen	2
Computation	2
Decision Making	2
Educational Assessment	2
Error of Measurement	2
Learning Processes	2
More ▼

Wyse, Adam E.	2
Albers, Casper J.	1
Allen, Jessica	1
Arends, Lidia R.	1
Babcock, Ben	1
Beldhuis, Hans J. A.	1
Bertling, Maria	1
Boevé, Anja J.	1
Bosker, Roel J.	1
Bouwmeester, Samantha	1
Cai, Li	1
Castellano, Katherine E.	1
Childs, Ruth A.	1
Cox, Troy L.	1
Crisp, Victoria	1
Dunn, Jennifer L.	1
Eckstein, Grant T.	1
Fidler, James R.	1
Hart, Judson M.	1
Hartshorn, K. James	1
He, Keren	1
Ho, Andrew D.	1
Hollingshead, Lynne	1
Koretz, D.	1
More ▼