Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 10 |
Since 2006 (last 20 years) | 16 |
Descriptor
Item Response Theory | 20 |
Computer Assisted Testing | 9 |
Testing | 9 |
Test Construction | 8 |
Test Items | 7 |
Measurement | 4 |
Psychometrics | 4 |
Scores | 4 |
Test Bias | 4 |
Test Interpretation | 4 |
Testing Problems | 4 |
Source
Educational Measurement: Issues and Practice | 20 |
Author
Ahmadi, Alireza | 1 |
Harris, Deborah J. | 1 |
Dorans, Neil J. | 1 |
Embretson, Susan E. | 1 |
Engelhard, George, Jr. | 1 |
Forsyth, Robert A. | 1 |
Frey, Andreas | 1 |
Hartig, Johannes | 1 |
Hoffman, Lesa | 1 |
Houser, Ronald L. | 1 |
de la Torre, Jimmy | 1 |
Publication Type
Journal Articles | 20 |
Reports - Research | 8 |
Reports - Descriptive | 5 |
Reports - Evaluative | 4 |
Opinion Papers | 3 |
Book/Product Reviews | 1 |
Education Level
Elementary Secondary Education | 1 |
Assessments and Surveys
National Assessment of… | 3 |
ACT Assessment | 1 |
Graduate Record Examinations | 1 |
Preliminary Scholastic… | 1 |
SAT (College Admission Test) | 1 |
Ye Ma; Deborah J. Harris – Educational Measurement: Issues and Practice, 2025
Item position effect (IPE) refers to situations in which an item performs differently when it is administered in different positions on a test. Most previous research has focused on IPE under linear testing, and IPE under adaptive testing remains largely unexamined. In addition, the existence of IPE might violate Item…
Descriptors: Computer Assisted Testing, Adaptive Testing, Item Response Theory, Test Items
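For reference, a common way to formalize an item position effect within an IRT framework (not necessarily the parameterization used in this study) is to add a position term to the Rasch item difficulty, so that the effect vanishes when the position coefficient is zero:

$$P(X_{ij}=1 \mid \theta_i) = \frac{\exp\!\big(\theta_i - (b_j + \delta_j\, p_{ij})\big)}{1 + \exp\!\big(\theta_i - (b_j + \delta_j\, p_{ij})\big)}$$

Here $\theta_i$ is examinee ability, $b_j$ the item's baseline difficulty, $p_{ij}$ the (scaled) position at which examinee i encounters item j, and $\delta_j$ the position effect; $\delta_j = 0$ recovers the standard Rasch model, while $\delta_j \neq 0$ undermines the parameter invariance that adaptive item selection relies on.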
Xuelan Qiu; Jimmy de la Torre; You-Gan Wang; Jinran Wu – Educational Measurement: Issues and Practice, 2024
Multidimensional forced-choice (MFC) items have been found useful for reducing response biases in personality assessments. However, conventional scoring methods for MFC items result in ipsative data, hindering wider application of the MFC format. In the last decade, a number of item response theory (IRT) models have been developed,…
Descriptors: Item Response Theory, Personality Traits, Personality Measures, Personality Assessment
Peabody, Michael R.; Muckle, Timothy J.; Meng, Yu – Educational Measurement: Issues and Practice, 2023
The subjective aspect of standard-setting is often criticized, yet data-driven standard-setting methods are rarely applied. Therefore, we applied a mixture Rasch model approach to setting performance standards across several testing programs of various sizes and compared the results to existing passing standards derived from traditional…
Descriptors: Item Response Theory, Standard Setting, Testing, Sampling
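As background, a mixture Rasch model of the kind referenced here (the authors' exact specification may differ) assumes the population comprises latent classes $g = 1, \ldots, G$ with mixing proportions $\pi_g$ and class-specific item difficulties:

$$P(X_{ij}=1) = \sum_{g=1}^{G} \pi_g \, \frac{\exp(\theta_{ig} - b_{jg})}{1 + \exp(\theta_{ig} - b_{jg})}$$

A performance standard can then be anchored to latent class membership (for example, the boundary between a "master" and a "non-master" class) rather than resting on judges' subjective placements alone.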
Sinharay, Sandip – Educational Measurement: Issues and Practice, 2019
Test score users often demand the reporting of subscores due to their potential diagnostic, remedial, and instructional benefits. Therefore, there is substantial pressure on testing programs to report subscores. However, professional standards require that subscores satisfy minimum quality standards before they can be reported. In this…
Descriptors: Testing, Scores, Item Response Theory, Evaluation Methods
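One widely used minimum quality standard of the kind alluded to here is Haberman's proportional-reduction-in-mean-squared-error criterion (whether it is the criterion applied in this article is not stated in the excerpt): a subscore s should be reported only if it predicts the true subscore $\tau_s$ better than the total score x does,

$$\mathrm{PRMSE}(\hat{\tau}_s \mid s) > \mathrm{PRMSE}(\hat{\tau}_s \mid x)$$

In other words, the observed subscore must add value beyond what the total score already conveys about that skill area.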
Qian, Hong; Staniewska, Dorota; Reckase, Mark; Woo, Ada – Educational Measurement: Issues and Practice, 2016
This article addresses the issue of how to detect item preknowledge using item response time data in two computer-based large-scale licensure examinations. Item preknowledge is indicated by an unexpected short response time and a correct response. Two samples were used for detecting item preknowledge for each examination. The first sample was from…
Descriptors: Reaction Time, Licensing Examinations (Professions), Computer Assisted Testing, Prior Learning
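The fast-and-correct logic described in the abstract can be sketched in a few lines; the threshold, the log-scale standardization, and the function below are illustrative assumptions, not the authors' procedure.

    import numpy as np

    def flag_fast_correct(resp_time, correct, ref_log_mean, ref_log_sd, z_cut=-2.0):
        # Standardize each log response time against reference statistics
        # estimated from a sample presumed free of item preknowledge.
        z = (np.log(resp_time) - ref_log_mean) / ref_log_sd
        # A response is suspicious when it is both unexpectedly fast and correct.
        return (z < z_cut) & (correct == 1)

    # Example: flag items for one examinee
    flags = flag_fast_correct(
        resp_time=np.array([12.0, 3.5, 48.0]),      # seconds per item
        correct=np.array([1, 1, 0]),
        ref_log_mean=np.array([3.1, 3.0, 3.6]),     # log-seconds, per item
        ref_log_sd=np.array([0.4, 0.5, 0.3]),
    )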
Rafatbakhsh, Elaheh; Ahmadi, Alireza; Moloodi, Amirsaeid; Mehrpour, Saeed – Educational Measurement: Issues and Practice, 2021
Test development is a crucial yet difficult and time-consuming part of any educational system, and the task often falls entirely on teachers. Automatic item generation systems have recently drawn attention because they can reduce this burden and make test development more convenient. Such systems have been developed to generate items for vocabulary,…
Descriptors: Test Construction, Test Items, Computer Assisted Testing, Multiple Choice Tests
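A minimal template-based generator for vocabulary multiple-choice items, of the general kind the abstract describes, might look like the sketch below; the function name, stem wording, and distractor pool are hypothetical.

    import random

    def generate_vocab_item(target, gloss, distractor_pool, n_options=4):
        # Build the stem from a fixed template and the target word's gloss.
        stem = f"Which word is closest in meaning to '{gloss}'?"
        # Sample distractors (ideally words of similar frequency and part of speech).
        options = random.sample(distractor_pool, n_options - 1) + [target]
        random.shuffle(options)
        return {"stem": stem, "options": options, "key": options.index(target)}

    item = generate_vocab_item(
        target="arduous",
        gloss="requiring great effort",
        distractor_pool=["placid", "frugal", "verbose", "candid"],
    )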
Wang, Jue; Engelhard, George, Jr. – Educational Measurement: Issues and Practice, 2019
In this digital ITEMS module, Dr. Jue Wang and Dr. George Engelhard Jr. describe the Rasch measurement framework for the construction and evaluation of new measures and scales. On the theoretical side, they discuss historical and philosophical perspectives on measurement with a focus on Rasch's concept of specific objectivity and…
Descriptors: Item Response Theory, Evaluation Methods, Measurement, Goodness of Fit
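For readers unfamiliar with the framework, the dichotomous Rasch model and the specific objectivity property the module emphasizes can be stated compactly (a standard formulation, not a quote from the module):

$$P(X_{ij}=1 \mid \theta_i, b_j) = \frac{\exp(\theta_i - b_j)}{1 + \exp(\theta_i - b_j)}$$

For any item j, the difference in log-odds between two persons is $\operatorname{logit} P_{1j} - \operatorname{logit} P_{2j} = \theta_1 - \theta_2$, free of the item parameter. Person comparisons therefore do not depend on which items are used, which is Rasch's specific objectivity.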
Wind, Stefanie A.; Schumacker, Randall E. – Educational Measurement: Issues and Practice, 2017
The term measurement disturbance has been used to describe systematic conditions that affect a measurement process, resulting in a compromised interpretation of person or item estimates. Measurement disturbances have been discussed in relation to systematic response patterns associated with items and persons, such as start-up, plodding, boredom,…
Descriptors: Measurement, Testing Problems, Writing Tests, Performance Based Assessment
Embretson, Susan E. – Educational Measurement: Issues and Practice, 2016
Examinees' thinking processes have become an increasingly important concern in testing. The response processes aspect is a major component of validity, and contemporary tests increasingly involve specifications about the cognitive complexity of examinees' response processes. Yet, empirical research findings on examinees' cognitive processes are…
Descriptors: Testing, Cognitive Processes, Test Construction, Test Items
Soland, James – Educational Measurement: Issues and Practice, 2019
As computer-based tests become more common, there is a growing wealth of metadata related to examinees' response processes, which include solution strategies, concentration, and operating speed. One common type of metadata is item response time. While response times have been used extensively to improve estimates of achievement, little work…
Descriptors: Test Items, Item Response Theory, Metadata, Self Efficacy
Wyse, Adam E. – Educational Measurement: Issues and Practice, 2015
This article uses data from a large-scale assessment program to illustrate the potential issue of range restriction with the Bookmark method in the context of trying to set cut scores to closely align with a set of college and career readiness benchmarks. Analyses indicated that range restriction issues existed across different response…
Descriptors: Cutting Scores, Alignment (Education), College Readiness, Career Readiness
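As a reminder of the mechanics behind the range-restriction issue, the Bookmark method places a cut score at the ability where an examinee has a specified response probability (RP, commonly 0.67) on the bookmarked item; under a 2PL model (the RP value and the 2PL form are conventional choices, not necessarily those used here):

$$\theta_{\mathrm{cut}} = b_j + \frac{1}{a_j}\ln\!\frac{\mathrm{RP}}{1-\mathrm{RP}}$$

If the ordered item booklet contains no items whose difficulties fall near the intended benchmark, the attainable $\theta_{\mathrm{cut}}$ values are restricted to a narrower range than the cut scores panelists are asked to target.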
Pommerich, Mary – Educational Measurement: Issues and Practice, 2012
Neil Dorans has made a career of advocating for the examinee. He continues to do so in his NCME career award address, providing a thought-provoking commentary on some current trends in educational measurement that could potentially affect the integrity of test scores. Concerns expressed in the address call attention to a conundrum that faces…
Descriptors: Testing, Scores, Measurement, Test Construction
Templin, Jonathan; Hoffman, Lesa – Educational Measurement: Issues and Practice, 2013
Diagnostic classification models (also known as cognitive or skills diagnosis models) have shown great promise for evaluating mastery on a multidimensional profile of skills as assessed through examinee responses, but continued development and application of these models have been hindered by a lack of readily available software. In this article we…
Descriptors: Classification, Models, Language Tests, English (Second Language)
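As one concrete member of the family of diagnostic classification models discussed in the module (the module covers the broader family; the DINA form below is simply its most commonly cited example), mastery of attributes drives response probabilities through a Q-matrix:

$$P(X_{ij}=1 \mid \boldsymbol{\alpha}_i) = (1-s_j)^{\eta_{ij}}\, g_j^{\,1-\eta_{ij}}, \qquad \eta_{ij} = \prod_{k} \alpha_{ik}^{\,q_{jk}}$$

Here $\alpha_{ik}$ indicates whether examinee i has mastered attribute k, $q_{jk}$ whether item j requires it, and $s_j$ and $g_j$ are slip and guessing parameters; $\eta_{ij}=1$ only when all required attributes are mastered.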
Mislevy, Robert J. – Educational Measurement: Issues and Practice, 2012
This article presents the author's observations on Neil Dorans's NCME Career Award Address: "The Contestant Perspective on Taking Tests: Emanations from the Statue within." He calls attention to some points that Dr. Dorans made in his address, and offers his thoughts in response.
Descriptors: Testing, Test Reliability, Psychometrics, Scores
Dorans, Neil J. – Educational Measurement: Issues and Practice, 2012
Views on testing--its purpose and uses and how its data are analyzed--are related to one's perspective on test takers. Test takers can be viewed as learners, examinees, or contestants. I briefly discuss the perspective of test takers as learners. I maintain that much of psychometrics views test takers as examinees. I discuss test takers as a…
Descriptors: Testing, Test Theory, Item Response Theory, Test Reliability