Showing 1 to 15 of 84 results
Peer reviewed
Direct link
Guo, Jinxin; Xu, Xin; Xin, Tao – Journal of Educational Measurement, 2023
Missingness due to not-reached and omitted items has received much attention in the recent psychometric literature. Such missingness, if not handled properly, can lead to biased parameter estimation and inaccurate inferences about examinees, and can further erode the validity of the test. This paper reviews some commonly used IRT-based…
Descriptors: Psychometrics, Bias, Error of Measurement, Test Validity
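The bias this abstract warns about is easy to demonstrate. The following is a minimal sketch (not the paper's method) using synthetic Rasch data: low-ability examinees omit hard items, and difficulty is approximated by the negative logit of each item's proportion correct. The omission rule, the crude p-value estimator, and all parameter values are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
n_persons, n_items = 5000, 20
theta = rng.normal(0, 1, n_persons)        # abilities
b = np.linspace(-2, 2, n_items)            # true item difficulties

# Rasch response probabilities and observed 0/1 responses
p = 1 / (1 + np.exp(-(theta[:, None] - b[None, :])))
resp = (rng.random((n_persons, n_items)) < p).astype(float)

# Assumed missingness mechanism: low-ability examinees tend to omit hard items
omit = (theta[:, None] < -0.5) & (b[None, :] > 1.0) \
       & (rng.random((n_persons, n_items)) < 0.6)
resp_missing = resp.copy()
resp_missing[omit] = np.nan

def difficulty_from_pvalues(x):
    """Crude difficulty estimate: negative logit of the item proportion correct."""
    pv = np.nanmean(x, axis=0)
    return -np.log(pv / (1 - pv))

b_ignore = difficulty_from_pvalues(resp_missing)   # omitted responses excluded
b_wrong = difficulty_from_pvalues(np.nan_to_num(resp_missing, nan=0.0))  # omits scored 0

# Scoring omits as wrong depresses p-values on the omitted (hard) items,
# so their estimated difficulty is inflated relative to excluding the omits
print(b_wrong[-1] - b_ignore[-1])
```

Neither treatment recovers the truth here (the missingness is nonignorable), which is exactly why model-based handling of not-reached and omitted items matters.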
Peer reviewed
Direct link
Harold Doran; Tetsuhiro Yamada; Ted Diaz; Emre Gonulates; Vanessa Culver – Journal of Educational Measurement, 2025
Computer adaptive testing (CAT) is an increasingly common mode of test administration offering improved test security, better measurement precision, and the potential for shorter testing experiences. This article presents a new item selection algorithm based on a generalized objective function to support multiple types of testing conditions and…
Descriptors: Computer Assisted Testing, Adaptive Testing, Test Items, Algorithms
Peer reviewed
Direct link
Johnson, Matthew S.; Liu, Xiang; McCaffrey, Daniel F. – Journal of Educational Measurement, 2022
With the increasing use of automated scores in operational testing settings comes the need to understand the ways in which they can yield biased and unfair results. In this paper, we provide a brief survey of some of the ways in which the predictive methods used in automated scoring can lead to biased, and thus unfair, automated scores. After…
Descriptors: Psychometrics, Measurement Techniques, Bias, Automation
Peer reviewed
Direct link
Setzer, J. Carl; Cheng, Ying; Liu, Cheng – Journal of Educational Measurement, 2023
Test scores are often used to make decisions about examinees, such as in licensure and certification testing, as well as in many educational contexts. In some cases, these decisions are based upon compensatory scores, such as those from multiple sections or components of an exam. Classification accuracy and classification consistency are two…
Descriptors: Classification, Accuracy, Psychometrics, Scores
Peer reviewed
Direct link
Matthew J. Madison; Stefanie A. Wind; Lientje Maas; Kazuhiro Yamaguchi; Sergio Haab – Journal of Educational Measurement, 2024
Diagnostic classification models (DCMs) are psychometric models designed to classify examinees according to their proficiency or nonproficiency of specified latent characteristics. These models are well suited for providing diagnostic and actionable feedback to support intermediate and formative assessment efforts. Several DCMs have been developed…
Descriptors: Diagnostic Tests, Classification, Models, Psychometrics
Peer reviewed
Direct link
Cornelis Potgieter; Xin Qiao; Akihito Kamata; Yusuf Kara – Journal of Educational Measurement, 2024
As part of the effort to develop an improved oral reading fluency (ORF) assessment system, Kara et al. estimated the ORF scores based on a latent variable psychometric model of accuracy and speed for ORF data via a fully Bayesian approach. This study further investigates likelihood-based estimators for the model-derived ORF scores, including…
Descriptors: Oral Reading, Reading Fluency, Scores, Psychometrics
Peer reviewed
Direct link
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
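The core idea in this abstract, independent sigmoid outputs (one per trait) whose decision threshold can be tuned toward accuracy or recall, can be sketched without any deep-learning framework. This is a toy single-layer version on synthetic data, not the authors' model; the data-generating process, learning rate, and thresholds are all assumptions.

```python
import numpy as np

rng = np.random.default_rng(2)
n, n_feat, n_labels = 2000, 30, 5

# Synthetic responses: each latent trait contributes to the item features
traits = rng.normal(0, 1, (n, n_labels))
W_true = rng.normal(0, 1, (n_labels, n_feat))
X = traits @ W_true + rng.normal(0, 1.0, (n, n_feat))
Y = (traits > 0).astype(float)          # one binary proficiency label per trait

# Multilabel "network": independent sigmoids (not softmax -- labels co-occur),
# trained by batch gradient descent on the cross-entropy loss
W = np.zeros((n_feat, n_labels))
b = np.zeros(n_labels)
for _ in range(300):
    P = 1 / (1 + np.exp(-(X @ W + b)))
    W -= 0.5 * (X.T @ (P - Y) / n)
    b -= 0.5 * (P - Y).mean(axis=0)

P = 1 / (1 + np.exp(-(X @ W + b)))

def recall(th):
    pred = P >= th
    return (pred * Y).sum() / Y.sum()

# Lowering the decision threshold can only enlarge the predicted-positive set,
# so recall is non-decreasing as the threshold drops
print(recall(0.5), recall(0.3))
```

Tuning the threshold (or the loss) per metric is what lets the same trained scorer maximize accuracy, recall, or precision as needed.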
Peer reviewed
Direct link
Chen, Chia-Wen; Andersson, Björn; Zhu, Jinxin – Journal of Educational Measurement, 2023
The certainty of response index (CRI) measures respondents' confidence level when answering an item. In conjunction with the answers to the items, previous studies have used descriptive statistics and arbitrary thresholds to identify student knowledge profiles with the CRIs. However, this approach overlooked the measurement error of the observed…
Descriptors: Item Response Theory, Factor Analysis, Psychometrics, Test Items
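The descriptive-threshold approach this abstract critiques is essentially a quadrant rule on correctness and confidence. A minimal sketch, assuming a 0-5 CRI scale and an arbitrary cutoff of 2.5 (both assumptions, and exactly the kind of ad hoc choice, ignoring measurement error, that the paper argues against):

```python
import numpy as np

rng = np.random.default_rng(3)
n = 12
correct = rng.integers(0, 2, n).astype(bool)
cri = rng.integers(0, 6, n)        # assumed 0-5 certainty of response index
threshold = 2.5                    # arbitrary descriptive cutoff

def profile(is_correct, certainty):
    """Quadrant rule on correctness x confidence (ignores measurement error)."""
    confident = certainty > threshold
    if is_correct and confident:
        return "knowledge"
    if is_correct and not confident:
        return "lucky guess"
    if not is_correct and confident:
        return "misconception"
    return "lack of knowledge"

labels = [profile(c, r) for c, r in zip(correct, cri)]
print(labels)
```

Replacing this hard cutoff with a latent-variable model is what lets the CRIs' measurement error be taken into account.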
Peer reviewed
Direct link
Chen, Yi-Hsin; Senk, Sharon L.; Thompson, Denisse R.; Voogt, Kevin – Journal of Educational Measurement, 2019
The van Hiele theory and van Hiele Geometry Test have been extensively used in mathematics assessments across countries. The purpose of this study is to use classical test theory (CTT) and cognitive diagnostic modeling (CDM) frameworks to examine psychometric properties of the van Hiele Geometry Test and to compare how various classification…
Descriptors: Geometry, Mathematics Tests, Test Theory, Psychometrics
Peer reviewed
Direct link
Langenfeld, Thomas; Thomas, Jay; Zhu, Rongchun; Morris, Carrie A. – Journal of Educational Measurement, 2020
An assessment of graphic literacy was developed by articulating and subsequently validating a skills-based cognitive model intended to substantiate the plausibility of score interpretations. Model validation involved use of multiple sources of evidence derived from large-scale field testing and cognitive labs studies. Data from large-scale field…
Descriptors: Evidence, Scores, Eye Movements, Psychometrics
Peer reviewed
Direct link
Fay, Derek M.; Levy, Roy; Mehta, Vandhana – Journal of Educational Measurement, 2018
A common practice in educational assessment is to construct multiple forms of an assessment that consists of tasks with similar psychometric properties. This study utilizes a Bayesian multilevel item response model and descriptive graphical representations to evaluate the psychometric similarity of variations of the same task. These approaches for…
Descriptors: Psychometrics, Performance Based Assessment, Bayesian Statistics, Item Response Theory
Peer reviewed
Direct link
Embretson, Susan E.; Kingston, Neal M. – Journal of Educational Measurement, 2018
The continual supply of new items is crucial to maintaining quality for many tests. Automatic item generation (AIG) has the potential to rapidly increase the number of items that are available. However, the efficiency of AIG will be mitigated if the generated items must be submitted to traditional, time-consuming review processes. In two studies,…
Descriptors: Mathematics Instruction, Mathematics Achievement, Psychometrics, Test Items
Peer reviewed
Direct link
Lee, Won-Chan; Kim, Stella Y.; Choi, Jiwon; Kang, Yujin – Journal of Educational Measurement, 2020
This article considers psychometric properties of composite raw scores and transformed scale scores on mixed-format tests that consist of a mixture of multiple-choice and free-response items. Test scores on several mixed-format tests are evaluated with respect to conditional and overall standard errors of measurement, score reliability, and…
Descriptors: Raw Scores, Item Response Theory, Test Format, Multiple Choice Tests
Peer reviewed
Direct link
Heritage, Margaret; Kingston, Neal M. – Journal of Educational Measurement, 2019
Classroom assessment and large-scale assessment have, for the most part, existed in mutual isolation. Some experts have felt this is for the best and others have been concerned that the schism limits the potential contribution of both forms of assessment. Margaret Heritage has long been a champion of best practices in classroom assessment. Neal…
Descriptors: Measurement, Psychometrics, Context Effect, Classroom Environment
Peer reviewed
Direct link
Albano, Anthony D.; Cai, Liuhan; Lease, Erin M.; McConnell, Scott R. – Journal of Educational Measurement, 2019
Studies have shown that item difficulty can vary significantly based on the context of an item within a test form. In particular, item position may be associated with practice and fatigue effects that influence item parameter estimation. The purpose of this research was to examine the relevance of item position specifically for assessments used in…
Descriptors: Test Items, Computer Assisted Testing, Item Analysis, Difficulty Level