ERIC - Search Results

Publication Date

In 2025	9
Since 2024	23
Since 2021 (last 5 years)	55
Since 2016 (last 10 years)	93
Since 2006 (last 20 years)	302

Descriptor

Evaluation Methods	469
Item Analysis	469
Test Items	104
Test Construction	96
Foreign Countries	83
Psychometrics	79
Item Response Theory	75
Test Validity	75
Measurement Techniques	71
Evaluation Research	70
Student Evaluation	59
Factor Analysis	57
Evaluation Criteria	56
Models	55
Test Reliability	55
Comparative Analysis	52
Measures (Individuals)	48
Statistical Analysis	48
Correlation	41
Questionnaires	41
Validity	39
Computer Assisted Testing	36
Scores	36
Simulation	36
Program Validation	34
More ▼

Education Level

Higher Education	84
Elementary Secondary Education	44
Elementary Education	36
Secondary Education	36
Postsecondary Education	25
Adult Education	24
High Schools	16
Early Childhood Education	15
Middle Schools	13
Grade 8	7
Junior High Schools	7
Grade 5	5
Grade 6	5
Grade 4	4
Intermediate Grades	4
Kindergarten	4
Preschool Education	4
Primary Education	4
Grade 7	3
Grade 1	2
Grade 10	2
Grade 11	2
Grade 3	2
Two Year Colleges	2
Adult Basic Education	1
More ▼

Audience

Researchers	11
Practitioners	8
Administrators	4
Policymakers	4
Teachers	4
Counselors	1
Media Staff	1

Location

Australia	11
United Kingdom	10
Oregon	8
United States	8
China	7
United Kingdom (England)	6
Netherlands	5
Canada	4
California	3
Germany	3
Greece	3
India	3
South Korea	3
Taiwan	3
Texas	3
Turkey	3
Virginia	3
Florida	2
Hong Kong	2
Indonesia	2
Iran	2
Malaysia	2
Massachusetts	2
Mexico	2
Michigan	2
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	8
Elementary and Secondary…	1
No Child Left Behind Act 2001	1

What Works Clearinghouse Rating

Showing 1 to 15 of 469 results Save | Export

IRT Linking Methods for the Bifactor Model with Mixed Format Tests

Peer reviewed

Direct link

Sohee Kim; Ki Lynn Cole – International Journal of Testing, 2025

This study conducted a comprehensive comparison of Item Response Theory (IRT) linking methods applied to a bifactor model, examining their performance on both multiple choice (MC) and mixed format tests within the common item nonequivalent group design framework. Four distinct multidimensional IRT linking approaches were explored, consisting of…

Descriptors: Item Response Theory, Comparative Analysis, Models, Item Analysis

Why Forced-Choice and Likert Items Provide the Same Information on Personality, including Social Desirability

Peer reviewed

Direct link

Martin Bäckström; Fredrik Björklund – Educational and Psychological Measurement, 2024

The forced-choice response format is often considered superior to the standard Likert-type format for controlling social desirability in personality inventories. We performed simulations and found that the trait information based on the two formats converges when the number of items is high and forced-choice items are mixed with regard to…

Descriptors: Likert Scales, Item Analysis, Personality Traits, Personality Measures

Bayesian Diagnostic Classification Models for a Partially Known Q-Matrix

Peer reviewed

Direct link

Kazuhiro Yamaguchi – Journal of Educational and Behavioral Statistics, 2025

This study proposes a Bayesian method for diagnostic classification models (DCMs) for a partially known Q-matrix setting between exploratory and confirmatory DCMs. This Q-matrix setting is practical and useful because test experts have pre-knowledge of the Q-matrix but cannot readily specify it completely. The proposed method employs priors for…

Descriptors: Models, Classification, Bayesian Statistics, Evaluation Methods

A Note on Standard Errors for Multidimensional Two-Parameter Logistic Models Using Gaussian Variational Estimation

Peer reviewed

Direct link

Jiaying Xiao; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Accurate item parameters and standard errors (SEs) are crucial for many multidimensional item response theory (MIRT) applications. A recent study proposed the Gaussian Variational Expectation Maximization (GVEM) algorithm to improve computational efficiency and estimation accuracy (Cho et al., 2021). However, the SE estimation procedure has yet to…

Descriptors: Error of Measurement, Models, Evaluation Methods, Item Analysis

Deep Learning Imputation for Asymmetric and Incomplete Likert-Type Items

Peer reviewed

Direct link

Zachary K. Collier; Minji Kong; Olushola Soyoye; Kamal Chawla; Ann M. Aviles; Yasser Payne – Journal of Educational and Behavioral Statistics, 2024

Asymmetric Likert-type items in research studies can present several challenges in data analysis, particularly concerning missing data. These items are often characterized by a skewed scaling, where either there is no neutral response option or an unequal number of possible positive and negative responses. The use of conventional techniques, such…

Descriptors: Likert Scales, Test Items, Item Analysis, Evaluation Methods

Identifying Response Styles Using Person Fit Analysis and Response-Styles Models

Peer reviewed

Direct link

Wind, Stefanie A.; Ge, Yuan – Measurement: Interdisciplinary Research and Perspectives, 2023

In selected-response assessments such as attitude surveys with Likert-type rating scales, examinees often select from rating scale categories to reflect their locations on a construct. Researchers have observed that some examinees exhibit "response styles," which are systematic patterns of responses in which examinees are more likely to…

Descriptors: Goodness of Fit, Responses, Likert Scales, Models

Evaluating the Performance of Estimators in SEM and IRT with Ordinal Variables

Direct link

Klauth, Bo – ProQuest LLC, 2023

In conducting confirmatory factor analysis with ordered response items, the literature suggests that when the number of responses is five and item skewness (IS) is approximately normal, researchers can employ maximum likelihood with robust standard errors (MLR). However, MLR can yield biased factor loadings (FL) and FL standard errors (FLSE) when…

Descriptors: Item Response Theory, Evaluation Methods, Factor Analysis, Error of Measurement

A Validation Study of the Extended Relevance Scale Using the D3mirt Package for R

Peer reviewed

Direct link

Erik Forsberg; Anders Sjöberg – Measurement: Interdisciplinary Research and Perspectives, 2025

This paper reports a validation study based on descriptive multidimensional item response theory (DMIRT), implemented in the R package "D3mirt" by using the ERS-C, an extended version of the Relevance subscale from the Moral Foundations Questionnaire including two new items for collectivism (17 items in total). Two latent models are…

Descriptors: Evaluation Methods, Programming Languages, Altruism, Collectivism

The Applicability of the Mississippi Professional Growth System's Evaluation of Secondary Choral Music Educators

Direct link

Hannah Gadd Ardrey – ProQuest LLC, 2024

The purpose of the study was to investigate secondary choral music educators' and administrators' perceptions of the use of the Mississippi Professional Growth System (PGS) as an applicable tool for evaluating secondary choral music educators. While there is limited research regarding the evaluation of choral music educators, this study aimed to…

Descriptors: Secondary School Teachers, Music Teachers, Singing, Teacher Evaluation

Small-Variance Priors in Bayesian Factor Analysis with Ordinal Data

Peer reviewed

Direct link

Liang, Xinya; Cao, Chunhua – Journal of Experimental Education, 2023

To evaluate multidimensional factor structure, a popular method that combines features of confirmatory and exploratory factor analysis is Bayesian structural equation modeling with small-variance normal priors (BSEM-N). This simulation study evaluated BSEM-N as a variable selection and parameter estimation tool in factor analysis with sparse…

Descriptors: Factor Analysis, Bayesian Statistics, Structural Equation Models, Simulation

Cumulative Ordering as Evidence of Construct Validity for Assessments of Developmental Attributes

Peer reviewed

Direct link

Stephen Humphry; Paul Montuoro; Carolyn Maxwell – Journal of Psychoeducational Assessment, 2024

This article builds upon a proiminent definition of construct validity that focuses on variation in attributes causing variation in measurement outcomes. This article synthesizes the defintion and uses Rasch measurement modeling to explicate a modified conceptualization of construct validity for assessments of developmental attributes. If…

Descriptors: Construct Validity, Measurement Techniques, Developmental Stages, Item Analysis

Detecting Compromised Items with Response Times Using a Bayesian Change-Point Approach

Peer reviewed

Direct link

Yang Du; Susu Zhang – Journal of Educational and Behavioral Statistics, 2025

Item compromise has long posed challenges in educational measurement, jeopardizing both test validity and test security of continuous tests. Detecting compromised items is therefore crucial to address this concern. The present literature on compromised item detection reveals two notable gaps: First, the majority of existing methods are based upon…

Descriptors: Item Response Theory, Item Analysis, Bayesian Statistics, Educational Assessment

Linear Factor Analytic Thurstonian Forced-Choice Models: Current Status and Issues

Peer reviewed

Direct link

Markus T. Jansen; Ralf Schulze – Educational and Psychological Measurement, 2024

Thurstonian forced-choice modeling is considered to be a powerful new tool to estimate item and person parameters while simultaneously testing the model fit. This assessment approach is associated with the aim of reducing faking and other response tendencies that plague traditional self-report trait assessments. As a result of major recent…

Descriptors: Factor Analysis, Models, Item Analysis, Evaluation Methods

Assessing Dimensionality of IRT Models Using Traditional and Revised Parallel Analyses

Peer reviewed

Direct link

Guo, Wenjing; Choi, Youn-Jeng – Educational and Psychological Measurement, 2023

Determining the number of dimensions is extremely important in applying item response theory (IRT) models to data. Traditional and revised parallel analyses have been proposed within the factor analysis framework, and both have shown some promise in assessing dimensionality. However, their performance in the IRT framework has not been…

Descriptors: Item Response Theory, Evaluation Methods, Factor Analysis, Guidelines

There Are Many Greater Lower Bounds than Cronbach's [alpha]: A Monte Carlo Simulation Study

Peer reviewed

Direct link

Novak, Josip; Rebernjak, Blaž – Measurement: Interdisciplinary Research and Perspectives, 2023

A Monte Carlo simulation study was conducted to examine the performance of [alpha], [lambda]2, [lambda][subscript 4], [lambda][subscript 2], [omega][subscript T], GLB[subscript MRFA], and GLB[subscript Algebraic] coefficients. Population reliability, distribution shape, sample size, test length, and number of response categories were varied…

Descriptors: Monte Carlo Methods, Evaluation Methods, Reliability, Simulation

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 32

ProQuest LLC	24
Educational and Psychological…	20
Journal of Educational…	9
Behavioral Research and…	8
Assessment & Evaluation in…	7
Journal of Educational and…	7
Applied Psychological…	6
Grantee Submission	6
Applied Measurement in…	5
International Journal of…	5
Journal of Technology,…	5
Journal of Experimental…	4
Measurement:…	4
Multivariate Behavioral…	4
Online Submission	4
Achieve, Inc.	3
Early Education and…	3
Educ Psychol Meas	3
Educational Research and…	3
Practical Assessment,…	3
Psychometrika	3
AERA Online Paper Repository	2
American Journal of Evaluation	2
Assessment	2
Assessment for Effective…	2
More ▼

Alonzo, Julie	8
Tindal, Gerald	8
Lai, Cheng Fei	7
Hambleton, Ronald K.	4
Raykov, Tenko	4
Chun Wang	3
Gongjun Xu	3
Brennan, Robert L.	2
Dancer, L. Suzanne	2
De Maeyer, Sven	2
Gierl, Mark J.	2
Jaeger, Richard M.	2
Johanson, George A.	2
Kim, Do-Hong	2
Klein, Stephen P.	2
Lee, Minhong	2
Liu, Yan	2
McKinley, Robert L.	2
Merz, William R.	2
Muniz, Jose	2
Pan, Yue-Juan	2
Plake, Barbara S.	2
Preece, P. F. W.	2
Reckase, Mark D.	2
More ▼

Journal Articles	297
Reports - Research	236
Reports - Evaluative	90
Reports - Descriptive	53
Speeches/Meeting Papers	31
Dissertations/Theses -…	24
Information Analyses	18
Tests/Questionnaires	17
Numerical/Quantitative Data	15
Guides - Non-Classroom	6
Guides - Classroom - Teacher	3
Books	2
Collected Works - General	2
Guides - General	2
Opinion Papers	2
Dissertations/Theses -…	1
Historical Materials	1
Non-Print Media	1
Reference Materials -…	1
Reference Materials - General	1
More ▼

Iowa Tests of Basic Skills	4
National Assessment of…	4
Program for International…	4
Trends in International…	4
Test of English as a Foreign…	3
Graduate Record Examinations	2
Wechsler Intelligence Scale…	2
ACT Assessment	1
Academic Motivation Scale	1
Attitude Scale	1
Autism Diagnostic Observation…	1
Behavior Assessment System…	1
California Psychological…	1
Child Behavior Checklist	1
Classroom Assessment Scoring…	1
Dynamic Indicators of Basic…	1
Early Childhood Environment…	1
Eysenck Personality Inventory	1
Flesch Kincaid Grade Level…	1
Florida Comprehensive…	1
Group Assessment of Logical…	1
International English…	1
Metropolitan Achievement Tests	1
National Longitudinal Study…	1
National Longitudinal Study…	1
More ▼