ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	9
Since 2016 (last 10 years)	21
Since 2006 (last 20 years)	40

Descriptor

Achievement Tests	158
Test Validity	62
Predictive Validity	32
Test Items	31
Academic Achievement	30
Correlation	29
Scores	26
Test Reliability	24
Foreign Countries	22
Intelligence Tests	20
Higher Education	19
Item Response Theory	19
Standardized Tests	18
Item Analysis	17
Comparative Analysis	16
Elementary Education	14
Factor Analysis	13
Test Bias	13
Test Construction	13
Elementary School Students	12
International Assessment	12
Aptitude Tests	11
Computation	11
Elementary Secondary Education	11
Grade Point Average	11
More ▼

Source

Educational and Psychological…

158

Publication Type

Journal Articles	114
Reports - Research	98
Reports - Evaluative	12
Opinion Papers	2
Reports - Descriptive	2
Information Analyses	1

Education Level

Secondary Education	14
Elementary Education	10
Elementary Secondary Education	6
Higher Education	6
Intermediate Grades	6
Postsecondary Education	6
Grade 4	5
Middle Schools	5
Early Childhood Education	4
Grade 3	4
High Schools	4
Junior High Schools	4
Primary Education	4
Grade 5	3
Grade 6	2
Grade 8	2
Grade 1	1
Grade 10	1
Grade 2	1
Grade 7	1
Grade 9	1
Kindergarten	1
More ▼

Audience

Location

Germany	4
Canada	3
Taiwan	3
Australia	2
China	2
Alaska	1
Florida	1
Hong Kong	1
Indiana	1
Ireland	1
Israel	1
Japan	1
Kentucky	1
New Zealand	1
Ohio	1
Pakistan	1
Russia	1
Singapore	1
South Korea	1
United Arab Emirates	1
United Kingdom	1
United States	1
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001

What Works Clearinghouse Rating

Showing 1 to 15 of 158 results Save | Export

Fixed Effects or Mixed Effects Classifiers? Evidence from Simulated and Archival Data

Peer reviewed

Direct link

Mangino, Anthony A.; Bolin, Jocelyn H.; Finch, W. Holmes – Educational and Psychological Measurement, 2023

This study seeks to compare fixed and mixed effects models for the purposes of predictive classification in the presence of multilevel data. The first part of the study utilizes a Monte Carlo simulation to compare fixed and mixed effects logistic regression and random forests. An applied examination of the prediction of student retention in the…

Descriptors: Prediction, Classification, Monte Carlo Methods, Foreign Countries

Parameter Estimation Accuracy of the Effort-Moderated Item Response Theory Model under Multiple Assumption Violations

Peer reviewed

Direct link

Rios, Joseph A.; Soland, James – Educational and Psychological Measurement, 2021

As low-stakes testing contexts increase, low test-taking effort may serve as a serious validity threat. One common solution to this problem is to identify noneffortful responses and treat them as missing during parameter estimation via the effort-moderated item response theory (EM-IRT) model. Although this model has been shown to outperform…

Descriptors: Computation, Accuracy, Item Response Theory, Response Style (Tests)

Generalized Mantel-Haenszel Estimators for Simultaneous Differential Item Functioning Tests

Peer reviewed

Direct link

Liu, Ivy; Suesse, Thomas; Harvey, Samuel; Gu, Peter Yongqi; Fernández, Daniel; Randal, John – Educational and Psychological Measurement, 2023

The Mantel-Haenszel estimator is one of the most popular techniques for measuring differential item functioning (DIF). A generalization of this estimator is applied to the context of DIF to compare items by taking the covariance of odds ratio estimators between dependent items into account. Unlike the Item Response Theory, the method does not rely…

Descriptors: Test Bias, Computation, Statistical Analysis, Achievement Tests

Fused SDT/IRT Models for Mixed-Format Exams

Peer reviewed

Direct link

Lawrence T. DeCarlo – Educational and Psychological Measurement, 2024

A psychological framework for different types of items commonly used with mixed-format exams is proposed. A choice model based on signal detection theory (SDT) is used for multiple-choice (MC) items, whereas an item response theory (IRT) model is used for open-ended (OE) items. The SDT and IRT models are shown to share a common conceptualization…

Descriptors: Test Format, Multiple Choice Tests, Item Response Theory, Models

Performance of Coefficient Alpha and Its Alternatives: Effects of Different Types of Non-Normality

Peer reviewed

Direct link

Xiao, Leifeng; Hau, Kit-Tai – Educational and Psychological Measurement, 2023

We examined the performance of coefficient alpha and its potential competitors (ordinal alpha, omega total, Revelle's omega total [omega RT], omega hierarchical [omega h], greatest lower bound [GLB], and coefficient "H") with continuous and discrete data having different types of non-normality. Results showed the estimation bias was…

Descriptors: Statistical Bias, Statistical Analysis, Likert Scales, Statistical Distributions

Is Differential Noneffortful Responding Associated with Type I Error in Measurement Invariance Testing?

Peer reviewed

Direct link

Rios, Joseph A. – Educational and Psychological Measurement, 2021

Low test-taking effort as a validity threat is common when examinees perceive an assessment context to have minimal personal value. Prior research has shown that in such contexts, subgroups may differ in their effort, which raises two concerns when making subgroup mean comparisons. First, it is unclear how differential effort could influence…

Descriptors: Response Style (Tests), Statistical Analysis, Measurement, Comparative Analysis

Assessing Measurement Invariance across Multiple Groups: When Is Fit Good Enough?

Peer reviewed

Direct link

van Dijk, Wilhelmina; Schatschneider, Christopher; Al Otaiba, Stephanie; Hart, Sara A. – Educational and Psychological Measurement, 2022

Complex research questions often need large samples to obtain accurate estimates of parameters and adequate power. Combining extant data sets into a large, pooled data set is one way this can be accomplished without expending resources. Measurement invariance (MI) modeling is an established approach to ensure participant scores are on the same…

Descriptors: Sample Size, Data Analysis, Goodness of Fit, Measurement

Measuring Motivation to Take Low-Stakes Large-Scale Test: New Model Based on Analyses of "Participant-Own-Defined" Missingness

Peer reviewed

Direct link

Liu, Yuan; Hau, Kit-Tai – Educational and Psychological Measurement, 2020

In large-scale low-stake assessment such as the Programme for International Student Assessment (PISA), students may skip items (missingness) which are within their ability to complete. The detection and taking care of these noneffortful responses, as a measure of test-taking motivation, is an important issue in modern psychometric models.…

Descriptors: Response Style (Tests), Motivation, Test Items, Statistical Analysis

Comparing Age- and Grade-Based Norms on the Woodcock-Johnson III Normative Update

Peer reviewed

Direct link

Harrison, Allyson G.; Butt, Kaitlyn; Armstrong, Irene – Educational and Psychological Measurement, 2019

There has been a marked increase in accommodation requests from students with disabilities at both the postsecondary education level and on high-stakes examinations. As such, accurate identification and quantification of normative impairment is essential for equitable provision of accommodations. Considerable diversity currently exists in methods…

Descriptors: Achievement Tests, Test Norms, Age, Instructional Program Divisions

A Mixture IRTree Model for Performance Decline and Nonignorable Missing Data

Peer reviewed

Direct link

Huang, Hung-Yu – Educational and Psychological Measurement, 2020

In educational assessments and achievement tests, test developers and administrators commonly assume that test-takers attempt all test items with full effort and leave no blank responses with unplanned missing values. However, aberrant response behavior--such as performance decline, dropping out beyond a certain point, and skipping certain items…

Descriptors: Item Response Theory, Response Style (Tests), Test Items, Statistical Analysis

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks

Peer reviewed

Direct link

von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023

Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

When Nonresponse Mechanisms Change: Effects on Trends and Group Comparisons in International Large-Scale Assessments

Peer reviewed

Direct link

Sachse, Karoline A.; Mahler, Nicole; Pohl, Steffi – Educational and Psychological Measurement, 2019

Mechanisms causing item nonresponses in large-scale assessments are often said to be nonignorable. Parameter estimates can be biased if nonignorable missing data mechanisms are not adequately modeled. In trend analyses, it is plausible for the missing data mechanism and the percentage of missing values to change over time. In this article, we…

Descriptors: International Assessment, Response Style (Tests), Achievement Tests, Foreign Countries

A Mixture IRTree Model for Extreme Response Style: Accounting for Response Process Uncertainty

Peer reviewed

Direct link

Kim, Nana; Bolt, Daniel M. – Educational and Psychological Measurement, 2021

This paper presents a mixture item response tree (IRTree) model for extreme response style. Unlike traditional applications of single IRTree models, a mixture approach provides a way of representing the mixture of respondents following different underlying response processes (between individuals), as well as the uncertainty present at the…

Descriptors: Item Response Theory, Response Style (Tests), Models, Test Items

Using Quantile Regression to Estimate Intervention Effects beyond the Mean

Peer reviewed
PDF on ERIC

Download full text

Direct link

Konstantopoulos, Spyros; Li, Wei; Miller, Shazia; van der Ploeg, Arie – Educational and Psychological Measurement, 2019

This study discusses quantile regression methodology and its usefulness in education and social science research. First, quantile regression is defined and its advantages vis-à-vis vis ordinary least squares regression are illustrated. Second, specific comparisons are made between ordinary least squares and quantile regression methods. Third, the…

Descriptors: Regression (Statistics), Statistical Analysis, Educational Research, Social Science Research

Constructing Subscores That Add Validity: A Case Study of Identifying Students at Risk

Peer reviewed
PDF on ERIC

Download full text

Direct link

Biancarosa, Gina; Kennedy, Patrick C.; Carlson, Sarah E.; Yoon, HyeonJin; Seipel, Ben; Liu, Bowen; Davison, Mark L. – Educational and Psychological Measurement, 2019

Prior research suggests that subscores from a single achievement test seldom add value over a single total score. Such scores typically correspond to subcontent areas in the total content domain, but content subdomains might not provide a sound basis for subscores. Using scores on an inferential reading comprehension test from 625 third, fourth,…

Descriptors: Scores, Scoring, Achievement Tests, Grade 3

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11

Michael, William B.	6
Powers, Stephen	5
Plake, Barbara S.	4
Klein, Alice E.	3
Wilcox, Rand R.	3
Ansley, Timothy N.	2
Bennet, Richard W.	2
Benson, Jeri	2
Darakjian, Gregory P.	2
Feldt, Leonard S.	2
Finch, W. Holmes	2
Forsyth, Robert A.	2
Frey, Andreas	2
Goolsby, Thomas M., Jr.	2
Hakstian, A. Ralph	2
Harper, Frank B. W.	2
Hau, Kit-Tai	2
Huang, Hung-Yu	2
Pohl, Steffi	2
Qualls, Audrey L.	2
Reynolds, Cecil R.	2
Rios, Joseph A.	2
Simpson, Robert G.	2
Valencia, Richard R.	2
More ▼

Stanford Achievement Tests	10
Iowa Tests of Basic Skills	9
Program for International…	9
Comprehensive Tests of Basic…	7
California Achievement Tests	6
Wechsler Intelligence Scale…	6
Wide Range Achievement Test	6
Iowa Tests of Educational…	5
Peabody Individual…	5
Trends in International…	5
Graduate Record Examinations	4
Metropolitan Achievement Tests	4
SAT (College Admission Test)	4
SRA Achievement Series	3
Wechsler Preschool and…	3
College Level Examination…	2
Dimensions of Self Concept	2
Kaufman Assessment Battery…	2
SRA Primary Mental Abilities…	2
Stanford Early School…	2
Woodcock Johnson Psycho…	2
Woodcock Johnson Tests of…	2
ACT Assessment	1
Advanced Placement…	1
California Test of Mental…	1
More ▼