ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	7

Descriptor

Educational Testing	23
Scores	5
Test Items	5
Higher Education	4
Item Response Theory	3
Measurement Techniques	3
Multiple Choice Tests	3
Statistical Analysis	3
Test Construction	3
Test Reliability	3
Test Validity	3
Ability	2
Computation	2
Computer Assisted Testing	2
Correlation	2
Elementary Education	2
Error of Measurement	2
Evaluation Methods	2
Factor Analysis	2
Guessing (Tests)	2
Item Analysis	2
Models	2
Multivariate Analysis	2
Personality Measures	2
Predictive Validity	2
More ▼

Source

Educational and Psychological…

Publication Type

Journal Articles	18
Reports - Research	15
Reports - Evaluative	3

Education Level

Audience

Location

Australia	1
California	1
Canada	1
China	1
Delaware	1
Florida	1
Hong Kong	1
India	1
Japan	1
Kentucky	1
Maryland	1
Ohio	1
South Carolina	1
South Korea	1
Taiwan	1
Texas	1
United Kingdom	1
United States	1
Virginia	1
Washington	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Beery Developmental Test of…	1
Comprehensive Tests of Basic…	1
Developmental Test of Visual…	1
Metropolitan Achievement Tests	1
National Assessment of…	1
Peabody Picture Vocabulary…	1
Raven Progressive Matrices	1

What Works Clearinghouse Rating

Showing 1 to 15 of 23 results Save | Export

Added Value of Subscores for Tests with Polytomous Items

Peer reviewed

Direct link

Kylie Gorney; Sandip Sinharay – Educational and Psychological Measurement, 2025

Test-takers, policymakers, teachers, and institutions are increasingly demanding that testing programs provide more detailed feedback regarding test performance. As a result, there has been a growing interest in the reporting of subscores that potentially provide such detailed feedback. Haberman developed a method based on classical test theory…

Descriptors: Scores, Test Theory, Test Items, Testing

An Ensemble Learning Approach Based on TabNet and Machine Learning Models for Cheating Detection in Educational Tests

Peer reviewed

Direct link

Yang Zhen; Xiaoyan Zhu – Educational and Psychological Measurement, 2024

The pervasive issue of cheating in educational tests has emerged as a paramount concern within the realm of education, prompting scholars to explore diverse methodologies for identifying potential transgressors. While machine learning models have been extensively investigated for this purpose, the untapped potential of TabNet, an intricate deep…

Descriptors: Artificial Intelligence, Models, Cheating, Identification

Evaluating the Performances of Missing Data Handling Methods in Ability Estimation from Sparse Data

Peer reviewed

Direct link

Xiao, Jiaying; Bulut, Okan – Educational and Psychological Measurement, 2020

Large amounts of missing data could distort item parameter estimation and lead to biased ability estimates in educational assessments. Therefore, missing responses should be handled properly before estimating any parameters. In this study, two Monte Carlo simulation studies were conducted to compare the performance of four methods in handling…

Descriptors: Data, Computation, Ability, Maximum Likelihood Statistics

The Interaction of Ability Differences and Guessing When Modeling Differential Item Functioning with the Rasch Model: Conventional and Tailored Calibration

Peer reviewed

Direct link

DeMars, Christine E.; Jurich, Daniel P. – Educational and Psychological Measurement, 2015

In educational testing, differential item functioning (DIF) statistics must be accurately estimated to ensure the appropriate items are flagged for inspection or removal. This study showed how using the Rasch model to estimate DIF may introduce considerable bias in the results when there are large group differences in ability (impact) and the data…

Descriptors: Test Bias, Guessing (Tests), Ability, Differences

Do Adjusted Subscores Lack Validity? Don't Blame the Messenger

Peer reviewed

Direct link

Sinharay, Sandip; Haberman, Shelby J.; Wainer, Howard – Educational and Psychological Measurement, 2011

There are several techniques that increase the precision of subscores by borrowing information from other parts of the test. These techniques have been criticized on validity grounds in several of the recent publications. In this note, the authors question the argument used in these publications and suggest both inherent limits to the validity…

Descriptors: Scores, Methods, Validity, Reliability

A New Method for Analyzing Content Validity Data Using Multidimensional Scaling

Peer reviewed

Direct link

Li, Xueming; Sireci, Stephen G. – Educational and Psychological Measurement, 2013

Validity evidence based on test content is of essential importance in educational testing. One source for such evidence is an alignment study, which helps evaluate the congruence between tested objectives and those specified in the curriculum. However, the results of an alignment study do not always sufficiently capture the degree to which a test…

Descriptors: Content Validity, Multidimensional Scaling, Data Analysis, Educational Testing

The Evidence for a Subscore Structure in a Test of English Language Competency for English Language Learners

Peer reviewed

Direct link

Reckase, Mark D.; Xu, Jing-Ru – Educational and Psychological Measurement, 2015

How to compute and report subscores for a test that was originally designed for reporting scores on a unidimensional scale has been a topic of interest in recent years. In the research reported here, we describe an application of multidimensional item response theory to identify a subscore structure in a test designed for reporting results using a…

Descriptors: English, Language Skills, English Language Learners, Scores

Concerning the Mean of the Central F Distribution

Peer reviewed

Stanley, Julian C. – Educational and Psychological Measurement, 1972

Descriptors: Educational Testing, Mathematical Applications, Statistical Analysis

Choosing Minimum Passing Scores by Stochastic Approximation Techniques.

Peer reviewed

Livingston, Samuel A. – Educational and Psychological Measurement, 1980

A specified minimum performance level can be translated into a minimum passing score for the written test by measuring the performance of students whose written test scores are near the desired cutoff score. Stochastic approximation methods accomplish this purpose. The up-and-down method and the Robbins-Monro process are compared. (Author/RL)

Descriptors: Cutting Scores, Educational Testing, Occupational Tests, Scoring Formulas

TAP: An Interactive Test Analysis Program for Health Education.

Peer reviewed

Maisiak, Richard; And Others – Educational and Psychological Measurement, 1979

The Test Analysis Program (TAP) is a comprehensive, flexible computer system designed to score and to analyze objective educational tests. The goals of the designers were to construct a program which would be user-oriented, flexible, and clear in structure and in output. (Author/JKS)

Descriptors: Computer Programs, Educational Testing, Item Analysis, Objective Tests

A Historical Comparison of Validity Standards and Validity Practices.

Peer reviewed

Jonson, Jessica L.; Plake, Barbara S. – Educational and Psychological Measurement, 1998

The relationship between the validity theory of the past 50 years and actual validity practices was studied by comparing published test standards with the practices of measurement professionals expressed in the "Mental Measurements Yearbook" test reviews. Results show a symbiotic relationship between theory and practice on the influence…

Descriptors: Educational Testing, Measurement Techniques, Standards, Test Use

Q Factor Analysis: Applications to Educational Testing and Program Evaluation

Peer reviewed

Redburn, F. Stevens – Educational and Psychological Measurement, 1975

Q factor analysis is found appropriate for use in clinical or educational situations where available typologies or scales seem inadequate, where the psychological dynamics of learning or treatment are not well understood, or where it is desirable to avoid anticipating the precise direction and character of program impact. (Author/BJG)

Descriptors: Educational Testing, Factor Analysis, Higher Education, Internship Programs

How to Write True-False Test Items

Peer reviewed

Ebel, Robert L. – Educational and Psychological Measurement, 1971

Descriptors: Achievement Tests, Educational Testing, Evaluation Methods, Multiple Choice Tests

Kindergarten Prediction of Reading Achievement: A Seven-Year Longitudinal Follow-Up.

Peer reviewed

Fletcher, Jack M. – Educational and Psychological Measurement, 1982

A longitudinal evaluation of the utility of a screening battery administered in kindergarten is shown to retain a high utility for predicting current achievement outcomes of the sample at the end of grade six. The use of discriminant functional analysis and statistical decision theory is discussed. (Author/CM)

Descriptors: Educational Testing, Elementary Education, Grade 6, Kindergarten

Response Biases in Multiple-Choice Test Item Files.

Peer reviewed

Mentzer, Thomas L. – Educational and Psychological Measurement, 1982

Evidence of biases in the correct answers in multiple-choice test item files were found to include "all of the above" bias in which that answer was correct more than 25 percent of the time, and a bias that the longest answer was correct too frequently. Seven bias types were studied. (Author/CM)

Descriptors: Educational Testing, Higher Education, Multiple Choice Tests, Psychology

Previous Page | Next Page »

Pages: 1 | 2

Barnes, Laura L. B.	1
Brink, Nicholas E.	1
Bryant, Namok C.	1
Bulut, Okan	1
Coop, Richard H.	1
Curtis, Connie June	1
DeMars, Christine E.	1
Dillon, Ronna F.	1
Ebel, Robert L.	1
Fletcher, Jack M.	1
Haberman, Shelby J.	1
Jonson, Jessica L.	1
Jurich, Daniel P.	1
King, Wesley C., Jr.	1
Kylie Gorney	1
Li, Xueming	1
Livingston, Samuel A.	1
Maisiak, Richard	1
Mentzer, Thomas L.	1
Miles, Edward W.	1
Plake, Barbara S.	1
Reckase, Mark D.	1
Redburn, F. Stevens	1
Reynolds, William M.	1
More ▼