ERIC - Search Results

Publication Date

In 2025	1
Since 2024	2
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	7

Descriptor

Educational Testing	23
Scores	5
Test Items	5
Higher Education	4
Item Response Theory	3
Measurement Techniques	3
Multiple Choice Tests	3
Statistical Analysis	3
Test Construction	3
Test Reliability	3
Test Validity	3
Ability	2
Computation	2
Computer Assisted Testing	2
Correlation	2
Elementary Education	2
Error of Measurement	2
Evaluation Methods	2
Factor Analysis	2
Guessing (Tests)	2
Item Analysis	2
Models	2
Multivariate Analysis	2
Personality Measures	2
Predictive Validity	2
More ▼

Source

Educational and Psychological…

Publication Type

Journal Articles	18
Reports - Research	15
Reports - Evaluative	3

Education Level

Audience

Location

Australia	1
California	1
Canada	1
China	1
Delaware	1
Florida	1
Hong Kong	1
India	1
Japan	1
Kentucky	1
Maryland	1
Ohio	1
South Carolina	1
South Korea	1
Taiwan	1
Texas	1
United Kingdom	1
United States	1
Virginia	1
Washington	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Beery Developmental Test of…	1
Comprehensive Tests of Basic…	1
Developmental Test of Visual…	1
Metropolitan Achievement Tests	1
National Assessment of…	1
Peabody Picture Vocabulary…	1
Raven Progressive Matrices	1

What Works Clearinghouse Rating

Showing 1 to 15 of 23 results Save | Export

Added Value of Subscores for Tests with Polytomous Items

Peer reviewed

Direct link

Kylie Gorney; Sandip Sinharay – Educational and Psychological Measurement, 2025

Test-takers, policymakers, teachers, and institutions are increasingly demanding that testing programs provide more detailed feedback regarding test performance. As a result, there has been a growing interest in the reporting of subscores that potentially provide such detailed feedback. Haberman developed a method based on classical test theory…

Descriptors: Scores, Test Theory, Test Items, Testing

An Ensemble Learning Approach Based on TabNet and Machine Learning Models for Cheating Detection in Educational Tests

Peer reviewed

Direct link

Yang Zhen; Xiaoyan Zhu – Educational and Psychological Measurement, 2024

The pervasive issue of cheating in educational tests has emerged as a paramount concern within the realm of education, prompting scholars to explore diverse methodologies for identifying potential transgressors. While machine learning models have been extensively investigated for this purpose, the untapped potential of TabNet, an intricate deep…

Descriptors: Artificial Intelligence, Models, Cheating, Identification

Evaluating the Performances of Missing Data Handling Methods in Ability Estimation from Sparse Data

Peer reviewed

Direct link

Xiao, Jiaying; Bulut, Okan – Educational and Psychological Measurement, 2020

Large amounts of missing data could distort item parameter estimation and lead to biased ability estimates in educational assessments. Therefore, missing responses should be handled properly before estimating any parameters. In this study, two Monte Carlo simulation studies were conducted to compare the performance of four methods in handling…

Descriptors: Data, Computation, Ability, Maximum Likelihood Statistics

The Interaction of Ability Differences and Guessing When Modeling Differential Item Functioning with the Rasch Model: Conventional and Tailored Calibration

Peer reviewed

Direct link

DeMars, Christine E.; Jurich, Daniel P. – Educational and Psychological Measurement, 2015

In educational testing, differential item functioning (DIF) statistics must be accurately estimated to ensure the appropriate items are flagged for inspection or removal. This study showed how using the Rasch model to estimate DIF may introduce considerable bias in the results when there are large group differences in ability (impact) and the data…

Descriptors: Test Bias, Guessing (Tests), Ability, Differences

Do Adjusted Subscores Lack Validity? Don't Blame the Messenger

Peer reviewed

Direct link

Sinharay, Sandip; Haberman, Shelby J.; Wainer, Howard – Educational and Psychological Measurement, 2011

There are several techniques that increase the precision of subscores by borrowing information from other parts of the test. These techniques have been criticized on validity grounds in several of the recent publications. In this note, the authors question the argument used in these publications and suggest both inherent limits to the validity…

Descriptors: Scores, Methods, Validity, Reliability

A New Method for Analyzing Content Validity Data Using Multidimensional Scaling

Peer reviewed

Direct link

Li, Xueming; Sireci, Stephen G. – Educational and Psychological Measurement, 2013

Validity evidence based on test content is of essential importance in educational testing. One source for such evidence is an alignment study, which helps evaluate the congruence between tested objectives and those specified in the curriculum. However, the results of an alignment study do not always sufficiently capture the degree to which a test…

Descriptors: Content Validity, Multidimensional Scaling, Data Analysis, Educational Testing

The Evidence for a Subscore Structure in a Test of English Language Competency for English Language Learners

Peer reviewed

Direct link

Reckase, Mark D.; Xu, Jing-Ru – Educational and Psychological Measurement, 2015

How to compute and report subscores for a test that was originally designed for reporting scores on a unidimensional scale has been a topic of interest in recent years. In the research reported here, we describe an application of multidimensional item response theory to identify a subscore structure in a test designed for reporting results using a…

Descriptors: English, Language Skills, English Language Learners, Scores

The Predictive Validity of the Developmental Test of Visual-Motor Integration under Group and Individual Modes of Administration Relative to Academic Performance Measures of Second-Grade Pupils without Identifiable Major Learning Disabilities.

Peer reviewed

Curtis, Connie June; And Others – Educational and Psychological Measurement, 1979

The score distributions of the two methods of administration described in the title revealed comparable means, standard deviations, and general shape of distribution. With respect to validity coefficients, no appreciable differences were found. (JKS)

Descriptors: Comparative Testing, Educational Testing, Eye Hand Coordination, Grade 2

Gender and Administration Mode Effects when Pencil-and-Paper Personality Tests Are Computerized.

Peer reviewed

Miles, Edward W.; King, Wesley C., Jr. – Educational and Psychological Measurement, 1998

Whether gender and administration mode (computer versus pencil and paper) influenced mean scores on four noncognitive psychological instruments was studied with 874 undergraduates. Results show no statistically significant interaction between gender and administration mode, although statistically significant main effects were found for both gender…

Descriptors: Computer Assisted Testing, Educational Testing, Higher Education, Personality Assessment

Concerning the Mean of the Central F Distribution

Peer reviewed

Stanley, Julian C. – Educational and Psychological Measurement, 1972

Descriptors: Educational Testing, Mathematical Applications, Statistical Analysis

Improving Validity by Testing for Competence: Refinement of a Paradigm and Its Application to the Hearing-Impaired.

Peer reviewed

Dillon, Ronna F. – Educational and Psychological Measurement, 1979

The Raven Coloured Progressive Matrices and a Piagetian battery were administered to a sample of hearing-impaired elementary school children under six different conditions. Results indicated that scores varied as a function of the degree and type of feedback or elaboration. (JKS)

Descriptors: Cognitive Measurement, Developmental Stages, Educational Testing, Elementary Education

Choosing Minimum Passing Scores by Stochastic Approximation Techniques.

Peer reviewed

Livingston, Samuel A. – Educational and Psychological Measurement, 1980

A specified minimum performance level can be translated into a minimum passing score for the written test by measuring the performance of students whose written test scores are near the desired cutoff score. Stochastic approximation methods accomplish this purpose. The up-and-down method and the Robbins-Monro process are compared. (Author/RL)

Descriptors: Cutting Scores, Educational Testing, Occupational Tests, Scoring Formulas

TAP: An Interactive Test Analysis Program for Health Education.

Peer reviewed

Maisiak, Richard; And Others – Educational and Psychological Measurement, 1979

The Test Analysis Program (TAP) is a comprehensive, flexible computer system designed to score and to analyze objective educational tests. The goals of the designers were to construct a program which would be user-oriented, flexible, and clear in structure and in output. (Author/JKS)

Descriptors: Computer Programs, Educational Testing, Item Analysis, Objective Tests

The Utility of Multiple-Choice Test Formats with Mildly Retarded Adolescents.

Peer reviewed

Reynolds, William M. – Educational and Psychological Measurement, 1979

This study determined if mildly mentally retarded secondary school students could respond to a verbally presented multiple-choice test of social and personal knowledge. Teacher ratings were also obtained. Results supported the use of two- and three-alternative multiple choice tests. (Author/JKS)

Descriptors: Adolescents, Behavior Rating Scales, Educational Testing, Feasibility Studies

A Historical Comparison of Validity Standards and Validity Practices.

Peer reviewed

Jonson, Jessica L.; Plake, Barbara S. – Educational and Psychological Measurement, 1998

The relationship between the validity theory of the past 50 years and actual validity practices was studied by comparing published test standards with the practices of measurement professionals expressed in the "Mental Measurements Yearbook" test reviews. Results show a symbiotic relationship between theory and practice on the influence…

Descriptors: Educational Testing, Measurement Techniques, Standards, Test Use

Previous Page | Next Page »

Pages: 1 | 2

Barnes, Laura L. B.	1
Brink, Nicholas E.	1
Bryant, Namok C.	1
Bulut, Okan	1
Coop, Richard H.	1
Curtis, Connie June	1
DeMars, Christine E.	1
Dillon, Ronna F.	1
Ebel, Robert L.	1
Fletcher, Jack M.	1
Haberman, Shelby J.	1
Jonson, Jessica L.	1
Jurich, Daniel P.	1
King, Wesley C., Jr.	1
Kylie Gorney	1
Li, Xueming	1
Livingston, Samuel A.	1
Maisiak, Richard	1
Mentzer, Thomas L.	1
Miles, Edward W.	1
Plake, Barbara S.	1
Reckase, Mark D.	1
Redburn, F. Stevens	1
Reynolds, William M.	1
More ▼