Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 3 |
Since 2006 (last 20 years) | 7 |
Descriptor
Educational Testing | 23 |
Scores | 5 |
Test Items | 5 |
Higher Education | 4 |
Item Response Theory | 3 |
Measurement Techniques | 3 |
Multiple Choice Tests | 3 |
Statistical Analysis | 3 |
Test Construction | 3 |
Test Reliability | 3 |
Test Validity | 3 |
More ▼ |
Source
Educational and Psychological… | 23 |
Author
Publication Type
Journal Articles | 18 |
Reports - Research | 15 |
Reports - Evaluative | 3 |
Education Level
Audience
Location
Australia | 1 |
California | 1 |
Canada | 1 |
China | 1 |
Delaware | 1 |
Florida | 1 |
Hong Kong | 1 |
India | 1 |
Japan | 1 |
Kentucky | 1 |
Maryland | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Beery Developmental Test of… | 1 |
Comprehensive Tests of Basic… | 1 |
Developmental Test of Visual… | 1 |
Metropolitan Achievement Tests | 1 |
National Assessment of… | 1 |
Peabody Picture Vocabulary… | 1 |
Raven Progressive Matrices | 1 |
What Works Clearinghouse Rating
Kylie Gorney; Sandip Sinharay – Educational and Psychological Measurement, 2025
Test-takers, policymakers, teachers, and institutions are increasingly demanding that testing programs provide more detailed feedback regarding test performance. As a result, there has been a growing interest in the reporting of subscores that potentially provide such detailed feedback. Haberman developed a method based on classical test theory…
Descriptors: Scores, Test Theory, Test Items, Testing
Yang Zhen; Xiaoyan Zhu – Educational and Psychological Measurement, 2024
The pervasive issue of cheating in educational tests has emerged as a paramount concern within the realm of education, prompting scholars to explore diverse methodologies for identifying potential transgressors. While machine learning models have been extensively investigated for this purpose, the untapped potential of TabNet, an intricate deep…
Descriptors: Artificial Intelligence, Models, Cheating, Identification
Xiao, Jiaying; Bulut, Okan – Educational and Psychological Measurement, 2020
Large amounts of missing data could distort item parameter estimation and lead to biased ability estimates in educational assessments. Therefore, missing responses should be handled properly before estimating any parameters. In this study, two Monte Carlo simulation studies were conducted to compare the performance of four methods in handling…
Descriptors: Data, Computation, Ability, Maximum Likelihood Statistics
DeMars, Christine E.; Jurich, Daniel P. – Educational and Psychological Measurement, 2015
In educational testing, differential item functioning (DIF) statistics must be accurately estimated to ensure the appropriate items are flagged for inspection or removal. This study showed how using the Rasch model to estimate DIF may introduce considerable bias in the results when there are large group differences in ability (impact) and the data…
Descriptors: Test Bias, Guessing (Tests), Ability, Differences
Sinharay, Sandip; Haberman, Shelby J.; Wainer, Howard – Educational and Psychological Measurement, 2011
There are several techniques that increase the precision of subscores by borrowing information from other parts of the test. These techniques have been criticized on validity grounds in several of the recent publications. In this note, the authors question the argument used in these publications and suggest both inherent limits to the validity…
Descriptors: Scores, Methods, Validity, Reliability
Li, Xueming; Sireci, Stephen G. – Educational and Psychological Measurement, 2013
Validity evidence based on test content is of essential importance in educational testing. One source for such evidence is an alignment study, which helps evaluate the congruence between tested objectives and those specified in the curriculum. However, the results of an alignment study do not always sufficiently capture the degree to which a test…
Descriptors: Content Validity, Multidimensional Scaling, Data Analysis, Educational Testing
Reckase, Mark D.; Xu, Jing-Ru – Educational and Psychological Measurement, 2015
How to compute and report subscores for a test that was originally designed for reporting scores on a unidimensional scale has been a topic of interest in recent years. In the research reported here, we describe an application of multidimensional item response theory to identify a subscore structure in a test designed for reporting results using a…
Descriptors: English, Language Skills, English Language Learners, Scores

Stanley, Julian C. – Educational and Psychological Measurement, 1972
Descriptors: Educational Testing, Mathematical Applications, Statistical Analysis

Livingston, Samuel A. – Educational and Psychological Measurement, 1980
A specified minimum performance level can be translated into a minimum passing score for the written test by measuring the performance of students whose written test scores are near the desired cutoff score. Stochastic approximation methods accomplish this purpose. The up-and-down method and the Robbins-Monro process are compared. (Author/RL)
Descriptors: Cutting Scores, Educational Testing, Occupational Tests, Scoring Formulas

Maisiak, Richard; And Others – Educational and Psychological Measurement, 1979
The Test Analysis Program (TAP) is a comprehensive, flexible computer system designed to score and to analyze objective educational tests. The goals of the designers were to construct a program which would be user-oriented, flexible, and clear in structure and in output. (Author/JKS)
Descriptors: Computer Programs, Educational Testing, Item Analysis, Objective Tests

Jonson, Jessica L.; Plake, Barbara S. – Educational and Psychological Measurement, 1998
The relationship between the validity theory of the past 50 years and actual validity practices was studied by comparing published test standards with the practices of measurement professionals expressed in the "Mental Measurements Yearbook" test reviews. Results show a symbiotic relationship between theory and practice on the influence…
Descriptors: Educational Testing, Measurement Techniques, Standards, Test Use

Redburn, F. Stevens – Educational and Psychological Measurement, 1975
Q factor analysis is found appropriate for use in clinical or educational situations where available typologies or scales seem inadequate, where the psychological dynamics of learning or treatment are not well understood, or where it is desirable to avoid anticipating the precise direction and character of program impact. (Author/BJG)
Descriptors: Educational Testing, Factor Analysis, Higher Education, Internship Programs

Ebel, Robert L. – Educational and Psychological Measurement, 1971
Descriptors: Achievement Tests, Educational Testing, Evaluation Methods, Multiple Choice Tests

Fletcher, Jack M. – Educational and Psychological Measurement, 1982
A longitudinal evaluation of the utility of a screening battery administered in kindergarten is shown to retain a high utility for predicting current achievement outcomes of the sample at the end of grade six. The use of discriminant functional analysis and statistical decision theory is discussed. (Author/CM)
Descriptors: Educational Testing, Elementary Education, Grade 6, Kindergarten

Mentzer, Thomas L. – Educational and Psychological Measurement, 1982
Evidence of biases in the correct answers in multiple-choice test item files were found to include "all of the above" bias in which that answer was correct more than 25 percent of the time, and a bias that the longest answer was correct too frequently. Seven bias types were studied. (Author/CM)
Descriptors: Educational Testing, Higher Education, Multiple Choice Tests, Psychology
Previous Page | Next Page ยป
Pages: 1 | 2