NotesFAQContact Us
Collection
Advanced
Search Tips
Source
Educational and Psychological…23
Education Level
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 23 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Kylie Gorney; Sandip Sinharay – Educational and Psychological Measurement, 2025
Test-takers, policymakers, teachers, and institutions are increasingly demanding that testing programs provide more detailed feedback regarding test performance. As a result, there has been a growing interest in the reporting of subscores that potentially provide such detailed feedback. Haberman developed a method based on classical test theory…
Descriptors: Scores, Test Theory, Test Items, Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Yang Zhen; Xiaoyan Zhu – Educational and Psychological Measurement, 2024
The pervasive issue of cheating in educational tests has emerged as a paramount concern within the realm of education, prompting scholars to explore diverse methodologies for identifying potential transgressors. While machine learning models have been extensively investigated for this purpose, the untapped potential of TabNet, an intricate deep…
Descriptors: Artificial Intelligence, Models, Cheating, Identification
Peer reviewed Peer reviewed
Direct linkDirect link
Xiao, Jiaying; Bulut, Okan – Educational and Psychological Measurement, 2020
Large amounts of missing data could distort item parameter estimation and lead to biased ability estimates in educational assessments. Therefore, missing responses should be handled properly before estimating any parameters. In this study, two Monte Carlo simulation studies were conducted to compare the performance of four methods in handling…
Descriptors: Data, Computation, Ability, Maximum Likelihood Statistics
Peer reviewed Peer reviewed
Direct linkDirect link
DeMars, Christine E.; Jurich, Daniel P. – Educational and Psychological Measurement, 2015
In educational testing, differential item functioning (DIF) statistics must be accurately estimated to ensure the appropriate items are flagged for inspection or removal. This study showed how using the Rasch model to estimate DIF may introduce considerable bias in the results when there are large group differences in ability (impact) and the data…
Descriptors: Test Bias, Guessing (Tests), Ability, Differences
Peer reviewed Peer reviewed
Direct linkDirect link
Sinharay, Sandip; Haberman, Shelby J.; Wainer, Howard – Educational and Psychological Measurement, 2011
There are several techniques that increase the precision of subscores by borrowing information from other parts of the test. These techniques have been criticized on validity grounds in several of the recent publications. In this note, the authors question the argument used in these publications and suggest both inherent limits to the validity…
Descriptors: Scores, Methods, Validity, Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Li, Xueming; Sireci, Stephen G. – Educational and Psychological Measurement, 2013
Validity evidence based on test content is of essential importance in educational testing. One source for such evidence is an alignment study, which helps evaluate the congruence between tested objectives and those specified in the curriculum. However, the results of an alignment study do not always sufficiently capture the degree to which a test…
Descriptors: Content Validity, Multidimensional Scaling, Data Analysis, Educational Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Reckase, Mark D.; Xu, Jing-Ru – Educational and Psychological Measurement, 2015
How to compute and report subscores for a test that was originally designed for reporting scores on a unidimensional scale has been a topic of interest in recent years. In the research reported here, we describe an application of multidimensional item response theory to identify a subscore structure in a test designed for reporting results using a…
Descriptors: English, Language Skills, English Language Learners, Scores
Peer reviewed Peer reviewed
Stanley, Julian C. – Educational and Psychological Measurement, 1972
Descriptors: Educational Testing, Mathematical Applications, Statistical Analysis
Peer reviewed Peer reviewed
Livingston, Samuel A. – Educational and Psychological Measurement, 1980
A specified minimum performance level can be translated into a minimum passing score for the written test by measuring the performance of students whose written test scores are near the desired cutoff score. Stochastic approximation methods accomplish this purpose. The up-and-down method and the Robbins-Monro process are compared. (Author/RL)
Descriptors: Cutting Scores, Educational Testing, Occupational Tests, Scoring Formulas
Peer reviewed Peer reviewed
Maisiak, Richard; And Others – Educational and Psychological Measurement, 1979
The Test Analysis Program (TAP) is a comprehensive, flexible computer system designed to score and to analyze objective educational tests. The goals of the designers were to construct a program which would be user-oriented, flexible, and clear in structure and in output. (Author/JKS)
Descriptors: Computer Programs, Educational Testing, Item Analysis, Objective Tests
Peer reviewed Peer reviewed
Jonson, Jessica L.; Plake, Barbara S. – Educational and Psychological Measurement, 1998
The relationship between the validity theory of the past 50 years and actual validity practices was studied by comparing published test standards with the practices of measurement professionals expressed in the "Mental Measurements Yearbook" test reviews. Results show a symbiotic relationship between theory and practice on the influence…
Descriptors: Educational Testing, Measurement Techniques, Standards, Test Use
Peer reviewed Peer reviewed
Redburn, F. Stevens – Educational and Psychological Measurement, 1975
Q factor analysis is found appropriate for use in clinical or educational situations where available typologies or scales seem inadequate, where the psychological dynamics of learning or treatment are not well understood, or where it is desirable to avoid anticipating the precise direction and character of program impact. (Author/BJG)
Descriptors: Educational Testing, Factor Analysis, Higher Education, Internship Programs
Peer reviewed Peer reviewed
Ebel, Robert L. – Educational and Psychological Measurement, 1971
Descriptors: Achievement Tests, Educational Testing, Evaluation Methods, Multiple Choice Tests
Peer reviewed Peer reviewed
Fletcher, Jack M. – Educational and Psychological Measurement, 1982
A longitudinal evaluation of the utility of a screening battery administered in kindergarten is shown to retain a high utility for predicting current achievement outcomes of the sample at the end of grade six. The use of discriminant functional analysis and statistical decision theory is discussed. (Author/CM)
Descriptors: Educational Testing, Elementary Education, Grade 6, Kindergarten
Peer reviewed Peer reviewed
Mentzer, Thomas L. – Educational and Psychological Measurement, 1982
Evidence of biases in the correct answers in multiple-choice test item files were found to include "all of the above" bias in which that answer was correct more than 25 percent of the time, and a bias that the longest answer was correct too frequently. Seven bias types were studied. (Author/CM)
Descriptors: Educational Testing, Higher Education, Multiple Choice Tests, Psychology
Previous Page | Next Page ยป
Pages: 1  |  2