NotesFAQContact Us
Collection
Advanced
Search Tips
Source
Educational and Psychological…23
Education Level
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 23 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Kylie Gorney; Sandip Sinharay – Educational and Psychological Measurement, 2025
Test-takers, policymakers, teachers, and institutions are increasingly demanding that testing programs provide more detailed feedback regarding test performance. As a result, there has been a growing interest in the reporting of subscores that potentially provide such detailed feedback. Haberman developed a method based on classical test theory…
Descriptors: Scores, Test Theory, Test Items, Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Yang Zhen; Xiaoyan Zhu – Educational and Psychological Measurement, 2024
The pervasive issue of cheating in educational tests has emerged as a paramount concern within the realm of education, prompting scholars to explore diverse methodologies for identifying potential transgressors. While machine learning models have been extensively investigated for this purpose, the untapped potential of TabNet, an intricate deep…
Descriptors: Artificial Intelligence, Models, Cheating, Identification
Peer reviewed Peer reviewed
Direct linkDirect link
Xiao, Jiaying; Bulut, Okan – Educational and Psychological Measurement, 2020
Large amounts of missing data could distort item parameter estimation and lead to biased ability estimates in educational assessments. Therefore, missing responses should be handled properly before estimating any parameters. In this study, two Monte Carlo simulation studies were conducted to compare the performance of four methods in handling…
Descriptors: Data, Computation, Ability, Maximum Likelihood Statistics
Peer reviewed Peer reviewed
Direct linkDirect link
DeMars, Christine E.; Jurich, Daniel P. – Educational and Psychological Measurement, 2015
In educational testing, differential item functioning (DIF) statistics must be accurately estimated to ensure the appropriate items are flagged for inspection or removal. This study showed how using the Rasch model to estimate DIF may introduce considerable bias in the results when there are large group differences in ability (impact) and the data…
Descriptors: Test Bias, Guessing (Tests), Ability, Differences
Peer reviewed Peer reviewed
Direct linkDirect link
Sinharay, Sandip; Haberman, Shelby J.; Wainer, Howard – Educational and Psychological Measurement, 2011
There are several techniques that increase the precision of subscores by borrowing information from other parts of the test. These techniques have been criticized on validity grounds in several of the recent publications. In this note, the authors question the argument used in these publications and suggest both inherent limits to the validity…
Descriptors: Scores, Methods, Validity, Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Li, Xueming; Sireci, Stephen G. – Educational and Psychological Measurement, 2013
Validity evidence based on test content is of essential importance in educational testing. One source for such evidence is an alignment study, which helps evaluate the congruence between tested objectives and those specified in the curriculum. However, the results of an alignment study do not always sufficiently capture the degree to which a test…
Descriptors: Content Validity, Multidimensional Scaling, Data Analysis, Educational Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Reckase, Mark D.; Xu, Jing-Ru – Educational and Psychological Measurement, 2015
How to compute and report subscores for a test that was originally designed for reporting scores on a unidimensional scale has been a topic of interest in recent years. In the research reported here, we describe an application of multidimensional item response theory to identify a subscore structure in a test designed for reporting results using a…
Descriptors: English, Language Skills, English Language Learners, Scores
Peer reviewed Peer reviewed
Curtis, Connie June; And Others – Educational and Psychological Measurement, 1979
The score distributions of the two methods of administration described in the title revealed comparable means, standard deviations, and general shape of distribution. With respect to validity coefficients, no appreciable differences were found. (JKS)
Descriptors: Comparative Testing, Educational Testing, Eye Hand Coordination, Grade 2
Peer reviewed Peer reviewed
Miles, Edward W.; King, Wesley C., Jr. – Educational and Psychological Measurement, 1998
Whether gender and administration mode (computer versus pencil and paper) influenced mean scores on four noncognitive psychological instruments was studied with 874 undergraduates. Results show no statistically significant interaction between gender and administration mode, although statistically significant main effects were found for both gender…
Descriptors: Computer Assisted Testing, Educational Testing, Higher Education, Personality Assessment
Peer reviewed Peer reviewed
Stanley, Julian C. – Educational and Psychological Measurement, 1972
Descriptors: Educational Testing, Mathematical Applications, Statistical Analysis
Peer reviewed Peer reviewed
Dillon, Ronna F. – Educational and Psychological Measurement, 1979
The Raven Coloured Progressive Matrices and a Piagetian battery were administered to a sample of hearing-impaired elementary school children under six different conditions. Results indicated that scores varied as a function of the degree and type of feedback or elaboration. (JKS)
Descriptors: Cognitive Measurement, Developmental Stages, Educational Testing, Elementary Education
Peer reviewed Peer reviewed
Livingston, Samuel A. – Educational and Psychological Measurement, 1980
A specified minimum performance level can be translated into a minimum passing score for the written test by measuring the performance of students whose written test scores are near the desired cutoff score. Stochastic approximation methods accomplish this purpose. The up-and-down method and the Robbins-Monro process are compared. (Author/RL)
Descriptors: Cutting Scores, Educational Testing, Occupational Tests, Scoring Formulas
Peer reviewed Peer reviewed
Maisiak, Richard; And Others – Educational and Psychological Measurement, 1979
The Test Analysis Program (TAP) is a comprehensive, flexible computer system designed to score and to analyze objective educational tests. The goals of the designers were to construct a program which would be user-oriented, flexible, and clear in structure and in output. (Author/JKS)
Descriptors: Computer Programs, Educational Testing, Item Analysis, Objective Tests
Peer reviewed Peer reviewed
Reynolds, William M. – Educational and Psychological Measurement, 1979
This study determined if mildly mentally retarded secondary school students could respond to a verbally presented multiple-choice test of social and personal knowledge. Teacher ratings were also obtained. Results supported the use of two- and three-alternative multiple choice tests. (Author/JKS)
Descriptors: Adolescents, Behavior Rating Scales, Educational Testing, Feasibility Studies
Peer reviewed Peer reviewed
Jonson, Jessica L.; Plake, Barbara S. – Educational and Psychological Measurement, 1998
The relationship between the validity theory of the past 50 years and actual validity practices was studied by comparing published test standards with the practices of measurement professionals expressed in the "Mental Measurements Yearbook" test reviews. Results show a symbiotic relationship between theory and practice on the influence…
Descriptors: Educational Testing, Measurement Techniques, Standards, Test Use
Previous Page | Next Page ยป
Pages: 1  |  2