Jeff Allen; Ty Cruce – ACT Education Corp., 2025
This report summarizes some of the evidence supporting interpretations of scores from the enhanced ACT, focusing on reliability, concurrent validity, predictive validity, and score comparability. The authors argue that the evidence presented in this report supports the interpretation of scores from the enhanced ACT as measures of high school…
Descriptors: College Entrance Examinations, Testing, Change, Scores
Hsiao, Yu-Yu; Kwok, Oi-Man; Lai, Mark H. C. – Educational and Psychological Measurement, 2018
Path models with observed composites based on multiple items (e.g., mean or sum score of the items) are commonly used to test interaction effects. Under this practice, researchers generally assume that the observed composites are measured without errors. In this study, we reviewed and evaluated two alternative methods within the structural…
Descriptors: Error of Measurement, Testing, Scores, Models
Park, Ryoungsun; Kim, Jiseon; Chung, Hyewon; Dodd, Barbara G. – Educational and Psychological Measurement, 2017
The current study proposes novel methods to predict multistage testing (MST) performance without conducting simulations. This method, called MST test information, is based on analytic derivation of standard errors of ability estimates across theta levels. We compared standard errors derived analytically to the simulation results to demonstrate the…
Descriptors: Testing, Performance, Prediction, Error of Measurement
Zhang, Jinming; Li, Jie – Journal of Educational Measurement, 2016
An IRT-based sequential procedure is developed to monitor items for enhancing test security. The procedure uses a series of statistical hypothesis tests to examine whether the statistical characteristics of each item under inspection have changed significantly during CAT administration. This procedure is compared with a previously developed…
Descriptors: Computer Assisted Testing, Test Items, Difficulty Level, Item Response Theory
Puhan, Gautam – Journal of Educational Measurement, 2012
Tucker and chained linear equatings were evaluated in two testing scenarios. In Scenario 1, referred to as rater comparability scoring and equating, the anchor-to-total correlation is often very high for the new form but moderate for the reference form. This may adversely affect the results of Tucker equating, especially if the new and reference…
Descriptors: Testing, Scoring, Equated Scores, Statistical Analysis
Birnbaum, Michael H. – Psychological Review, 2011
This article contrasts 2 approaches to analyzing transitivity of preference and other behavioral properties in choice data. The approach of Regenwetter, Dana, and Davis-Stober (2011) assumes that on each choice, a decision maker samples randomly from a mixture of preference orders to determine whether "A" is preferred to "B." In contrast, Birnbaum…
Descriptors: Evidence, Testing, Computation, Probability

Wood, William D.; Strider, Mary Ann – Journal of Clinical Psychology, 1980
Developed an alternative method of administering Halstead's category test using answer sheet and latent imager developer. There was lessened possibility of examiner error in providing reinforcement and in recording responses. Performance on alternative and standard methods by 50 subjects was the same. (Author)
Descriptors: Comparative Analysis, Error of Measurement, Feedback, Measurement Techniques
Cummings, Oliver W. – Measurement and Evaluation in Guidance, 1981
Examined how changing responses affected the test performance of junior high school students. Results indicated that changing answers neither increases the reliability nor decreases the standard error of measurement of the test. (Author/RC)
Descriptors: Change, Comparative Analysis, Error of Measurement, Junior High Schools
Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008
With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…
Descriptors: Global Approach, Cultural Differences, Norms, Human Resources

Kirsch, Irwin S.; And Others – 1992
A comprehensive assessment of the literacy proficiencies of Job Training Partnership Act (JTPA) and Employment Service/Unemployment Insurance (ES/UI) participants was conducted by the Department of Labor. The survey responses of a sample of 2,501 JTPA applicants and 3,277 ES/UI participants were scored, weighted, analyzed, and used to develop a…
Descriptors: Adult Literacy, Comparative Analysis, Correlation, Data Collection