NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 10 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Janssen, Gerriet – Language Testing, 2022
This article provides a single, common-case study of a test retrofit project at one Colombian university. It reports on how the test retrofit project was carried out and describes the different areas of language assessment literacy the project afforded local teacher stakeholders. This project was successful in that it modified the test constructs…
Descriptors: Language Tests, Placement Tests, Language Teachers, College Faculty
Wright, Benjamin; Panchapakesan, Nargis – Educ Psychol Meas, 1969
Descriptors: Item Analysis, Psychometrics, Test Validity, Testing Problems
Peer reviewed Peer reviewed
Levine, Joel H. – Psychometrika, 1979
Social and naturally occurring choice phenomena are often of the "pick any" type in which the number of choices made by a subject as well as the set of alternatives from which they are chosen is unconstrained. A model and scaling method for these data are introduced. (Author/JKS)
Descriptors: Data Analysis, Item Analysis, Mathematical Models, Multidimensional Scaling
Sarvela, Paul D.; Noonan, John V. – Educational Technology, 1988
Describes measurement problems associated with computer based testing (CBT) programs when they are part of a computer assisted instruction curriculum. Topics discussed include CBT standards; selection of item types; the contamination of items that arise from test design strategies; and the non-equivalence of comparison groups in item analyses. (8…
Descriptors: Computer Assisted Instruction, Computer Assisted Testing, Item Analysis, Psychometrics
Peer reviewed Peer reviewed
Lord, Frederic M. – Educational and Psychological Measurement, 1971
A number of empirical studies are suggested to answer certain questions in connection with flexilevel tests. (MS)
Descriptors: Comparative Analysis, Difficulty Level, Guessing (Tests), Item Analysis
Choppin, Bruce; And Others – 1982
A detailed description of five latent structure models of achievement measurement is presented. The first project paper, by David L. McArthur, analyzes the history of mental testing to show how conventional item analysis procedures were developed, and how dissatisfaction with them has led to fragmentation. The range of distinct conceptual and…
Descriptors: Academic Achievement, Achievement Tests, Comparative Analysis, Data Analysis
Drasgow, Fritz; Parsons, Charles K. – 1982
The effects of a multidimensional latent trait space on estimation of item and person parameters by the computer program LOGIST are examined. Several item pools were simulated that ranged from truly unidimensional to an inconsequential general latent trait. Item pools with intermediate levels of prepotency of the general latent trait were also…
Descriptors: Computer Simulation, Computer Software, Difficulty Level, Item Analysis
Kuntz, Patricia – 1982
The quality of mathematics multiple choice items and their susceptibility to test wiseness were examined. Test wiseness was defined as "a subject's capacity to utilize the characteristics and formats of the test and/or test taking situation to receive a high score." The study used results of the Graduate Record Examinations Aptitude Test (GRE) and…
Descriptors: Cues, Item Analysis, Multiple Choice Tests, Psychometrics
Peer reviewed Peer reviewed
Bernal, Ernesto M. – Hispanic Journal of Behavioral Sciences, 2000
Examines some problems of the Texas Assessment of Academic Skills (TAAS): multiple cutoff scores for passing the test and receiving a high school diploma, and artificially "tricky" items that disproportionately confuse language-minority students. Uses factor analysis and simulations to show ways to improve item selection for the TAAS.…
Descriptors: Black Students, Construct Validity, Elementary Secondary Education, Factor Analysis
Lance, Charles E.; Moomaw, Michael E. – 1983
Direct assessments of the accuracy with which raters can use a rating instrument are presented. This study demonstrated how surplus behavioral incidents scaled during the development of Behaviorally Anchored Rating Scales (BARS) can be used effectively in the evaluation of the newly developed scales. Construction of scenarios of hypothetical…
Descriptors: Behavior Rating Scales, Comparative Analysis, Error of Measurement, Evaluation Criteria