NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 11 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Shun-Fu Hu; Amery D. Wu; Jake Stone – Journal of Educational Measurement, 2025
Scoring high-dimensional assessments (e.g., > 15 traits) can be a challenging task. This paper introduces the multilabel neural network (MNN) as a scoring method for high-dimensional assessments. Additionally, it demonstrates how MNN can score the same test responses to maximize different performance metrics, such as accuracy, recall, or…
Descriptors: Tests, Testing, Scores, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Mislevy, Robert J. – Journal of Educational Measurement, 2016
Validity is the sine qua non of properties of educational assessment. While a theory of validity and a practical framework for validation has emerged over the past decades, most of the discussion has addressed familiar forms of assessment and psychological framings. Advances in digital technologies and in cognitive and social psychology have…
Descriptors: Test Validity, Technology, Cognitive Psychology, Social Psychology
Peer reviewed Peer reviewed
Ebel, Robert L. – Journal of Educational Measurement, 1982
Reasonable and practical solutions to two major problems confronting the developer of any test of educational achievement (what to measure and how to measure it) are proposed, defended, and defined. (Author/PN)
Descriptors: Measurement Techniques, Objective Tests, Test Construction, Test Items
Peer reviewed Peer reviewed
Wardrop, James L.; And Others – Journal of Educational Measurement, 1982
A structure for describing different approaches to testing is generated by identifying five dimensions along which tests differ: test uses, item generation, item revision, assessment of precision, and validation. These dimensions are used to profile tests of reading comprehension. Only norm-referenced achievement tests had an inference system…
Descriptors: Achievement Tests, Comparative Analysis, Educational Testing, Models
Peer reviewed Peer reviewed
Haertel, Edward; Calfee, Robert – Journal of Educational Measurement, 1983
The history of the relation between achievement tests and curriculum programs is reviewed, and it is concluded that content specialists are best qualified as sources of curricular goals to specify content, kinds of attainment, and standards. The specification of instructional intents is also considered from the perspective of modern cognition…
Descriptors: Achievement Tests, Cognitive Processes, Curriculum, Instructional Development
Peer reviewed Peer reviewed
Fitzpatrick, Anne R. – Journal of Educational Measurement, 1984
This article reviews the Basic Achievement Skills Individual Screener (BASIS), an individually administered achievement battery that consists of skills tests in reading, mathematics, and spelling as well as an optional writing exercise. BASIS is found to be an effective and efficient means of assessing basic skills. (Author/EGS)
Descriptors: Achievement Tests, Basic Skills, Screening Tests, Test Construction
Peer reviewed Peer reviewed
Sykes, Robert C.; Ito, Kyoko; Fitzpatrick, Anne R.; Ercikan, Kadriye – Journal of Educational Measurement, 1997
The five chapters of this report provide resources that deal with the validity, generalizability, comparability, performance standards, and fairness, equity, and bias of performance assessments. The book is written for experienced educational measurement practitioners, although an extensive familiarity with performance assessment is not required.…
Descriptors: Educational Assessment, Measurement Techniques, Performance Based Assessment, Standards
Peer reviewed Peer reviewed
Hardy, Roy A. – Journal of Educational Measurement, 1984
To determine to what extent competencies to be measured by the Alabama High School Graduation Examination were being taught in the Alabama public schools, a survey was conducted of teachers of grades 7, 8, 9, and 10. Competencies that were not being taught are identified and possible explanations are outlined. (Author/EGS)
Descriptors: Academic Standards, Competency Based Education, Evaluation Criteria, Graduation Requirements
Peer reviewed Peer reviewed
Linn, Robert L.; Hastings, C. Nicholas – Journal of Educational Measurement, 1984
Using predictive validity studies of the Law School Admissions Test (LSAT) and the undergraduate grade-point average (UGPA), this study examined the large variation in the magnitude of the validity coefficients across schools. LSAT standard deviation and correlation between LSAT and UGPA accounted for 58.5 percent of the variability. (Author/EGS)
Descriptors: Academic Achievement, College Applicants, College Entrance Examinations, Grade Point Average
Peer reviewed Peer reviewed
Vispoel, Walter P.; And Others – Journal of Educational Measurement, 1997
Efficiency, precision, and concurrent validity of results from adaptive and fixed-item music listening tests were studied using: (1) 2,200 simulated examinees; (2) 204 live examinees; and (3) 172 live examinees. Results support the usefulness of adaptive tests for measuring skills that require aurally produced items. (SLD)
Descriptors: Adaptive Testing, Adults, College Students, Comparative Analysis
Peer reviewed Peer reviewed
Bennett, Randy Elliot; Sebrechts, Marc M. – Journal of Educational Measurement, 1997
A computer-delivered problem-solving task based on cognitive research literature was developed and its validity for graduate admissions assessment was studied with 107 undergraduates. Use of the test, which asked examinees to sort word-problem stems by prototypes, was supported by the findings. (SLD)
Descriptors: Admission (School), College Entrance Examinations, Computer Assisted Testing, Graduate Study