Showing all 11 results
Peer reviewed
Binici, Salih; Cuhadar, Ismail – Journal of Educational Measurement, 2022
Validity of performance standards is a key element for the defensibility of standard setting results, and validating performance standards requires collecting multiple pieces of evidence at every step during the standard setting process. This study employs a statistical procedure, latent class analysis, to set performance standards and compares…
Descriptors: Validity, Performance, Standards, Multivariate Analysis
Peer reviewed
Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025
Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…
Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment
Peer reviewed
Yaneva, Victoria; Clauser, Brian E.; Morales, Amy; Paniagua, Miguel – Journal of Educational Measurement, 2021
Eye-tracking technology can create a record of the location and duration of visual fixations as a test-taker reads test questions. Although the cognitive process the test-taker is using cannot be directly observed, eye-tracking data can support inferences about these unobserved cognitive processes. This type of information has the potential to…
Descriptors: Eye Movements, Test Validity, Multiple Choice Tests, Cognitive Processes
Peer reviewed
Shin, Hyo Jeong; Wilson, Mark; Choi, In-Hee – Journal of Educational Measurement, 2017
This study proposes a structured constructs model (SCM) to examine measurement in the context of a multidimensional learning progression (LP). The LP is assumed to have features that go beyond a typical multidimensional IRT model, in that there are hypothesized to be certain cross-dimensional linkages that correspond to requirements between the…
Descriptors: Middle School Students, Student Evaluation, Measurement Techniques, Learning Processes
Peer reviewed
Weitzman, R. A. – Journal of Educational Measurement, 1982
In a nonadversarial approach the predictive validities of the Scholastic Aptitude Test (SAT) and the high school record, the effects of the selection process on validities, and effects if colleges used a common standard of achievement were examined. Results indicate that the SAT may be a highly valid selection instrument. (Author/CM)
Descriptors: Academic Aspiration, College Admission, College Entrance Examinations, Grade Point Average
Peer reviewed
Hopkins, Kenneth D.; And Others – Journal of Educational Measurement, 1985
Forty-two fourth- and fifth-grade teachers rated their 1,032 students in the five curricular subjects: reading, mathematics, language arts, science, and social science. The teachers' ratings substantially agreed with students' scores on the Comprehensive Tests of Basic Skills, indicating the concurrent validity of standardized achievement tests.…
Descriptors: Academic Achievement, Achievement Tests, Elementary School Students, Elementary School Teachers
Peer reviewed
Emrick, John A. – Journal of Educational Measurement, 1971
Descriptors: Criterion Referenced Tests, Error of Measurement, Evaluation Methods, Item Analysis
Peer reviewed
Khan, Sar B. – Journal of Educational Measurement, 1978
Self-reports and teacher ratings were used with eight-year-old children in assessing their attitudes toward school, teacher, self, and independence. Results indicated teachers tend to rate unidimensionally and their ratings correlate more highly with achievement. (Author/JKS)
Descriptors: Academic Achievement, Elementary School Teachers, Foreign Countries, Primary Education
Peer reviewed
Shavelson, Richard J.; And Others – Journal of Educational Measurement, 1993
Evidence is presented on the generalizability and convergent validity of performance assessments using data from six studies of student achievement that sampled a wide range of measurement facets and methods. Results at individual and school levels indicate that task-sampling variability is the major source of measurement error. (SLD)
Descriptors: Academic Achievement, Educational Assessment, Error of Measurement, Generalizability Theory
Peer reviewed
Brookhart, Susan M. – Journal of Educational Measurement, 1993
Studied the meaning that classroom teachers associate with grades, their value judgments, and the role of measurement instruction in their decisions for 40 teachers with and 44 without measurement instruction. Grades are related to student work, and teachers make value judgments when assigning grades. Measurement instruction makes little…
Descriptors: Academic Achievement, Decision Making, Educational Attitudes, Educational Practices
Peer reviewed
Madaus, George F.; Rippey, Robert M. – Journal of Educational Measurement, 1966
The validity of the multiple-choice Sequential Tests of Educational Progress (STEP) Writing Test (1957) was tested by the University of Chicago Center for the Cooperative Study of Instruction. Seven criteria developed by the center to score essay assignments were used to determine the relationship between STEP and actual writing behavior. Of the…
Descriptors: Communication (Thought Transfer), Educational Testing, English Instruction, Evaluation Criteria