Sinharay, Sandip – Journal of Educational Measurement, 2023
Technical difficulties and other unforeseen events occasionally lead to incomplete data on educational tests, which necessitates the reporting of imputed scores to some examinees. While there exist several approaches for reporting imputed scores, there is a lack of any guidance on the reporting of the uncertainty of imputed scores. In this paper,…
Descriptors: Evaluation Methods, Scores, Standardized Tests, Simulation
Veldkamp, Bernard P. – Journal of Educational Measurement, 2016
Many standardized tests are now administered via computer rather than paper-and-pencil format. The computer-based delivery mode brings with it certain advantages. One advantage is the ability to adapt the difficulty level of the test to the ability level of the test taker in what has been termed computerized adaptive testing (CAT). A second…
Descriptors: Computer Assisted Testing, Reaction Time, Standardized Tests, Difficulty Level
Lu, Jing; Wang, Chun – Journal of Educational Measurement, 2020
Item nonresponses are prevalent in standardized testing. They happen either when students fail to reach the end of a test due to a time limit or quitting, or when students choose to omit some items strategically. Oftentimes, item nonresponses are nonrandom, and hence, the missing data mechanism needs to be properly modeled. In this paper, we…
Descriptors: Item Response Theory, Test Items, Standardized Tests, Responses
Armstrong, Ronald D.; Shi, Min – Journal of Educational Measurement, 2009
This article demonstrates the use of a new class of model-free cumulative sum (CUSUM) statistics to detect person fit given the responses to a linear test. The fundamental statistic being accumulated is the likelihood ratio of two probabilities. The detection performance of this CUSUM scheme is compared to other model-free person-fit statistics…
Descriptors: Probability, Simulation, Models, Psychometrics

Linn, Robert L. – Journal of Educational Measurement, 1983
Four considerations that enhance the instructional importance of tests are content match, use of feedback, a flagging function, and the increasing tendency to attach sanctions and rewards to standardized test results. These sanctions are apt to force greater attention to the other three characteristics, strengthening the links between instruction…
Descriptors: Instruction, Instructional Improvement, Measurement Objectives, Minimum Competency Testing

Angoff, William H.; Schrader, William B. – Journal of Educational Measurement, 1984
The reported data provide a basis for evaluating the formula-scoring versus rights-scoring issue and for assessing the effects of directions on the reliability and parallelism of scores for sophisticated examinees taking professionally developed tests. Results support the invariance hypothesis rather than the differential effects hypothesis.…
Descriptors: College Entrance Examinations, Guessing (Tests), Higher Education, Hypothesis Testing

Stiggins, Richard J.; Bridgeford, Nancy J. – Journal of Educational Measurement, 1985
The nature and quality of teacher-developed tests were studied in a national sample of 228 teachers representing four grade levels and several subjects. Teachers described their patterns of test use, concerns about assessment, and use of performance testing. Teacher-developed performance assessment was heavily used. (Author/GDC)
Descriptors: Achievement Tests, Criterion Referenced Tests, Educational Testing, Elementary Secondary Education

Budescu, David – Journal of Educational Measurement, 1985
An important determinant of equating process efficiency is the correlation between the anchor test and components of each form. Use of some monotonic function of this correlation as a measure of equating efficiency is suggested. A model relating anchor test length and test reliability to this measure of efficiency is presented. (Author/DWH)
Descriptors: Correlation, Equated Scores, Mathematical Models, Standardized Tests

Iwanicki, Edward F. – Journal of Educational Measurement, 1980
Five new test batteries are reviewed: California Achievement Tests, Iowa Tests of Basic Skills, Metropolitan Achievement Tests, SRA Achievement Series, and Sequential Tests of Educational Progress. The review covers six basic areas: test administration, norming, test scores, reporting and interpretation, aptitude test considerations, and general…
Descriptors: Achievement Tests, Aptitude Tests, Elementary Secondary Education, Scores

Qualls-Payne, Audrey L. – Journal of Educational Measurement, 1992
Six methods for estimating the standard error of measurement (SEM) at specific score levels are evaluated by comparing score-level SEM estimates from a single test administration with estimates from two test administrations, using Iowa Tests of Basic Skills data for 2,138 examinees. L. S. Feldt's method is preferred. (SLD)
Descriptors: Comparative Testing, Elementary Education, Elementary School Students, Error of Measurement

Hogan, Thomas P. – Journal of Educational Measurement, 1972
Study examined correlations between standard deviations of test scores and an index of within-community variability in income. (Author)
Descriptors: Achievement Tests, Community, Educational Testing, Grade 4

Coffman, William E. – Journal of Educational Measurement, 1990
Rather than an unbiased accumulation of evidence, the work argues the authors' position, which includes advocating the use of achievement tests in the college admissions process. Arguments against use of the Scholastic Aptitude Test (SAT) are primarily based on analyses of data from the National Longitudinal Study of 1972. (SLD)
Descriptors: Achievement Tests, Admission Criteria, Book Reviews, College Applicants

Conklin, Jonathan E.; And Others – Journal of Educational Measurement, 1979
Three methods are presented for interpolating fall norms from data derived for tests administered in spring. One method used the midpoint between two spring administrations, the second adjusted for date of fall testing, while the third method used experimental data that showed slower growth rates during the summer months. (CTM)
Descriptors: Educational Testing, Growth Patterns, Norm Referenced Tests, Norms

Frisbie, David A. – Journal of Educational Measurement, 1992
This guide for school administrators is written to promote careful and wise use of scores from standardized achievement tests. Authors of two sections particularly criticized in the review respond about what should be included in a primer on testing and interpreting test scores for compensatory education students. (SLD)
Descriptors: Achievement Tests, Administrator Role, Compensatory Education, Educational Assessment

Braun, Henry I.; And Others – Journal of Educational Measurement, 1990
The accuracy with which expert systems (ESs) score a new nonmultiple-choice free-response test item was investigated, using 734 high school students who were administered an advanced-placement computer science examination. ESs produced scores for 82 percent to 95 percent of the responses and displayed high agreement with a human reader on the…
Descriptors: Advanced Placement, Computer Assisted Testing, Computer Science, Constructed Response