NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20250
Since 20240
Since 2021 (last 5 years)0
Since 2016 (last 10 years)0
Since 2006 (last 20 years)1
Education Level
Audience
Researchers5
Location
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 1 to 15 of 22 results Save | Export
Rudner, Lawrence M.; Schafer, William D. – 2001
This digest discusses sources of error in testing, several approaches to estimating reliability, and several ways to increase test reliability. Reliability has been defined in different ways by different authors, but the best way to look at reliability may be the extent to which measurements resulting from a test are characteristics of those being…
Descriptors: Educational Testing, Error of Measurement, Reliability, Scores
Peer reviewed Peer reviewed
Feldt, Leonard S. – Applied Measurement in Education, 2002
Considers the situation in which content or administrative considerations limit the way in which a test can be partitioned to estimate the internal consistency reliability of the total test score. Demonstrates that a single-valued estimate of the total score reliability is possible only if an assumption is made about the comparative size of the…
Descriptors: Error of Measurement, Reliability, Scores, Test Construction
Alderman, Donald L. – 1981
This study applies a procedure which yields estimates of true score change on the Scholastic Aptitude Test (SAT) adjusted for regression effects and student self-selection. It is shown that student self-selection in deciding to repeat an admissions test probably involves factors in addition to the measurement error attributable to variations in…
Descriptors: College Entrance Examinations, Error of Measurement, Regression (Statistics), Scores
Peer reviewed Peer reviewed
Alderman, Donald L. – Educational and Psychological Measurement, 1981
Student self-selection in deciding to repeat a test was examined by contrasting the test performance of students taking the Scholastic Aptitude Test (SAT) as juniors and again as seniors with the test performance of students taking the SAT only once as juniors. Results suggest there is self-selection in test repetition. (Author/GK)
Descriptors: College Entrance Examinations, Comparative Analysis, Error of Measurement, Scores
Peer reviewed Peer reviewed
Zimmerman, Donald W.; And Others – Journal of Experimental Education, 1981
Reliability coefficients of linear combinations of observed scores have anomalous properties which have led to difficulties in the investigation of difference scores and gain scores in test theory. Discrepancies between classical results and correct results obtained from more general formulas, which allow for correlated errors, are examined…
Descriptors: Error of Measurement, Mathematical Formulas, Mathematical Models, Scores
Alderman, Donald L. – 1981
The test performance of students who took the Scholastic Aptitude Test (SAT) only once as juniors was contrasted with students who took the test as juniors and again as seniors. Estimates of expected test performance on a common initial administration in the junior year were derived from separate equating sections and background variables.…
Descriptors: Comparative Analysis, Error of Measurement, High School Students, High Schools
Peer reviewed Peer reviewed
Cahan, Sorel; Cohen, Nora – Educational and Psychological Measurement, 1990
A solution is offered to problems associated with the inequality in the manipulability of probabilities of classification errors of masters versus nonmasters, based on competency test results. Eschewing the typical arbitrary establishment of observed-score standards below 100 percent, the solution incorporates a self-correction of wrong answers.…
Descriptors: Classification, Error of Measurement, Mastery Tests, Minimum Competency Testing
Peer reviewed Peer reviewed
Ryan, Joseph J.; And Others – Journal of Consulting and Clinical Psychology, 1983
Wechsler Adult Intelligence Scale-Revised protocols from two vocational counseling clients were scored by 19 psychologists and 20 graduate students. Regardless of scorer's experience level, mechanical scoring error produced summary scores varying by as much as 4 to 18 IQ points. (Author/RC)
Descriptors: Error of Measurement, Graduate Students, Higher Education, Intelligence Tests
Peer reviewed Peer reviewed
Brown, Jonathan R. – Language, Speech, and Hearing Services in Schools, 1989
The importance of using the standard error of measurement (SEm) in determining reliability in test scores is emphasized. The SEm is compared to the hypothetical true score for standardized tests, and procedures for calculation of the SEm are explained. (JDD)
Descriptors: Elementary Secondary Education, Error of Measurement, Scores, Standardized Tests
Peer reviewed Peer reviewed
Woodruff, David – Journal of Educational Measurement, 1991
Improvements are made on previous estimates for the conditional standard error of measurement in prediction, the conditional standard error of estimation (CSEE), and the conditional standard error of prediction (CSEP). Better estimates of how test length affects CSEE and CSEP are derived. (SLD)
Descriptors: Equations (Mathematics), Error of Measurement, Estimation (Mathematics), Mathematical Models
Schell, Leo M. – 1981
Errors in oral reading tests result from inaccuracies that tend to creep in because children are not totally consistent while taking a test and from inaccuracies caused when the examiner does not catch a word recognition error, giving credit for an answer that is more wrong than right or vice versa. Every test contains a standard error of…
Descriptors: Error of Measurement, Informal Reading Inventories, Oral Reading, Reading Tests
Peer reviewed Peer reviewed
Harvill, Leo M. – Educational Measurement: Issues and Practice, 1991
This paper discusses standard error of measurement (SEM), the amount of variation or spread in the measurement errors for a test, and gives information needed to interpret test scores using SEMs. SEMs at various score levels should be used in calculating score bands rather than a single SEM value. (SLD)
Descriptors: Definitions, Equations (Mathematics), Error of Measurement, Estimation (Mathematics)
Peer reviewed Peer reviewed
Direct linkDirect link
Meyer, Kevin D.; Foster, Jeff L. – International Journal of Testing, 2008
With the increasing globalization of human resources practices, a commensurate increase in demand has occurred for multi-language ("global") personality norms for use in selection and development efforts. The combination of data from multiple translations of a personality assessment into a single norm engenders error from multiple sources. This…
Descriptors: Global Approach, Cultural Differences, Norms, Human Resources
Shale, Doug – 1986
This study is an attempt at a cohesive characterization of the concept of essay reliability. As such, it takes as a basic premise that previous and current practices in reporting reliability estimates for essay tests have certain shortcomings. The study provides an analysis of these shortcomings--partly to encourage a fuller understanding of the…
Descriptors: Analysis of Variance, Correlation, Error of Measurement, Essay Tests
Lord, Frederic M. – 1983
If a loss function is available specifying the social cost of an error of measurement in the score on a unidimensional test, an asymptotic method, based on item response theory, is developed for optimal test design for a specified target population of examinees. Since in the real world such loss functions are not available, it is more useful to…
Descriptors: Cutting Scores, Decision Making, Error of Measurement, Estimation (Mathematics)
Previous Page | Next Page ยป
Pages: 1  |  2