Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 5 |
Descriptor
Comparative Analysis | 11 |
Error of Measurement | 11 |
Raw Scores | 11 |
Equated Scores | 4 |
Mathematical Models | 4 |
Statistical Analysis | 4 |
Test Reliability | 4 |
Psychometrics | 3 |
Test Format | 3 |
College Entrance Examinations | 2 |
Elementary Education | 2 |
More ▼ |
Source
ETS Research Report Series | 2 |
Educational Measurement:… | 1 |
Journal of Educational… | 1 |
Thought & Action | 1 |
Author
Bashaw, W. L. | 2 |
Lee, Won-Chan | 2 |
Liu, Jinghua | 2 |
Rentz, R. Robert | 2 |
Ackerman, Terry A. | 1 |
Choi, Jiwon | 1 |
Curley, Edward | 1 |
Dorans, Neil | 1 |
Evans, John A. | 1 |
Francis, Richard W. | 1 |
Guo, Hongwen | 1 |
More ▼ |
Publication Type
Journal Articles | 5 |
Reports - Research | 4 |
Reports - Evaluative | 3 |
Reports - Descriptive | 2 |
Speeches/Meeting Papers | 2 |
Information Analyses | 1 |
Numerical/Quantitative Data | 1 |
Education Level
Higher Education | 3 |
Postsecondary Education | 3 |
High Schools | 1 |
Secondary Education | 1 |
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
SAT (College Admission Test) | 2 |
Praxis Series | 1 |
What Works Clearinghouse Rating
Lee, Won-Chan; Kim, Stella Y.; Choi, Jiwon; Kang, Yujin – Journal of Educational Measurement, 2020
This article considers psychometric properties of composite raw scores and transformed scale scores on mixed-format tests that consist of a mixture of multiple-choice and free-response items. Test scores on several mixed-format tests are evaluated with respect to conditional and overall standard errors of measurement, score reliability, and…
Descriptors: Raw Scores, Item Response Theory, Test Format, Multiple Choice Tests
Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011
This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…
Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores
Guo, Hongwen; Liu, Jinghua; Curley, Edward; Dorans, Neil – ETS Research Report Series, 2012
This study examines the stability of the "SAT Reasoning Test"™ score scales from 2005 to 2010. A 2005 old form (OF) was administered along with a 2010 new form (NF). A new conversion for OF was derived through direct equipercentile equating. A comparison of the newly derived and the original OF conversions showed that Critical Reading…
Descriptors: Aptitude Tests, Cognitive Tests, Thinking Skills, Equated Scores
Liu, Jinghua; Low, Albert C. – ETS Research Report Series, 2007
This study applied kernel equating (KE) in two scenarios: equating to a very similar population and equating to a very different population, referred to as a distant population, using SAT® data. The KE results were compared to the results obtained from analogous classical equating methods in both scenarios. The results indicate that KE results are…
Descriptors: College Entrance Examinations, Equated Scores, Comparative Analysis, Evaluation Methods
Schumacker, Randall E. – 1998
In comparing measurement theories, it is evident that the awareness of the concept of measurement error during the time of Galileo has lead to the formulation of observed scores comprising a true score and error (classical theory), universe score and various random error components (generalizability theory), or individual latent ability and error…
Descriptors: Comparative Analysis, Computer Software, Error of Measurement, Generalizability Theory
Francis, Richard W. – Thought & Action, 2006
The author has discovered that errors in grades often occur when scores are combined for final marks. These errors are not related to the grading individual assignments. Rather, they occur when teachers at all grade levels bring individual test and assignment scores together for the students' final grades. Unfortunately, professors of mathematics…
Descriptors: Error Patterns, Scores, Grades (Scholastic), Error Correction
Richard, James M., Jr. – 1979
This report examines the difference in approaches between sociologists and psychologists when using multiple regression techniques in the analysis of behavioral data. Psychologists and sociologists are often divided in their orientation toward regression techniques, and this division could be a substantial and unfortunate barrier to communication…
Descriptors: Comparative Analysis, Data Analysis, Differences, Error of Measurement
Ackerman, Terry A.; Evans, John A. – 1992
The relationship between levels of reliability and the power of two bias and differential item functioning (DIF) detection methods is examined. Both methods, the Mantel-Haenszel (MH) procedure of P. W. Holland and D. T. Thayer (1988) and the Simultaneous Item Bias (SIB) procedure of R. Shealy and W. Stout (1991), use examinees' raw scores as a…
Descriptors: Comparative Analysis, Equations (Mathematics), Error of Measurement, Item Bias
Marston, Paul T., Borich, Gary D. – 1977
The four main approaches to measuring treatment effects in schools; raw gain, residual gain, covariance, and true scores; were compared. A simulation study showed true score analysis produced a large number of Type-I errors. When corrected for this error, this method showed the least power of the four. This outcome was clearly the result of the…
Descriptors: Achievement Gains, Analysis of Covariance, Comparative Analysis, Error of Measurement
Rentz, R. Robert; Bashaw, W. L. – 1975
In order to determine if Rasch Model procedures have any utility for equating pre-existing tests, this study reanalyzed the data from the equating phase of the Anchor Test Study which used a variety of equipercentile and linear model methods. The tests involved included seven reading test batteries, each having from one to three levels and two…
Descriptors: Comparative Analysis, Elementary Education, Equated Scores, Error of Measurement
Rentz, R. Robert; Bashaw, W. L. – 1975
This volume contains tables of item analysis results obtained by following procedures associated with the Rasch Model for those reading tests used in the Anchor Test Study. Appendix I gives the test names and their corresponding analysis code numbers. Section I (Basic Item Analyses) presents data for the item analysis of each test in a two part…
Descriptors: Comparative Analysis, Elementary Education, Equated Scores, Error of Measurement